AI Paper Daily | 2026-03-27

今日概览

重点推荐 ⭐

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

作者: Xiangru Jian, Shravan Nayak, Kevin Qinghong Lin et al.
来源: huggingface (69 upvotes)
链接: arXiv | PDF
关键贡献: （需人工补充）
代码/权重: 待确认

📄 Abstract 中文翻译

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

作者: Jeonghye Kim, Xufang Luo, Minbeom Kim et al.
来源: huggingface (27 upvotes)
链接: arXiv | PDF
关键贡献: （需人工补充）
代码/权重: 待确认

📄 Abstract 中文翻译

UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience

作者: Zichuan Lin, Feiyu Liu, Yijun Yang et al.
来源: huggingface (29 upvotes)
链接: arXiv | PDF
关键贡献: （需人工补充）
代码/权重: 待确认

📄 Abstract 中文翻译

GameplayQA: A Benchmarking Framework for Decision-Dense POV-Synced Multi-Video Understanding of 3D Virtual Agents

作者: Yunzhe Wang, Runhui Xu, Kexin Zheng et al.
来源: huggingface (16 upvotes)
链接: arXiv | PDF
关键贡献: （需人工补充）
代码/权重: 待确认

📄 Abstract 中文翻译

🔊 Audio LLM

BioVITA: Biological Dataset, Model, and Benchmark for Visual-Textual-Acoustic Alignment

链接: arXiv | PDF
摘要: （暂无摘要）

📄 Abstract 中文翻译

YingMusic-Singer: Controllable Singing Voice Synthesis with Flexible Lyric Manipulation and Annotation-free Melody Guidance

链接: arXiv | PDF
摘要: （暂无摘要）

📄 Abstract 中文翻译

🤖 AI Agents

CarePilot: A Multi-Agent Framework for Long-Horizon Computer Task Automation in Healthcare

链接: arXiv | PDF
摘要: （暂无摘要）

📄 Abstract 中文翻译

Retrieval Improvements Do Not Guarantee Better Answers: A Study of RAG for AI Policy QA

链接: arXiv | PDF
摘要: （暂无摘要）

📄 Abstract 中文翻译

🔥 通用热门

OmniWeaving: Towards Unified Video Generation with Free-form Composition and Reasoning

链接: arXiv | PDF
摘要: （暂无摘要）

📄 Abstract 中文翻译

Marchuk: Efficient Global Weather Forecasting from Mid-Range to Sub-Seasonal Scales via Flow Matching

链接: arXiv | PDF
摘要: （暂无摘要）

📄 Abstract 中文翻译

Toward Physically Consistent Driving Video World Models under Challenging Trajectories

链接: arXiv | PDF
摘要: （暂无摘要）

📄 Abstract 中文翻译

📌 其他值得关注

Understanding the Challenges in Iterative Generative Optimization with LLMs

链接: arXiv | PDF
摘要: （暂无摘要）

📄 Abstract 中文翻译

SpectralSplats: Robust Differentiable Tracking via Spectral Moment Supervision

链接: arXiv | PDF
摘要: （暂无摘要）

📄 Abstract 中文翻译

GenMask: Adapting DiT for Segmentation via Direct Mask

链接: arXiv | PDF
摘要: （暂无摘要）

📄 Abstract 中文翻译

来自 HuggingFace 热门或 Semantic Scholar 的较早论文，虽超出严格时间窗口但仍值得关注。

Can LLM Agents Be CFOs? A Benchmark for Resource Allocation in Dynamic Enterprise Environments

链接: arXiv | PDF
摘要: （暂无摘要）

📄 Abstract 中文翻译

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

链接: arXiv | PDF
摘要: （暂无摘要）

📄 Abstract 中文翻译

EVA: Efficient Reinforcement Learning for End-to-End Video Agent

链接: arXiv | PDF
摘要: （暂无摘要）

📄 Abstract 中文翻译

Omni-WorldBench: Towards a Comprehensive Interaction-Centric Evaluation for World Models

链接: arXiv | PDF
摘要: （暂无摘要）

📄 Abstract 中文翻译

WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG

链接: arXiv | PDF
摘要: （暂无摘要）

📄 Abstract 中文翻译

生成时间：2026-03-27 00:05:24 UTC | 数据来源：arXiv、HuggingFace、Semantic Scholar

今日概览

重点推荐 ⭐

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience

GameplayQA: A Benchmarking Framework for Decision-Dense POV-Synced Multi-Video Understanding of 3D Virtual Agents

🔊 Audio LLM

BioVITA: Biological Dataset, Model, and Benchmark for Visual-Textual-Acoustic Alignment

YingMusic-Singer: Controllable Singing Voice Synthesis with Flexible Lyric Manipulation and Annotation-free Melody Guidance

🤖 AI Agents

CarePilot: A Multi-Agent Framework for Long-Horizon Computer Task Automation in Healthcare

Retrieval Improvements Do Not Guarantee Better Answers: A Study of RAG for AI Policy QA

🔥 通用热门

OmniWeaving: Towards Unified Video Generation with Free-form Composition and Reasoning

Marchuk: Efficient Global Weather Forecasting from Mid-Range to Sub-Seasonal Scales via Flow Matching

Toward Physically Consistent Driving Video World Models under Challenging Trajectories

📌 其他值得关注

Understanding the Challenges in Iterative Generative Optimization with LLMs

SpectralSplats: Robust Differentiable Tracking via Spectral Moment Supervision

GenMask: Adapting DiT for Segmentation via Direct Mask

🔥 Trending 补充（非 24-48h 但值得关注）

Can LLM Agents Be CFOs? A Benchmark for Resource Allocation in Dynamic Enterprise Environments

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

EVA: Efficient Reinforcement Learning for End-to-End Video Agent

Omni-WorldBench: Towards a Comprehensive Interaction-Centric Evaluation for World Models

WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG