TIDE: Proactive Multi-Problem Discovery via Template-Guided Iteration • Paper • 2606.04743 • Published 15 days ago • 45
EEVEE: Towards Test-time Prompt Learning in the Real World for Self-Improving Agents • Paper • 2606.11182 • Published 9 days ago • 18
Lip Forcing: Few-Step Autoregressive Diffusion for Real-time Lip Synchronization • Paper • 2606.11180 • Published 9 days ago • 33
SkillAdaptor: Self-Adapting Skills for LLM Agents from Trajectories • Paper • 2606.01311 • Published 18 days ago • 36
K-BrowseComp: A Web Browsing Agent Benchmark Grounded in Korean Contexts • Paper • 2606.02404 • Published 17 days ago • 56
OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources • Paper • 2605.29250 • Published 21 days ago • 76
VGenST-Bench: A Benchmark for Spatio-Temporal Reasoning via Active Video Synthesis • Paper • 2605.22570 • Published 28 days ago • 24
OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories • Paper • 2605.04036 • Published May 5 • 70
Adam's Law: Textual Frequency Law on Large Language Models • Paper • 2604.02176 • Published Apr 2 • 507
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe • Paper • 2604.13016 • Published Apr 14 • 110
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability • Paper • 2604.06628 • Published Apr 8 • 327