Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer Paper • 2605.30940 • Published 23 days ago • 37
ThriftAttention: Selective Mixed Precision for Long-Context FP4 Attention Paper • 2605.23081 • Published about 1 month ago • 41
Macaron-A2UI: A Model for Generative UI in Personal Agents Paper • 2605.24830 • Published 28 days ago • 82
Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining Paper • 2605.14747 • Published May 14 • 146
CogOmniControl: Reasoning-Driven Controllable Video Generation via Creative Intent Cognition Paper • 2605.19995 • Published May 19 • 34
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information Paper • 2605.11609 • Published May 12 • 196
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence Paper • 2605.12882 • Published May 13 • 273
F-GRPO: Factorized Group-Relative Policy Optimization for Unified Candidate Generation and Ranking Paper • 2605.12995 • Published May 13 • 2
Crowded in B-Space: Calibrating Shared Directions for LoRA Merging Paper • 2604.16826 • Published Apr 18 • 18