World Pilot: Steering Vision-Language-Action Models with World-Action Priors Paper • 2606.12403 • Published 6 days ago • 25
minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models Paper • 2605.30263 • Published 19 days ago • 58
Relit-LiVE: Relight Video by Jointly Learning Environment Video Paper • 2605.06658 • Published May 7 • 15
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors Paper • 2605.00658 • Published May 1 • 84
Repurposing Geometric Foundation Models for Multi-view Diffusion Paper • 2603.22275 • Published Mar 23 • 48