RAVEN: Real-time Autoregressive Video Extrapolation with Consistency-model GRPO Paper ⢠2605.15190 ⢠Published May 14 ⢠13
Seedance 2.0: Advancing Video Generation for World Complexity Paper ⢠2604.14148 ⢠Published Apr 15 ⢠166
Spatia: Video Generation with Updatable Spatial Memory Paper ⢠2512.15716 ⢠Published Dec 17, 2025 ⢠35
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length Paper ⢠2512.04677 ⢠Published Dec 4, 2025 ⢠178
AnyTalker: Scaling Multi-Person Talking Video Generation with Interactivity Refinement Paper ⢠2511.23475 ⢠Published Nov 28, 2025 ⢠43
From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model Paper ⢠2510.19871 ⢠Published Oct 22, 2025 ⢠30
UniVideo: Unified Understanding, Generation, and Editing for Videos Paper ⢠2510.08377 ⢠Published Oct 9, 2025 ⢠81
Self-Forcing++: Towards Minute-Scale High-Quality Video Generation Paper ⢠2510.02283 ⢠Published Oct 2, 2025 ⢠98
Seedream 4.0: Toward Next-generation Multimodal Image Generation Paper ⢠2509.20427 ⢠Published Sep 24, 2025 ⢠85
Hyper-Bagel: A Unified Acceleration Framework for Multimodal Understanding and Generation Paper ⢠2509.18824 ⢠Published Sep 23, 2025 ⢠23
Cosmos-Preidct1 Collection ā ļø This collection is archived. š https://huggingface.co/collections/nvidia/cosmos3 ⢠14 items ⢠Updated 9 days ago ⢠304
Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis Paper ⢠2404.13686 ⢠Published Apr 21, 2024 ⢠29
Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis Paper ⢠2402.18078 ⢠Published Feb 28, 2024 ⢠2
ByteEdit: Boost, Comply and Accelerate Generative Image Editing Paper ⢠2404.04860 ⢠Published Apr 7, 2024 ⢠25
UniFL: Improve Stable Diffusion via Unified Feedback Learning Paper ⢠2404.05595 ⢠Published Apr 8, 2024 ⢠24