SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer Paper • 2605.30409 • Published 19 days ago • 38
Cosmos3 Collection Omnimodal World Models for Physical AI • 15 items • Updated 4 days ago • 120
facebook/dinov3-vitl16-pretrain-sat493m Image Feature Extraction • 0.3B • Updated Aug 19, 2025 • 7.94k • 44
DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 15 items • Updated Mar 10 • 666
openai/whisper-large-v3-turbo Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 5.68M • • 3.09k