Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2511.04670

EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters

Paper • 2402.04252 • Published Feb 6, 2024 • 31
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models

Paper • 2402.03749 • Published Feb 6, 2024 • 15
ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Paper • 2402.04615 • Published Feb 7, 2024 • 45
EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss

Paper • 2402.05008 • Published Feb 7, 2024 • 24

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published Oct 30, 2025 • 116
Cambrian-S: Towards Spatial Supersensing in Video

Paper • 2511.04670 • Published Nov 6, 2025 • 39
MagicWorld: Interactive Geometry-driven Video World Exploration

Paper • 2511.18886 • Published Nov 24, 2025 • 19
WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling

Paper • 2512.14614 • Published Dec 16, 2025 • 73

Cambrian-S Models

Cambrian-S: Towards Spatial Supersensing in Video

Paper • 2511.04670 • Published Nov 6, 2025 • 39
nyu-visionx/Cambrian-S-7B-LFP

8B • Updated Nov 6, 2025 • 204 • 3
nyu-visionx/Cambrian-S-7B

Image-to-Text • 8B • Updated Nov 7, 2025 • 1.18k • 5
nyu-visionx/Cambrian-S-3B

Image-to-Text • 3B • Updated Nov 7, 2025 • 345 • 1

AgentConductor: Topology Evolution for Multi-Agent Competition-Level Code Generation

Paper • 2602.17100 • Published Feb 19 • 4
GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant

Paper • 2603.01059 • Published Mar 1 • 1
Multi-Domain Riemannian Graph Gluing for Building Graph Foundation Models

Paper • 2603.00618 • Published Feb 28
Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 197

Cambrian-S-Data

Data used during Cambrian-S's 4-stage training

Cambrian-S: Towards Spatial Supersensing in Video

Paper • 2511.04670 • Published Nov 6, 2025 • 39
nyu-visionx/VSI-590K

Preview • Updated Nov 7, 2025 • 2.33k • 22
nyu-visionx/VSI-Train-10k

Viewer • Updated Nov 7, 2025 • 10k • 321 • 4
nyu-visionx/Cambrian-S-3M

Updated Jan 22 • 71.7k • 6

EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters

Paper • 2402.04252 • Published Feb 6, 2024 • 31
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models

Paper • 2402.03749 • Published Feb 6, 2024 • 15
ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Paper • 2402.04615 • Published Feb 7, 2024 • 45
EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss

Paper • 2402.05008 • Published Feb 7, 2024 • 24

AgentConductor: Topology Evolution for Multi-Agent Competition-Level Code Generation

Paper • 2602.17100 • Published Feb 19 • 4
GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant

Paper • 2603.01059 • Published Mar 1 • 1
Multi-Domain Riemannian Graph Gluing for Building Graph Foundation Models

Paper • 2603.00618 • Published Feb 28
Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 197

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published Oct 30, 2025 • 116
Cambrian-S: Towards Spatial Supersensing in Video

Paper • 2511.04670 • Published Nov 6, 2025 • 39
MagicWorld: Interactive Geometry-driven Video World Exploration

Paper • 2511.18886 • Published Nov 24, 2025 • 19
WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling

Paper • 2512.14614 • Published Dec 16, 2025 • 73

Cambrian-S-Data

Data used during Cambrian-S's 4-stage training

Cambrian-S: Towards Spatial Supersensing in Video

Paper • 2511.04670 • Published Nov 6, 2025 • 39
nyu-visionx/VSI-590K

Preview • Updated Nov 7, 2025 • 2.33k • 22
nyu-visionx/VSI-Train-10k

Viewer • Updated Nov 7, 2025 • 10k • 321 • 4
nyu-visionx/Cambrian-S-3M

Updated Jan 22 • 71.7k • 6

Cambrian-S Models

Cambrian-S: Towards Spatial Supersensing in Video

Paper • 2511.04670 • Published Nov 6, 2025 • 39
nyu-visionx/Cambrian-S-7B-LFP

8B • Updated Nov 6, 2025 • 204 • 3
nyu-visionx/Cambrian-S-7B

Image-to-Text • 8B • Updated Nov 7, 2025 • 1.18k • 5
nyu-visionx/Cambrian-S-3B

Image-to-Text • 3B • Updated Nov 7, 2025 • 345 • 1

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs