GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents Paper • 2604.26752 • Published Apr 29 • 112
view article Article Profiling in PyTorch (Part 2): From nn.Linear to a Fused MLP +3 ariG23498, ror, sergiopaniego, pcuenq, sayakpaul • 10 days ago • 43
view article Article Unlocking asynchronicity in continuous batching +1 ror, pcuenq, ariG23498 • May 14 • 60
view article Article KV Cache from scratch in nanoVLM +3 ariG23498, kashif, lusxvr, andito, pcuenq • Jun 4, 2025 • 120
view article Article Continuous batching from first principles +1 ror, ArthurZ, mcpotato • Nov 25, 2025 • 408
view article Article KV Caching Explained: Optimizing Transformer Inference Efficiency not-lain • Jan 30, 2025 • 351
view article Article Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler +3 ariG23498, sayakpaul, sergiopaniego, ror, pcuenq • 23 days ago • 121
Efficient Memory Management for Large Language Model Serving with PagedAttention Paper • 2309.06180 • Published Sep 12, 2023 • 59
Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention Paper • 2605.22791 • Published May 21 • 33
MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published May 4 • 354
view article Article Building Autonomous Vehicles That Reason with the NVIDIA Alpamayo Open Ecosystem drmapavone • Jan 5 • 26
Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3. • 50 items • Updated 9 days ago • 158
view article Article Code Concepts: A Large-Scale Synthetic Dataset Generated from Programming Concept Seeds nvidia • Mar 11 • 6
view article Article LeRobot v0.5.0: Scaling Every Dimension +8 imstevenpmwork, pepijn223, jadechoghari, CarolinePascal, lilkm, nepyope, Nico-robot, aractingi, VirgileBatto, thomwolf • Mar 9 • 44
VLM4VLA: Revisiting Vision-Language-Models in Vision-Language-Action Models Paper • 2601.03309 • Published Jan 6 • 2
VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model Paper • 2509.09372 • Published Sep 11, 2025 • 256
SWE-smith: Scaling Data for Software Engineering Agents Paper • 2504.21798 • Published Apr 30, 2025 • 15