- Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization
  Paper • 2605.15980 • Published • 36
- NGRPO: Negative-enhanced Group Relative Policy Optimization
  Paper • 2509.18851 • Published • 2
- CEPO: RLVR Self-Distillation using Contrastive Evidence Policy Optimization
  Paper • 2605.19436 • Published • 14
- Delta Attention Residuals
  Paper • 2605.18855 • Published • 8

Collections
Collections including paper arxiv:2605.18643
- ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling
  Paper • 2603.25746 • Published • 155
- TAPS: Task Aware Proposal Distributions for Speculative Sampling
  Paper • 2603.27027 • Published • 144
- Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models
  Paper • 2603.25716 • Published • 156
- LongCat-Next: Lexicalizing Modalities as Discrete Tokens
  Paper • 2603.27538 • Published • 147

- Tongyi DeepResearch Technical Report
  Paper • 2510.24701 • Published • 104
- Kimi Linear: An Expressive, Efficient Attention Architecture
  Paper • 2510.26692 • Published • 133
- Natural-Language Agent Harnesses
  Paper • 2603.25723 • Published • 27
- CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery
  Paper • 2604.01658 • Published • 55

- EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters
  Paper • 2402.04252 • Published • 31
- Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models
  Paper • 2402.03749 • Published • 15
- ScreenAI: A Vision-Language Model for UI and Infographics Understanding
  Paper • 2402.04615 • Published • 45
- EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss
  Paper • 2402.05008 • Published • 24

- TsinghuaC3I/ZEDA-Qwen3-30B-A3B-Dynamic
  Text Generation • 31B • Updated • 39 • 1
- TsinghuaC3I/ZEDA-GLM-4.7-Flash-Dynamic
  Text Generation • 30B • Updated • 124 • 2
- TsinghuaC3I/ZEDA
  Preview • Updated • 120 • 1
- Post-Trained MoE Can Skip Half Experts via Self-Distillation
  Paper • 2605.18643 • Published • 30

- Kimi Linear: An Expressive, Efficient Attention Architecture
  Paper • 2510.26692 • Published • 133
- GLM-5: from Vibe Coding to Agentic Engineering
  Paper • 2602.15763 • Published • 164
- Believe Your Model: Distribution-Guided Confidence Calibration
  Paper • 2603.03872 • Published • 40
- OpenWorldLib: A Unified Codebase and Definition of Advanced World Models
  Paper • 2604.04707 • Published • 203

- Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free
  Paper • 2410.10814 • Published • 51
- Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment
  Paper • 2502.16894 • Published • 33
- Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs
  Paper • 2506.14731 • Published • 8
- SlimMoE: Structured Compression of Large MoE Models via Expert Slimming and Distillation
  Paper • 2506.18349 • Published • 13