Multi-Turn Reflective Masking Elicits Reasoning in Mask Diffusion Models Paper • 2606.16700 • Published 10 days ago • 12
ArcANE: Do Role-Playing Language Agents Stay in Character at the Right Time? Paper • 2606.05553 • Published 21 days ago • 48
Sub-JEPA: Subspace Gaussian Regularization for Stable End-to-End World Models Paper • 2605.09241 • Published May 10 • 2
Strips as Tokens: Artist Mesh Generation with Native UV Segmentation Paper • 2604.09132 • Published Apr 10 • 56
Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning Paper • 2604.04746 • Published Apr 8 • 73
On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation Paper • 2603.22117 • Published Mar 23 • 29
LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning Paper • 2603.21065 • Published Mar 22 • 78
Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States Paper • 2603.19987 • Published Mar 20 • 9
Grounding World Simulation Models in a Real-World Metropolis Paper • 2603.15583 • Published Mar 16 • 155
On the Mechanism and Dynamics of Modular Addition: Fourier Features, Lottery Ticket, and Grokking Paper • 2602.16849 • Published Feb 18 • 7
QTIP Quantized Models Collection See https://github.com/Cornell-RelaxML/qtip • 27 items • Updated Mar 2 • 13