Mix-Quant: Quantized Prefilling, Precise Decoding for Agentic LLMs Paper • 2605.20315 • Published May 19 • 28
Q-ARVD: Quantizing Autoregressive Video Diffusion Models Paper • 2605.21072 • Published May 20 • 21 • 2
ConciseHint: Boosting Efficient Reasoning via Continuous Concise Hints during Generation Paper • 2506.18810 • Published Jun 23, 2025 • 2
ConciseHint: Boosting Efficient Reasoning via Continuous Concise Hints during Generation Paper • 2506.18810 • Published Jun 23, 2025 • 2