Running 184 The ultimate guide to RL environments: building and scaling them in the LLM era 📝 184 Building and scaling RL environments for LLM training
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 lysandre, ArthurZ, cyrilvallez, reach-vb • Dec 1, 2025 • 311
view article Article 20x Faster TRL Fine-tuning with RapidFire AI +1 kbigdelysh, arunkk09, qgallouedec • Nov 21, 2025 • 27
Reinforcement Learning for Reasoning in Large Language Models with One Training Example Paper • 2504.20571 • Published Apr 29, 2025 • 99
view reply Great blog post! Thanks for this amazing work! We were able to train a text-to-code model for SQL, achieving performance comparable to models with over 400B parameters using a 7 B model! Check out our Think2SQL paper here: https://huggingface.co/papers/2504.15077 Thank you again for the outstanding work!