Instructions to use Interplay-LM-Reasoning/extrapolation_rl with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Interplay-LM-Reasoning/extrapolation_rl with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("Interplay-LM-Reasoning/extrapolation_rl", dtype="auto") - Notebooks
- Google Colab
- Kaggle
metadata
license: mit
pipeline_tag: text-generation
On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models
Charlie Zhang, Graham Neubig, Xiang Yue
Carnegie Mellon University, Language Technologies Institute
This repository contains post-training related checkpoints in extrapolation tasks.
Code: GitHub Repository
๐ Citation
If you find this work or code useful, please consider citing:
@misc{zhang2025interplaypretrainingmidtrainingrl,
title={On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models},
author={Charlie Zhang and Graham Neubig and Xiang Yue},
year={2025},
eprint={2512.07783},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2512.07783},
}