---
license: apache-2.0
datasets:
- MegaScience/MegaScience
language:
- en
metrics:
- accuracy
base_model:
- Qwen/Qwen3-14B-Base
pipeline_tag: text-generation
---

# [MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning](https://arxiv.org/abs/2507.16812)

## Qwen3-14B-MegaScience

### Training Recipe

- **LR**: 5e-6
- **LR Schedule**: Cosine
- **Batch Size**: 512
- **Max Length**: 4,096
- **Warm Up Ratio**: 0.05
- **Epochs**: 3

### Evaluation Results