od2961/qwen2.5-Math-1.5b-drx-grpo-readmeflash-node302-seed43

Reinforcement Learning


Libraries

How to use od2961/qwen2.5-Math-1.5b-drx-grpo-readmeflash-node302-seed43 with Transformers:

# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("od2961/qwen2.5-Math-1.5b-drx-grpo-readmeflash-node302-seed43", dtype="auto")
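The file listing for this repository shows only `.gitattributes` and `README.md` at the top level, next to the `checkpoints`, `eval_results`, and `metadata` folders, so the weights may sit in a subfolder rather than at the repository root. The sketch below is one way to handle that, under stated assumptions: it lists the repository files, looks for folders that contain a `config.json`, and passes the last one in sorted order as `subfolder`. The helper names `find_checkpoint_dirs` and `load_latest_checkpoint` are hypothetical, the presence of `config.json` and tokenizer files inside a checkpoint folder is an assumption, and `AutoModelForCausalLM` is swapped in for `AutoModel` on the assumption that the checkpoint is a causal language model meant for text generation.

```python
from pathlib import PurePosixPath

REPO_ID = "od2961/qwen2.5-Math-1.5b-drx-grpo-readmeflash-node302-seed43"


def find_checkpoint_dirs(repo_files):
    """Return the sorted repo-relative folders that contain a config.json.

    A file at the repository root is reported as the folder ".".
    """
    dirs = {
        str(PurePosixPath(path).parent)
        for path in repo_files
        if PurePosixPath(path).name == "config.json"
    }
    return sorted(dirs)


def load_latest_checkpoint(repo_id=REPO_ID):
    """Load tokenizer and model from the last checkpoint folder in sorted order.

    Assumption: each checkpoint folder holds config.json, weights, and
    tokenizer files. Nothing on the repository page confirms this layout.
    """
    # Imported lazily so the pure helper above works without these packages.
    from huggingface_hub import list_repo_files
    from transformers import AutoModelForCausalLM, AutoTokenizer

    candidates = find_checkpoint_dirs(list_repo_files(repo_id))
    if not candidates:
        raise FileNotFoundError(f"no config.json found in {repo_id}")

    subfolder = candidates[-1]
    # Only pass `subfolder` when the files are not at the repository root.
    kwargs = {} if subfolder == "." else {"subfolder": subfolder}

    tokenizer = AutoTokenizer.from_pretrained(repo_id, **kwargs)
    # dtype="auto" mirrors the snippet above; older transformers releases
    # spell this argument torch_dtype.
    model = AutoModelForCausalLM.from_pretrained(repo_id, dtype="auto", **kwargs)
    return tokenizer, model


if __name__ == "__main__":
    tokenizer, model = load_latest_checkpoint()
    print(type(model).__name__)
```

Sorting works as a "latest" heuristic here only because step names such as `step_00176` are zero-padded; with unpadded step numbers the selection would need a numeric sort instead.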

1 contributor

History: 77 commits

Latest commit: Upload eval 176_olympiad_bench.json (78edb82, verified, 2 months ago)

checkpoints/: Upload step_00176, 2 months ago
eval_results/: Upload eval 176_olympiad_bench.json, 2 months ago
metadata/: Upload metadata for qwen2.5-Math-1.5b-r1-zero-drx-grpo-node302-seed43_0416T10:04:41, 2 months ago
.gitattributes (2.41 kB): Upload step_00176, 2 months ago
README.md (1.94 kB): Upload metadata for qwen2.5-Math-1.5b-r1-zero-drx-grpo-node302-seed43_0416T10:04:41, 2 months ago