Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

ShuoZheLi
/

rlvr_ppo_qwen2.5_0.5B_metamath_global_step_800

Model card Files Files and versions

rlvr_ppo_qwen2.5_0.5B_metamath_global_step_800

14.7 GB

Ctrl+K

Ctrl+K

1 contributor

History: 2 commits

ShuoZheLi's picture

Upload folder using huggingface_hub

e5059c3 verified 14 days ago

actor
Upload folder using huggingface_hub 14 days ago
critic
Upload folder using huggingface_hub 14 days ago
merged_hf
Upload folder using huggingface_hub 14 days ago
.gitattributes

1.79 kB
Upload folder using huggingface_hub 14 days ago
data.pt
Detected Pickle imports (3)
- "collections.OrderedDict",
- "torch.ByteStorage",
- "torch._utils._rebuild_tensor_v2"
What is a pickle import?
7.32 kB
xet

Upload folder using huggingface_hub 14 days ago