Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
ShuoZheLi
/
rlvr_ppo_qwen2.5_0.5B_metamath_global_step_800
like
0
Safetensors
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
main
rlvr_ppo_qwen2.5_0.5B_metamath_global_step_800
14.7 GB
Ctrl+K
Ctrl+K
1 contributor
History:
2 commits
ShuoZheLi
Upload folder using huggingface_hub
e5059c3
verified
14 days ago
actor
Upload folder using huggingface_hub
14 days ago
critic
Upload folder using huggingface_hub
14 days ago
merged_hf
Upload folder using huggingface_hub
14 days ago
.gitattributes
1.79 kB
Upload folder using huggingface_hub
14 days ago
data.pt
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch.ByteStorage"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
7.32 kB
xet
Upload folder using huggingface_hub
14 days ago