1 9 23

Akopian Albert Surenovich

albertakn

AI & ML interests

None yet

Recent Activity

upvoted a collection 9 days ago

Nemotron-Post-Training-v3

liked a dataset 13 days ago

nvidia/Nemotron-SFT-Agentic-v2

liked a Space 24 days ago

HuggingFaceH4/on-policy-distillation

View all activity

Organizations

liked a dataset 13 days ago

nvidia/Nemotron-SFT-Agentic-v2

Preview • Updated Mar 11 • 29.3k • 31

liked a Space 24 days ago

Unlocking On-Policy Distillation for Any Model Family

📝

114

Explore on-policy distillation visualization for any model

liked 4 datasets about 2 months ago

liked a dataset 3 months ago

Team-ACE/ToolACE

Viewer • Updated Sep 4, 2024 • 11.3k • 10.7k • 182

liked 3 datasets 5 months ago

google/IFEval

Viewer • Updated Aug 14, 2024 • 541 • 95.8k • 151

allenai/Dolci-Think-RL-7B

Viewer • Updated Jan 5 • 102k • 494 • 16

allenai/Dolci-Instruct-RL

Viewer • Updated Jan 5 • 170k • 186 • 14

liked 3 Spaces 5 months ago

The Ultra-Scale Playbook

🌌

3.89k

The ultimate guide to training LLM on large GPU Clusters

Predict Memory

🧮

109

Estimate model memory usage and see detailed plots

FineWeb: decanting the web for the finest text data at scale

🍷

1.37k

Explore and download the FineWeb web‑scale text dataset

liked a dataset 6 months ago

allenai/IF_multi_constraints_upto5

Viewer • Updated Oct 2, 2025 • 95.4k • 859 • 25

liked 3 datasets 7 months ago

allenai/tulu-3-sft-mixture

Viewer • Updated Dec 2, 2024 • 939k • 20.7k • 251

zwhe99/DeepMath-103K

Viewer • Updated May 29, 2025 • 103k • 7.35k • 365

bigcode/starcoderdata

Viewer • Updated May 16, 2023 • 207M • 24.9k • 522

liked a model 7 months ago

unsloth/Qwen3-Next-80B-A3B-Instruct-bnb-4bit

Text Generation • Updated Sep 13, 2025 • 13.3k • 27

liked a Space 8 months ago

The Smol Training Playbook

📚

3.21k

The secrets to building world-class LLMs

liked a model 8 months ago

t-tech/T-pro-it-2.0

Text Generation • 33B • Updated Mar 31 • 650 • • 126