Instructions to use scrubster/dr-stein-colab-qwen3b-math-5k with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use scrubster/dr-stein-colab-qwen3b-math-5k with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-3B-Instruct") model = PeftModel.from_pretrained(base_model, "scrubster/dr-stein-colab-qwen3b-math-5k") - Notebooks
- Google Colab
- Kaggle
scrubster/dr-stein-colab-qwen3b-math-5k
PEFT/LoRA adapter for Qwen/Qwen2.5-3B-Instruct, fine-tuned on the
slm-learning GAD-tool translation track.
- Trained on: 153 hand-curated
(instruction, gad CLI command)pairs - Adapter kind: lora (r=16, alpha=32)
- Target modules: q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj
- Run name:
colab_qwen3b_math_5k - Compute:
colab-a100
Loading
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer
base = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-3B-Instruct")
tok = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-3B-Instruct")
model = PeftModel.from_pretrained(base, "scrubster/dr-stein-colab-qwen3b-math-5k")
Notes
The moonshot for this Colab session: 3B base + 5000 OpenMathInstruct-2 pairs. If math gains from base-size, this is where we'd see it. bs=4 grad_accum=4 (effective batch=16) fits comfortably on A100 40GB. Wall time estimate: 12 min train + ~5 min eval.
Inference path for 3B locally: needs int8 (3GB) or stays remote. Acceptable for the math/reasoning track since each invocation is rare (compared to CLI which is per-command).
- Downloads last month
- 2
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support