MSGEncrypted
/

minicpm5-1b-math-lora

@@ -10,13 +10,26 @@ tags:
   - math
 ---
-# minicpm5-1b-math-lora
-QLoRA math adapter for openbmb/MiniCPM5-1B, trained on meta-math/MetaMathQA with tatsu-lab/alpaca replay.
-## Benchmark comparison
-Evaluated with research/evals/configs/lm_eval_math.yaml on Modal using slm-lm-eval.
 | task | metric | baseline | candidate | delta |
 | --- | --- | ---: | ---: | ---: |
@@ -27,9 +40,11 @@ Evaluated with research/evals/configs/lm_eval_math.yaml on Modal using slm-lm-ev
 ## Training
-- train loss: -
 - eval loss: 0.494981
-- result score: -
 ## Load with PEFT
@@ -39,7 +54,11 @@ from transformers import AutoModelForCausalLM, AutoTokenizer
 base = "openbmb/MiniCPM5-1B"
 adapter = "MSGEncrypted/minicpm5-1b-math-lora"
 tokenizer = AutoTokenizer.from_pretrained(base, trust_remote_code=True)
-model = AutoModelForCausalLM.from_pretrained(base, torch_dtype="auto", device_map="auto", trust_remote_code=True)
 model = PeftModel.from_pretrained(model, adapter)
 ```

   - math
 ---
+# math-lora
+QLoRA adapter for **math**, fine-tuned from `openbmb/MiniCPM5-1B` on `meta-math/MetaMathQA` + `tatsu-lab/alpaca` (format: `mix`).
+Trained, evaluated, and gated on [Modal](https://modal.com/docs/guide) via `research/modal/` (app `slm-finetune-benchmark`).
+## Benchmark gate
+- eval profile: `math`
+- gate: **PASSED**
+| check | value | result |
+| --- | ---: | --- |
+| gsm8k >= 0.05 | 0.4000 | pass |
+| gsm8k improve >= 0.02 | 0.0700 | pass |
+| arc_challenge regress <= 0.03 | -0.0500 | pass |
+| hellaswag regress <= 0.03 | 0.0000 | pass |
+| piqa regress <= 0.03 | 0.0200 | pass |
+## lm-eval results
 | task | metric | baseline | candidate | delta |
 | --- | --- | ---: | ---: | ---: |
 ## Training
+- dataset: `/repo/research/data/education-lesson-chat.jsonl`
+- mode: `qlora`
+- samples: {'train': 3528, 'eval': 72}
+- final train loss: 0.340698
 - eval loss: 0.494981
 ## Load with PEFT
 base = "openbmb/MiniCPM5-1B"
 adapter = "MSGEncrypted/minicpm5-1b-math-lora"
 tokenizer = AutoTokenizer.from_pretrained(base, trust_remote_code=True)
+model = AutoModelForCausalLM.from_pretrained(
+    base, torch_dtype="auto", device_map="auto", trust_remote_code=True
+)
 model = PeftModel.from_pretrained(model, adapter)
 ```