lex-interviewer-nemotron-4b-grpo-v12 / onnx /lm_head_MatMul_weight_quant

Commit History

Add model_q4.onnx: GRPO v12 fine-tuned, 4-bit quantized for browser (~2.5GB)
93531f5
verified

bobber commited on