lex-interviewer-nemotron-4b-grpo-v12 / adapter /adapter_model.safetensors

Commit History

GRPO v12 LoRA adapter: score=0.760, uses_guest=60%, probing=96%
aff278c
verified

bobber commited on