lex-interviewer-nemotron-4b-grpo-v12 / generation_config.json

Commit History

fix: add token 11 (im_end) to eos_token_id — stops generation at end of turn
50ce4f4
verified

bobber commited on

Add merged weights: GRPO v12 (score=0.760)
1a96a52
verified

bobber commited on