PEFT
Safetensors
grpo
trl
lora
vision-language-model
topological-reasoning
curvebench
AmirMohseni commited on
Commit
03b793e
·
verified ·
1 Parent(s): 87f528d

Add eval reward plot

Browse files
Files changed (2) hide show
  1. .gitattributes +1 -0
  2. eval_reward.png +3 -0
.gitattributes CHANGED
@@ -35,3 +35,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
  tokenizer.json filter=lfs diff=lfs merge=lfs -text
37
  train_reward.png filter=lfs diff=lfs merge=lfs -text
 
 
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
  tokenizer.json filter=lfs diff=lfs merge=lfs -text
37
  train_reward.png filter=lfs diff=lfs merge=lfs -text
38
+ eval_reward.png filter=lfs diff=lfs merge=lfs -text
eval_reward.png ADDED

Git LFS Details

  • SHA256: b1028b1ee0b7fc15d4402f255212405db77f5f449b01cd5669aba92075524c18
  • Pointer size: 131 Bytes
  • Size of remote file: 411 kB