PEFT
Safetensors
grpo
trl
lora
vision-language-model
topological-reasoning
curvebench
AmirMohseni committed on
Commit
87f528d
·
verified ·
1 Parent(s): 6c8cbf7

Add training reward plot

Files changed (2)
  1. .gitattributes +1 -0
  2. train_reward.png +3 -0
.gitattributes CHANGED
@@ -34,3 +34,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
 tokenizer.json filter=lfs diff=lfs merge=lfs -text
+train_reward.png filter=lfs diff=lfs merge=lfs -text
train_reward.png ADDED

Git LFS Details

  • SHA256: 3af7029280f05c56f709ffb164852cfe69f0557d1ee445b0b4d22967fbbcce89
  • Pointer size: 131 Bytes
  • Size of remote file: 496 kB
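
For context on the 131-byte pointer size, here is a minimal sketch of the Git LFS v1 pointer text that Git stores in place of a tracked file. It assumes the standard three-line pointer layout (version, oid, size). The function name `lfs_pointer` and the value `PLACEHOLDER_SIZE` are hypothetical: the page reports only a rounded "496 kB", so the exact byte count is not known and the placeholder merely has the right number of digits.

```python
# Sketch of the Git LFS v1 pointer layout; not taken from the repository itself.
OID = "3af7029280f05c56f709ffb164852cfe69f0557d1ee445b0b4d22967fbbcce89"


def lfs_pointer(oid: str, size: int) -> str:
    """Build the three-line pointer text stored in Git instead of the real file."""
    return (
        "version https://git-lfs.github.com/spec/v1\n"
        f"oid sha256:{oid}\n"
        f"size {size}\n"
    )


# Hypothetical value: the exact remote size is not given, only "496 kB".
PLACEHOLDER_SIZE = 496_000

pointer = lfs_pointer(OID, PLACEHOLDER_SIZE)
print(len(pointer.encode("utf-8")))  # 131 for any six-digit size
print(pointer.count("\n"))           # 3 lines
```

Under this layout, any six-digit byte count yields a 131-byte pointer, which agrees with the reported pointer size and a remote file of roughly 496 kB. The three pointer lines would also account for the "+3 -0" shown for train_reward.png in the file list.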