PEFT
Safetensors
grpo
trl
lora
vision-language-model
topological-reasoning
curvebench
AmirMohseni commited on
Commit
fa6814a
·
verified ·
1 Parent(s): 76748d7

Model save

Browse files
Files changed (3) hide show
  1. all_results.json +5 -5
  2. train_results.json +5 -5
  3. trainer_state.json +0 -0
all_results.json CHANGED
@@ -1,8 +1,8 @@
1
  {
2
  "total_flos": 0.0,
3
- "train_loss": 0.0005530451710296802,
4
- "train_runtime": 20175.9584,
5
- "train_samples": 114,
6
- "train_samples_per_second": 1.269,
7
- "train_steps_per_second": 0.02
8
  }
 
1
  {
2
  "total_flos": 0.0,
3
+ "train_loss": 1.7647783374741266e-06,
4
+ "train_runtime": 1084.2338,
5
+ "train_samples": 40,
6
+ "train_samples_per_second": 47.222,
7
+ "train_steps_per_second": 0.369
8
  }
train_results.json CHANGED
@@ -1,8 +1,8 @@
1
  {
2
  "total_flos": 0.0,
3
- "train_loss": 0.0005530451710296802,
4
- "train_runtime": 20175.9584,
5
- "train_samples": 114,
6
- "train_samples_per_second": 1.269,
7
- "train_steps_per_second": 0.02
8
  }
 
1
  {
2
  "total_flos": 0.0,
3
+ "train_loss": 1.7647783374741266e-06,
4
+ "train_runtime": 1084.2338,
5
+ "train_samples": 40,
6
+ "train_samples_per_second": 47.222,
7
+ "train_steps_per_second": 0.369
8
  }
trainer_state.json CHANGED
The diff for this file is too large to render. See raw diff