Round 8 gap-fill artifact: figures/exp_main_table_3b_r8.md
Browse files
figures/exp_main_table_3b_r8.md
ADDED
|
@@ -0,0 +1,7 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
| Domain | Task | base_Y | mean_N16 | best_R6_N16 | best_R8_new_N16 | best_learned_N16 | oracle | gap_recovered | winner |
|
| 2 |
+
|---|---|---|---|---|---|---|---|---|---|
|
| 3 |
+
| math | gsm_hard | 0.063 | 0.066±0.005 | 0.066±0.005 (mean) | 0.070±0.007 (pertensor_pca) | 0.070±0.007 (pertensor_pca) | 0.150 | 0.077±0.077 | R8:pertensor_pca |
|
| 4 |
+
| math | gsm8k_test_500 | 0.080 | 0.102±0.002 | 0.102±0.002 (mean) | 0.106±0.002 (pertensor_pca) | 0.106±0.002 (pertensor_pca) | 0.293 | 0.120±0.009 | R8:pertensor_pca |
|
| 5 |
+
| code | mbpp_test_held | 0.230 | 0.240±0.000 | 0.257±0.006 (global_ridge) | 0.250±0.000 (pertensor_ridge) | 0.257±0.006 (global_ridge) | 0.320 | 0.296±0.064 | R6:global_ridge |
|
| 6 |
+
| code | mbpp_plus | 0.217 | 0.212±0.002 | 0.270±0.003 (global_ridge) | 0.266±0.002 (pertensor_ridge) | 0.270±0.003 (global_ridge) | 0.450 | 0.229±0.014 | R6:global_ridge |
|
| 7 |
+
| science | openbookqa_test | 0.710 | 0.754±0.002 | 0.754±0.002 (mean) | 0.756±0.019 (pertensor_pca) | 0.756±0.019 (pertensor_pca) | 0.983 | 0.167±0.070 | R8:pertensor_pca |
|