Create analyze_soup_output.txt
Browse files- analyze_soup_output.txt +192 -0
analyze_soup_output.txt
ADDED
|
@@ -0,0 +1,192 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
=================================================================
|
| 2 |
+
BASE TIER SOUP ANALYSIS
|
| 3 |
+
Device: cuda
|
| 4 |
+
=================================================================
|
| 5 |
+
|
| 6 |
+
Loading checkpoint...
|
| 7 |
+
Loaded: mAP=0.825 cv=0.3117 epoch=19
|
| 8 |
+
clip_l14_openai loaded
|
| 9 |
+
dinov2_b14 loaded
|
| 10 |
+
siglip_b16_384 loaded
|
| 11 |
+
|
| 12 |
+
Running inference on 5000 val images...
|
| 13 |
+
Done: fused=torch.Size([5000, 128]) tri=torch.Size([5000, 256])
|
| 14 |
+
|
| 15 |
+
=================================================================
|
| 16 |
+
SCAN 1: ANCHOR GEOMETRY
|
| 17 |
+
=================================================================
|
| 18 |
+
Anchor pairwise cosine:
|
| 19 |
+
mean=0.0356 std=0.1896
|
| 20 |
+
max=0.9542 min=-0.9093
|
| 21 |
+
Max neighbor cosine per anchor:
|
| 22 |
+
mean=0.6949 std=0.2730
|
| 23 |
+
max=0.9542 min=0.0639
|
| 24 |
+
Anchor norms: mean=1.000000 std=0.000000
|
| 25 |
+
Anchor spectral: eff_rank=65.7/128
|
| 26 |
+
sv_max=5.0231 sv_10=2.8655 sv_50=0.9697 sv_min=0.125017
|
| 27 |
+
Anchor pentachoron CV: 0.2478
|
| 28 |
+
mean_vol=0.074751 std_vol=0.018524
|
| 29 |
+
|
| 30 |
+
=================================================================
|
| 31 |
+
SCAN 2: ANCHOR UTILIZATION
|
| 32 |
+
=================================================================
|
| 33 |
+
Active anchors: 1/256 (0.4%)
|
| 34 |
+
Visit counts: mean=19.5 std=312.5
|
| 35 |
+
max=5000 min=0
|
| 36 |
+
top 10: [5000, 0, 0, 0, 0, 0, 0, 0, 0, 0]
|
| 37 |
+
bottom 10: [0, 0, 0, 0, 0, 0, 0, 0, 0, 0]
|
| 38 |
+
Anchor entropy: -0.0000 / 5.5452 (-0.0%)
|
| 39 |
+
|
| 40 |
+
Per-anchor embedding density:
|
| 41 |
+
Intra-cluster cosine: mean=0.9693 std=0.0000
|
| 42 |
+
|
| 43 |
+
=================================================================
|
| 44 |
+
SCAN 3: PROJECTOR ANALYSIS
|
| 45 |
+
=================================================================
|
| 46 |
+
|
| 47 |
+
clip_l14:
|
| 48 |
+
norm: mean=1.000000 (should be 1.0)
|
| 49 |
+
self-sim off-diag: 0.9668
|
| 50 |
+
eff_dim: 24.1/128
|
| 51 |
+
|
| 52 |
+
dinov2_b14:
|
| 53 |
+
norm: mean=1.000000 (should be 1.0)
|
| 54 |
+
self-sim off-diag: 0.9678
|
| 55 |
+
eff_dim: 25.3/128
|
| 56 |
+
|
| 57 |
+
siglip_b16:
|
| 58 |
+
norm: mean=1.000000 (should be 1.0)
|
| 59 |
+
self-sim off-diag: 0.9501
|
| 60 |
+
eff_dim: 23.8/128
|
| 61 |
+
|
| 62 |
+
Expert agreement (cosine in 128-d):
|
| 63 |
+
clip_l14 × dinov2_b14 : mean=0.9898 std=0.0066 min=0.8730
|
| 64 |
+
clip_l14 × siglip_b16 : mean=0.9893 std=0.0052 min=0.9307
|
| 65 |
+
dinov2_b14 × siglip_b16 : mean=0.9855 std=0.0081 min=0.8920
|
| 66 |
+
|
| 67 |
+
Per-expert nearest anchor agreement:
|
| 68 |
+
clip_l14 × dinov2_b14 : same_anchor=1.0000 (100.0%)
|
| 69 |
+
clip_l14 × siglip_b16 : same_anchor=1.0000 (100.0%)
|
| 70 |
+
dinov2_b14 × siglip_b16 : same_anchor=1.0000 (100.0%)
|
| 71 |
+
|
| 72 |
+
Projector weight comparison:
|
| 73 |
+
clip_l14 : norm=37.4114 eff_rank=30.5/128
|
| 74 |
+
dinov2_b14 : norm=36.3149 eff_rank=23.0/128
|
| 75 |
+
siglip_b16 : norm=39.2079 eff_rank=29.0/128
|
| 76 |
+
clip_l14 × dinov2_b14 weight_cos=0.0046
|
| 77 |
+
clip_l14 × siglip_b16 weight_cos=-0.0049
|
| 78 |
+
dinov2_b14 × siglip_b16 weight_cos=-0.0055
|
| 79 |
+
|
| 80 |
+
=================================================================
|
| 81 |
+
SCAN 4: PATCHWORK COMPARTMENTS
|
| 82 |
+
=================================================================
|
| 83 |
+
Comp 0: 32 anchors
|
| 84 |
+
Comp 1: 32 anchors
|
| 85 |
+
Comp 2: 32 anchors
|
| 86 |
+
Comp 3: 32 anchors
|
| 87 |
+
Comp 4: 32 anchors
|
| 88 |
+
Comp 5: 32 anchors
|
| 89 |
+
Comp 6: 32 anchors
|
| 90 |
+
Comp 7: 32 anchors
|
| 91 |
+
|
| 92 |
+
Patchwork output: torch.Size([5000, 512])
|
| 93 |
+
norm: mean=11.6381 std=0.5046
|
| 94 |
+
comp 0: norm=2.8604 std_across_dims=0.0010
|
| 95 |
+
comp 1: norm=3.7652 std_across_dims=0.1596
|
| 96 |
+
comp 2: norm=2.3303 std_across_dims=0.0057
|
| 97 |
+
comp 3: norm=3.4802 std_across_dims=0.2053
|
| 98 |
+
comp 4: norm=3.3465 std_across_dims=0.1143
|
| 99 |
+
comp 5: norm=5.9651 std_across_dims=0.4720
|
| 100 |
+
comp 6: norm=6.0775 std_across_dims=0.4946
|
| 101 |
+
comp 7: norm=3.2188 std_across_dims=0.0718
|
| 102 |
+
|
| 103 |
+
=================================================================
|
| 104 |
+
SCAN 5: TRIANGULATION PATTERNS
|
| 105 |
+
=================================================================
|
| 106 |
+
Triangulation distances (1-cosine):
|
| 107 |
+
mean=0.8988 std=0.1301
|
| 108 |
+
min=0.0038 max=1.2538
|
| 109 |
+
Nearest anchor distance:
|
| 110 |
+
mean=0.0156 std=0.0042
|
| 111 |
+
max=0.0419 min=0.0038
|
| 112 |
+
Anchors within cos>0.5 per image:
|
| 113 |
+
mean=1.0 std=0.0
|
| 114 |
+
Top-10 nearest anchor distances:
|
| 115 |
+
k=0: mean=0.0156 std=0.0042
|
| 116 |
+
k=1: mean=0.6646 std=0.0218
|
| 117 |
+
k=2: mean=0.6806 std=0.0185
|
| 118 |
+
k=3: mean=0.6909 std=0.0173
|
| 119 |
+
k=4: mean=0.6977 std=0.0167
|
| 120 |
+
k=5: mean=0.7033 std=0.0162
|
| 121 |
+
k=6: mean=0.7081 std=0.0158
|
| 122 |
+
k=7: mean=0.7126 std=0.0154
|
| 123 |
+
k=8: mean=0.7166 std=0.0150
|
| 124 |
+
k=9: mean=0.7204 std=0.0147
|
| 125 |
+
|
| 126 |
+
=================================================================
|
| 127 |
+
SCAN 6: PER-CLASS ANCHOR AFFINITY
|
| 128 |
+
=================================================================
|
| 129 |
+
|
| 130 |
+
Top-3 anchors per class (first 20 classes):
|
| 131 |
+
person (n=2693): 65(2693/2693) 1(0/2693) 0(0/2693)
|
| 132 |
+
bicycle (n= 149): 65(149/149) 1(0/149) 0(0/149)
|
| 133 |
+
car (n= 535): 65(535/535) 1(0/535) 0(0/535)
|
| 134 |
+
motorcycle (n= 159): 65(159/159) 1(0/159) 0(0/159)
|
| 135 |
+
airplane (n= 97): 65(97/97) 1(0/97) 0(0/97)
|
| 136 |
+
bus (n= 189): 65(189/189) 1(0/189) 0(0/189)
|
| 137 |
+
train (n= 157): 65(157/157) 1(0/157) 0(0/157)
|
| 138 |
+
truck (n= 250): 65(250/250) 1(0/250) 0(0/250)
|
| 139 |
+
boat (n= 121): 65(121/121) 1(0/121) 0(0/121)
|
| 140 |
+
traffic light (n= 191): 65(191/191) 1(0/191) 0(0/191)
|
| 141 |
+
fire hydrant (n= 86): 65(86/86) 1(0/86) 0(0/86)
|
| 142 |
+
stop sign (n= 69): 65(69/69) 1(0/69) 0(0/69)
|
| 143 |
+
parking meter (n= 37): 65(37/37) 1(0/37) 0(0/37)
|
| 144 |
+
bench (n= 235): 65(235/235) 1(0/235) 0(0/235)
|
| 145 |
+
bird (n= 125): 65(125/125) 1(0/125) 0(0/125)
|
| 146 |
+
cat (n= 184): 65(184/184) 1(0/184) 0(0/184)
|
| 147 |
+
dog (n= 177): 65(177/177) 1(0/177) 0(0/177)
|
| 148 |
+
horse (n= 128): 65(128/128) 1(0/128) 0(0/128)
|
| 149 |
+
sheep (n= 65): 65(65/65) 1(0/65) 0(0/65)
|
| 150 |
+
cow (n= 87): 65(87/87) 1(0/87) 0(0/87)
|
| 151 |
+
|
| 152 |
+
Anchor specialization:
|
| 153 |
+
classes per anchor: mean=80.0 std=nan
|
| 154 |
+
max=80 min=80
|
| 155 |
+
|
| 156 |
+
=================================================================
|
| 157 |
+
SCAN 7: FUSED EMBEDDING GEOMETRY
|
| 158 |
+
=================================================================
|
| 159 |
+
Norms: mean=1.000000 std=0.000000
|
| 160 |
+
Self-sim (off-diag): 0.9693
|
| 161 |
+
Effective dim: 23.6/128
|
| 162 |
+
top-5 SVs explain 44.1%
|
| 163 |
+
top-10 SVs explain 70.2%
|
| 164 |
+
top-20 SVs explain 99.2%
|
| 165 |
+
top-50 SVs explain 100.0%
|
| 166 |
+
top-100 SVs explain 100.0%
|
| 167 |
+
Pentachoron CV: 0.3529
|
| 168 |
+
|
| 169 |
+
=================================================================
|
| 170 |
+
SCAN 8: EXPERT CONTRIBUTION
|
| 171 |
+
=================================================================
|
| 172 |
+
clip_l14 : cos_to_fused mean=0.9970 std=0.0016
|
| 173 |
+
dinov2_b14 : cos_to_fused mean=0.9957 std=0.0027
|
| 174 |
+
siglip_b16 : cos_to_fused mean=0.9955 std=0.0023
|
| 175 |
+
Without clip_l14 : cos_to_full=0.9992 (uniqueness=0.0008)
|
| 176 |
+
Without dinov2_b14 : cos_to_full=0.9989 (uniqueness=0.0011)
|
| 177 |
+
Without siglip_b16 : cos_to_full=0.9989 (uniqueness=0.0011)
|
| 178 |
+
|
| 179 |
+
Per-image expert disagreement:
|
| 180 |
+
Agreement: mean=0.9882 std=0.0055
|
| 181 |
+
Disagreement: mean=0.0041 std=0.0034
|
| 182 |
+
|
| 183 |
+
Most agreed image (1449): agreement=0.9978
|
| 184 |
+
labels: [22]
|
| 185 |
+
Most disagreed image (1435): agreement=0.9214
|
| 186 |
+
labels: [28]
|
| 187 |
+
|
| 188 |
+
=================================================================
|
| 189 |
+
ANALYSIS COMPLETE
|
| 190 |
+
=================================================================
|
| 191 |
+
/tmp/ipykernel_10600/3734699858.py:410: UserWarning: std(): degrees of freedom is <= 0. Correction should be strictly less than the reduction factor (input numel divided by output numel). (Triggered internally at /pytorch/aten/src/ATen/native/ReduceOps.cpp:1857.)
|
| 192 |
+
f"std={anchor_class_count[anchor_class_count>0].std():.1f}")
|