geolip-vit-base-x3 / run_1-collapse /analyze_soup_output.txt
AbstractPhil's picture
Rename analyze_soup_output.txt to run_1-collapse/analyze_soup_output.txt
5a6b379 verified
Raw
History Blame
7.44 kB
=================================================================
BASE TIER SOUP ANALYSIS
Device: cuda
=================================================================
Loading checkpoint...
Loaded: mAP=0.825 cv=0.3117 epoch=19
clip_l14_openai loaded
dinov2_b14 loaded
siglip_b16_384 loaded
Running inference on 5000 val images...
Done: fused=torch.Size([5000, 128]) tri=torch.Size([5000, 256])
=================================================================
SCAN 1: ANCHOR GEOMETRY
=================================================================
Anchor pairwise cosine:
mean=0.0356 std=0.1896
max=0.9542 min=-0.9093
Max neighbor cosine per anchor:
mean=0.6949 std=0.2730
max=0.9542 min=0.0639
Anchor norms: mean=1.000000 std=0.000000
Anchor spectral: eff_rank=65.7/128
sv_max=5.0231 sv_10=2.8655 sv_50=0.9697 sv_min=0.125017
Anchor pentachoron CV: 0.2478
mean_vol=0.074751 std_vol=0.018524
=================================================================
SCAN 2: ANCHOR UTILIZATION
=================================================================
Active anchors: 1/256 (0.4%)
Visit counts: mean=19.5 std=312.5
max=5000 min=0
top 10: [5000, 0, 0, 0, 0, 0, 0, 0, 0, 0]
bottom 10: [0, 0, 0, 0, 0, 0, 0, 0, 0, 0]
Anchor entropy: -0.0000 / 5.5452 (-0.0%)
Per-anchor embedding density:
Intra-cluster cosine: mean=0.9693 std=0.0000
=================================================================
SCAN 3: PROJECTOR ANALYSIS
=================================================================
clip_l14:
norm: mean=1.000000 (should be 1.0)
self-sim off-diag: 0.9668
eff_dim: 24.1/128
dinov2_b14:
norm: mean=1.000000 (should be 1.0)
self-sim off-diag: 0.9678
eff_dim: 25.3/128
siglip_b16:
norm: mean=1.000000 (should be 1.0)
self-sim off-diag: 0.9501
eff_dim: 23.8/128
Expert agreement (cosine in 128-d):
clip_l14 × dinov2_b14 : mean=0.9898 std=0.0066 min=0.8730
clip_l14 × siglip_b16 : mean=0.9893 std=0.0052 min=0.9307
dinov2_b14 × siglip_b16 : mean=0.9855 std=0.0081 min=0.8920
Per-expert nearest anchor agreement:
clip_l14 × dinov2_b14 : same_anchor=1.0000 (100.0%)
clip_l14 × siglip_b16 : same_anchor=1.0000 (100.0%)
dinov2_b14 × siglip_b16 : same_anchor=1.0000 (100.0%)
Projector weight comparison:
clip_l14 : norm=37.4114 eff_rank=30.5/128
dinov2_b14 : norm=36.3149 eff_rank=23.0/128
siglip_b16 : norm=39.2079 eff_rank=29.0/128
clip_l14 × dinov2_b14 weight_cos=0.0046
clip_l14 × siglip_b16 weight_cos=-0.0049
dinov2_b14 × siglip_b16 weight_cos=-0.0055
=================================================================
SCAN 4: PATCHWORK COMPARTMENTS
=================================================================
Comp 0: 32 anchors
Comp 1: 32 anchors
Comp 2: 32 anchors
Comp 3: 32 anchors
Comp 4: 32 anchors
Comp 5: 32 anchors
Comp 6: 32 anchors
Comp 7: 32 anchors
Patchwork output: torch.Size([5000, 512])
norm: mean=11.6381 std=0.5046
comp 0: norm=2.8604 std_across_dims=0.0010
comp 1: norm=3.7652 std_across_dims=0.1596
comp 2: norm=2.3303 std_across_dims=0.0057
comp 3: norm=3.4802 std_across_dims=0.2053
comp 4: norm=3.3465 std_across_dims=0.1143
comp 5: norm=5.9651 std_across_dims=0.4720
comp 6: norm=6.0775 std_across_dims=0.4946
comp 7: norm=3.2188 std_across_dims=0.0718
=================================================================
SCAN 5: TRIANGULATION PATTERNS
=================================================================
Triangulation distances (1-cosine):
mean=0.8988 std=0.1301
min=0.0038 max=1.2538
Nearest anchor distance:
mean=0.0156 std=0.0042
max=0.0419 min=0.0038
Anchors within cos>0.5 per image:
mean=1.0 std=0.0
Top-10 nearest anchor distances:
k=0: mean=0.0156 std=0.0042
k=1: mean=0.6646 std=0.0218
k=2: mean=0.6806 std=0.0185
k=3: mean=0.6909 std=0.0173
k=4: mean=0.6977 std=0.0167
k=5: mean=0.7033 std=0.0162
k=6: mean=0.7081 std=0.0158
k=7: mean=0.7126 std=0.0154
k=8: mean=0.7166 std=0.0150
k=9: mean=0.7204 std=0.0147
=================================================================
SCAN 6: PER-CLASS ANCHOR AFFINITY
=================================================================
Top-3 anchors per class (first 20 classes):
person (n=2693): 65(2693/2693) 1(0/2693) 0(0/2693)
bicycle (n= 149): 65(149/149) 1(0/149) 0(0/149)
car (n= 535): 65(535/535) 1(0/535) 0(0/535)
motorcycle (n= 159): 65(159/159) 1(0/159) 0(0/159)
airplane (n= 97): 65(97/97) 1(0/97) 0(0/97)
bus (n= 189): 65(189/189) 1(0/189) 0(0/189)
train (n= 157): 65(157/157) 1(0/157) 0(0/157)
truck (n= 250): 65(250/250) 1(0/250) 0(0/250)
boat (n= 121): 65(121/121) 1(0/121) 0(0/121)
traffic light (n= 191): 65(191/191) 1(0/191) 0(0/191)
fire hydrant (n= 86): 65(86/86) 1(0/86) 0(0/86)
stop sign (n= 69): 65(69/69) 1(0/69) 0(0/69)
parking meter (n= 37): 65(37/37) 1(0/37) 0(0/37)
bench (n= 235): 65(235/235) 1(0/235) 0(0/235)
bird (n= 125): 65(125/125) 1(0/125) 0(0/125)
cat (n= 184): 65(184/184) 1(0/184) 0(0/184)
dog (n= 177): 65(177/177) 1(0/177) 0(0/177)
horse (n= 128): 65(128/128) 1(0/128) 0(0/128)
sheep (n= 65): 65(65/65) 1(0/65) 0(0/65)
cow (n= 87): 65(87/87) 1(0/87) 0(0/87)
Anchor specialization:
classes per anchor: mean=80.0 std=nan
max=80 min=80
=================================================================
SCAN 7: FUSED EMBEDDING GEOMETRY
=================================================================
Norms: mean=1.000000 std=0.000000
Self-sim (off-diag): 0.9693
Effective dim: 23.6/128
top-5 SVs explain 44.1%
top-10 SVs explain 70.2%
top-20 SVs explain 99.2%
top-50 SVs explain 100.0%
top-100 SVs explain 100.0%
Pentachoron CV: 0.3529
=================================================================
SCAN 8: EXPERT CONTRIBUTION
=================================================================
clip_l14 : cos_to_fused mean=0.9970 std=0.0016
dinov2_b14 : cos_to_fused mean=0.9957 std=0.0027
siglip_b16 : cos_to_fused mean=0.9955 std=0.0023
Without clip_l14 : cos_to_full=0.9992 (uniqueness=0.0008)
Without dinov2_b14 : cos_to_full=0.9989 (uniqueness=0.0011)
Without siglip_b16 : cos_to_full=0.9989 (uniqueness=0.0011)
Per-image expert disagreement:
Agreement: mean=0.9882 std=0.0055
Disagreement: mean=0.0041 std=0.0034
Most agreed image (1449): agreement=0.9978
labels: [22]
Most disagreed image (1435): agreement=0.9214
labels: [28]
=================================================================
ANALYSIS COMPLETE
=================================================================
/tmp/ipykernel_10600/3734699858.py:410: UserWarning: std(): degrees of freedom is <= 0. Correction should be strictly less than the reduction factor (input numel divided by output numel). (Triggered internally at /pytorch/aten/src/ATen/native/ReduceOps.cpp:1857.)
f"std={anchor_class_count[anchor_class_count>0].std():.1f}")