geolip-vit-base-x3 / run_1-collapse /analyze_soup_output.txt

Rename analyze_soup_output.txt to run_1-collapse/analyze_soup_output.txt

5a6b379 verified 3 months ago

7.44 kB

	=================================================================
	BASE TIER SOUP ANALYSIS
	Device: cuda
	=================================================================

	Loading checkpoint...
	Loaded: mAP=0.825 cv=0.3117 epoch=19
	clip_l14_openai loaded
	dinov2_b14 loaded
	siglip_b16_384 loaded

	Running inference on 5000 val images...
	Done: fused=torch.Size([5000, 128]) tri=torch.Size([5000, 256])

	=================================================================
	SCAN 1: ANCHOR GEOMETRY
	=================================================================
	Anchor pairwise cosine:
	mean=0.0356 std=0.1896
	max=0.9542 min=-0.9093
	Max neighbor cosine per anchor:
	mean=0.6949 std=0.2730
	max=0.9542 min=0.0639
	Anchor norms: mean=1.000000 std=0.000000
	Anchor spectral: eff_rank=65.7/128
	sv_max=5.0231 sv_10=2.8655 sv_50=0.9697 sv_min=0.125017
	Anchor pentachoron CV: 0.2478
	mean_vol=0.074751 std_vol=0.018524

	=================================================================
	SCAN 2: ANCHOR UTILIZATION
	=================================================================
	Active anchors: 1/256 (0.4%)
	Visit counts: mean=19.5 std=312.5
	max=5000 min=0
	top 10: [5000, 0, 0, 0, 0, 0, 0, 0, 0, 0]
	bottom 10: [0, 0, 0, 0, 0, 0, 0, 0, 0, 0]
	Anchor entropy: -0.0000 / 5.5452 (-0.0%)

	Per-anchor embedding density:
	Intra-cluster cosine: mean=0.9693 std=0.0000

	=================================================================
	SCAN 3: PROJECTOR ANALYSIS
	=================================================================

	clip_l14:
	norm: mean=1.000000 (should be 1.0)
	self-sim off-diag: 0.9668
	eff_dim: 24.1/128

	dinov2_b14:
	norm: mean=1.000000 (should be 1.0)
	self-sim off-diag: 0.9678
	eff_dim: 25.3/128

	siglip_b16:
	norm: mean=1.000000 (should be 1.0)
	self-sim off-diag: 0.9501
	eff_dim: 23.8/128

	Expert agreement (cosine in 128-d):
	clip_l14 × dinov2_b14 : mean=0.9898 std=0.0066 min=0.8730
	clip_l14 × siglip_b16 : mean=0.9893 std=0.0052 min=0.9307
	dinov2_b14 × siglip_b16 : mean=0.9855 std=0.0081 min=0.8920

	Per-expert nearest anchor agreement:
	clip_l14 × dinov2_b14 : same_anchor=1.0000 (100.0%)
	clip_l14 × siglip_b16 : same_anchor=1.0000 (100.0%)
	dinov2_b14 × siglip_b16 : same_anchor=1.0000 (100.0%)

	Projector weight comparison:
	clip_l14 : norm=37.4114 eff_rank=30.5/128
	dinov2_b14 : norm=36.3149 eff_rank=23.0/128
	siglip_b16 : norm=39.2079 eff_rank=29.0/128
	clip_l14 × dinov2_b14 weight_cos=0.0046
	clip_l14 × siglip_b16 weight_cos=-0.0049
	dinov2_b14 × siglip_b16 weight_cos=-0.0055

	=================================================================
	SCAN 4: PATCHWORK COMPARTMENTS
	=================================================================
	Comp 0: 32 anchors
	Comp 1: 32 anchors
	Comp 2: 32 anchors
	Comp 3: 32 anchors
	Comp 4: 32 anchors
	Comp 5: 32 anchors
	Comp 6: 32 anchors
	Comp 7: 32 anchors

	Patchwork output: torch.Size([5000, 512])
	norm: mean=11.6381 std=0.5046
	comp 0: norm=2.8604 std_across_dims=0.0010
	comp 1: norm=3.7652 std_across_dims=0.1596
	comp 2: norm=2.3303 std_across_dims=0.0057
	comp 3: norm=3.4802 std_across_dims=0.2053
	comp 4: norm=3.3465 std_across_dims=0.1143
	comp 5: norm=5.9651 std_across_dims=0.4720
	comp 6: norm=6.0775 std_across_dims=0.4946
	comp 7: norm=3.2188 std_across_dims=0.0718

	=================================================================
	SCAN 5: TRIANGULATION PATTERNS
	=================================================================
	Triangulation distances (1-cosine):
	mean=0.8988 std=0.1301
	min=0.0038 max=1.2538
	Nearest anchor distance:
	mean=0.0156 std=0.0042
	max=0.0419 min=0.0038
	Anchors within cos>0.5 per image:
	mean=1.0 std=0.0
	Top-10 nearest anchor distances:
	k=0: mean=0.0156 std=0.0042
	k=1: mean=0.6646 std=0.0218
	k=2: mean=0.6806 std=0.0185
	k=3: mean=0.6909 std=0.0173
	k=4: mean=0.6977 std=0.0167
	k=5: mean=0.7033 std=0.0162
	k=6: mean=0.7081 std=0.0158
	k=7: mean=0.7126 std=0.0154
	k=8: mean=0.7166 std=0.0150
	k=9: mean=0.7204 std=0.0147

	=================================================================
	SCAN 6: PER-CLASS ANCHOR AFFINITY
	=================================================================

	Top-3 anchors per class (first 20 classes):
	person (n=2693): 65(2693/2693) 1(0/2693) 0(0/2693)
	bicycle (n= 149): 65(149/149) 1(0/149) 0(0/149)
	car (n= 535): 65(535/535) 1(0/535) 0(0/535)
	motorcycle (n= 159): 65(159/159) 1(0/159) 0(0/159)
	airplane (n= 97): 65(97/97) 1(0/97) 0(0/97)
	bus (n= 189): 65(189/189) 1(0/189) 0(0/189)
	train (n= 157): 65(157/157) 1(0/157) 0(0/157)
	truck (n= 250): 65(250/250) 1(0/250) 0(0/250)
	boat (n= 121): 65(121/121) 1(0/121) 0(0/121)
	traffic light (n= 191): 65(191/191) 1(0/191) 0(0/191)
	fire hydrant (n= 86): 65(86/86) 1(0/86) 0(0/86)
	stop sign (n= 69): 65(69/69) 1(0/69) 0(0/69)
	parking meter (n= 37): 65(37/37) 1(0/37) 0(0/37)
	bench (n= 235): 65(235/235) 1(0/235) 0(0/235)
	bird (n= 125): 65(125/125) 1(0/125) 0(0/125)
	cat (n= 184): 65(184/184) 1(0/184) 0(0/184)
	dog (n= 177): 65(177/177) 1(0/177) 0(0/177)
	horse (n= 128): 65(128/128) 1(0/128) 0(0/128)
	sheep (n= 65): 65(65/65) 1(0/65) 0(0/65)
	cow (n= 87): 65(87/87) 1(0/87) 0(0/87)

	Anchor specialization:
	classes per anchor: mean=80.0 std=nan
	max=80 min=80

	=================================================================
	SCAN 7: FUSED EMBEDDING GEOMETRY
	=================================================================
	Norms: mean=1.000000 std=0.000000
	Self-sim (off-diag): 0.9693
	Effective dim: 23.6/128
	top-5 SVs explain 44.1%
	top-10 SVs explain 70.2%
	top-20 SVs explain 99.2%
	top-50 SVs explain 100.0%
	top-100 SVs explain 100.0%
	Pentachoron CV: 0.3529

	=================================================================
	SCAN 8: EXPERT CONTRIBUTION
	=================================================================
	clip_l14 : cos_to_fused mean=0.9970 std=0.0016
	dinov2_b14 : cos_to_fused mean=0.9957 std=0.0027
	siglip_b16 : cos_to_fused mean=0.9955 std=0.0023
	Without clip_l14 : cos_to_full=0.9992 (uniqueness=0.0008)
	Without dinov2_b14 : cos_to_full=0.9989 (uniqueness=0.0011)
	Without siglip_b16 : cos_to_full=0.9989 (uniqueness=0.0011)

	Per-image expert disagreement:
	Agreement: mean=0.9882 std=0.0055
	Disagreement: mean=0.0041 std=0.0034

	Most agreed image (1449): agreement=0.9978
	labels: [22]
	Most disagreed image (1435): agreement=0.9214
	labels: [28]

	=================================================================
	ANALYSIS COMPLETE
	=================================================================
	/tmp/ipykernel_10600/3734699858.py:410: UserWarning: std(): degrees of freedom is <= 0. Correction should be strictly less than the reduction factor (input numel divided by output numel). (Triggered internally at /pytorch/aten/src/ATen/native/ReduceOps.cpp:1857.)
	f"std={anchor_class_count[anchor_class_count>0].std():.1f}")