AbstractPhil
/

geolip-vit-base-x3

geometric-deep-learning

Model card Files Files and versions

Metrics Training metrics Community

AbstractPhil commited on Mar 15

Commit

68eead2

·

verified ·

1 Parent(s): d9e7232

Update README.md

Files changed (1) hide show

README.md +4 -0

README.md CHANGED Viewed

@@ -25,6 +25,10 @@ in the smaller model by direct implicit learning.
 geolip-vit-x3 must learn to predict the experts, which means it can never get a full picture of anything outside of it's own tools.
 The anchors are strong enough and tuned to the experts, the external losses are tuned to teach the expert responses, the expert data is used
 as loss methods of attenuation, and the structure conforms to those losses specifically because it's required to teach the model tobe
 standalone and compliant without requiring the experts later.

 geolip-vit-x3 must learn to predict the experts, which means it can never get a full picture of anything outside of it's own tools.
+This model is exceptionally small, absurdly small even by vit standards. This is because even at this size, this is too much. The model cannot
+overfit if the model uses every tool at the expense, this model will train indefinitely unless a cascade overflow happens, a math continuity corruption
+occurs, or the substructure collapses to a simpler shortcut-centric behavior that would require scrambling.
 The anchors are strong enough and tuned to the experts, the external losses are tuned to teach the expert responses, the expert data is used
 as loss methods of attenuation, and the structure conforms to those losses specifically because it's required to teach the model tobe
 standalone and compliant without requiring the experts later.