philipjohnbasile commited on
Commit
9b0de50
·
verified ·
1 Parent(s): 381e5d2

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +16 -0
README.md CHANGED
@@ -122,6 +122,22 @@ heals the **native** prior so it designs elite with no prompt at all.
122
  | Decode speed | **11.3 tok/s** (no draft) — see the speed note in limitations |
123
  | Verified-decode checker | TS 0.3 ms · Python ~0 ms · Rust 34 ms |
124
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
125
  ## Honest limitations
126
  - **Specialist:** ~70% of experts pruned — strong in the target niche, weaker outside it. Not the full 743B.
127
  - **Speed ~11 tok/s decode** (reading pace; ~3 min for long thinking-ON answers). Partly MLX's still-naive
 
122
  | Decode speed | **11.3 tok/s** (no draft) — see the speed note in limitations |
123
  | Verified-decode checker | TS 0.3 ms · Python ~0 ms · Rust 34 ms |
124
 
125
+ ## Roadmap — the Demolition family (shrink, keep the soul)
126
+ Same masters-trained soul (design · dataviz · code · security · math · prose · architecture · research), every
127
+ Mac — the elite training lives in the facet-inclusive calibration + heal corpus, which are **size-agnostic**:
128
+
129
+ ```
130
+ 99GB : ████████ (baseline, this model)
131
+ 64GB : should hold ~baseline (96 GB Macs)
132
+ 48GB : should hold high (64 GB Macs)
133
+ 28GB : the squeeze — watch which facets dip (36-48 GB Macs)
134
+ 14GB : ⚗️ where does the soul start to break? (24 GB Macs)
135
+ 7GB : ⚗️ the floor (16 GB laptops)
136
+ ```
137
+
138
+ Each size: facet-calib → prune harder → quantize → heal (the soul corpus) → soul-retention scorecard (% elite
139
+ per facet). See [`design/DESIGN.md`](design/DESIGN.md).
140
+
141
  ## Honest limitations
142
  - **Specialist:** ~70% of experts pruned — strong in the target niche, weaker outside it. Not the full 743B.
143
  - **Speed ~11 tok/s decode** (reading pace; ~3 min for long thinking-ON answers). Partly MLX's still-naive