philipjohnbasile committed · Commit ef322ad · verified · 1 Parent(s): d7aaa7a

Sync README ← card (design-heal note)

  1. README.md +3 -1
README.md CHANGED
@@ -199,7 +199,9 @@ per facet). See [`design/DESIGN.md`](design/DESIGN.md).
  on MoE; $4–15K or weeks). 4-bit re-quant is also *slower* for single-token decode (bandwidth-bound).
  - **Multilingual** ability reduced (optional vocab-trim drops ~31% of tokens).
  - **Design** is competent but not yet design-soul-elite (correct structure, but missed OKLCH/grid when
- tested) — the design-canon heal closes this.
+ tested) — a first **design-soul heal adapter** ships in [`adapters-design/`](adapters-design) (light
+ 19-example LoRA pass, val loss 0.14); evaluate via the soul-retention scorecard before relying on it, and
+ a fuller 60-example heal is the next pass. Use the `CANON` system prompt for elite output *today*.
  - Prompt-cache can OOM under heavy concurrent load. Speculative decoding (external draft, MTP, *or* prompt-lookup)
  does **not** help single-stream on this MoE — all measured-dead (above); batch for throughput instead.