TinyLlama-1.1B-RYS-10-14-GGUF

A layer-duplication ("RYS" — Repeat Your Self, David Ng) variant of TinyLlama/TinyLlama-1.1B-Chat-v1.0: layers 10–13 duplicated, 22 → 26 layers. No training, merging, or weight changes. GGUF.

⚠️ Evaluation status — please read (updated 2026-06)

The original card headlined an EQ jump of 4.65 → 52.50 (+47.85) as "the largest EQ gain in the corpus." Direct output inspection confirms this is a scoring artifact, not a gain in emotional intelligence.

What the model actually emits (inspected on the current stack):

  • Base fails to follow the rating format — on many scenarios it returns prose ("David is furious…") instead of numbers, so the probe can't parse it and scores ~0. Its low EQ is largely a parse failure.
  • (10,14) emits a near-constant rating vector (~`5,3,8,2`) for every scenario — the same numbers for "your spouse is cheating" as for "you received an award." It isn't reading the scenario; it just produces a parseable constant guess that averages ~60.

So the "EQ unlock" is the probe rewarding a parseable constant guess over the base model's unparseable prose — not emotional understanding. These are 16-question search probes, not a validated benchmark.

Bottom line: treat this as a normal TinyLlama-1.1B-Chat with layers 10–13 duplicated. The "EQ unlock" is a measurement artifact, confirmed by inspection — not an established capability gain.

Original sweep numbers (search probe — kept for the record)

probe reported baseline reported (10,14) note
EQ (16 q) 4.65 52.50 verified artifact: base outputs unparseable prose (~0); (10,14) emits a constant 5,3,8,2 for all scenarios. Not emotional understanding
Reasoning (17 q) 29.41% 23.53% down 1 question — no reasoning gain
Math (16 q) 0.296 0.296 flat

Run it

llama-server -m TinyLlama-1.1B-RYS-10-14-Q4_K_M.gguf -ngl 99

Method · data · attribution

  • Method: layer duplication — Repeat Your Self (David Ng); toolkit llm-circuit-finder (alainnothere).
  • Raw sweep data: rys-sovereign-collection-v2.
  • Built by John Broadway with Claude. The method and the raw data are real; the interpretation of the EQ probe delta as capability is what this update corrects.

License

Apache-2.0.

Downloads last month
56
GGUF
Model size
1B params
Architecture
llama
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for john-broadway/TinyLlama-1.1B-RYS-10-14-GGUF

Quantized
(150)
this model

Collection including john-broadway/TinyLlama-1.1B-RYS-10-14-GGUF