john-broadway commited on
Commit
ef8d7f8
·
verified ·
1 Parent(s): 9656f93

Add card: v1 Sovereign Collection voice + corrected credit

Browse files
Files changed (1) hide show
  1. README.md +58 -0
README.md ADDED
@@ -0,0 +1,58 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model: Qwen/Qwen2.5-7B-Instruct
4
+ tags:
5
+ - rys
6
+ - layer-duplication
7
+ - reasoning-circuits
8
+ - gguf
9
+ - sovereign-collection-v1
10
+ ---
11
+
12
+ # Qwen2.5-7B-RYS-8-12
13
+
14
+ Qwen2.5-7B-Instruct with layers 8-12 duplicated. The math circuit runs twice on every forward pass.
15
+
16
+ 28 base layers → 32 after duplication. No training, no merging, no weight changes.
17
+
18
+ **Math +10% (0.5653 → 0.6645). EQ +1.0. Reasoning held at 94.12%.**
19
+
20
+ ## Results
21
+
22
+ | Metric | Baseline | RYS (8,12) | Delta |
23
+ |--------|----------|------------|-------|
24
+ | Math | 0.5653 | 0.6645 | +10% |
25
+ | EQ | 89.69 | 90.66 | +0.97 |
26
+ | Reasoning | 94.12% | 94.12% | 0.00 |
27
+
28
+ **The math specialist.** The first sub-14B model the method was extended to. David Ng demonstrated layer-duplication on Qwen2-72B; we found it works just as cleanly here, 10× smaller. Of 51 swept configurations, (8,12) is the one that gets the math circuit alone — duplicate that 4-layer block and nothing else trades down. The original GitHub-issue writeup describing exactly this result lives in the v2 corpus: [`docs/github-issue-draft.md`](https://huggingface.co/datasets/john-broadway/rys-sovereign-collection-v2/blob/main/github-issue-draft.md).
29
+
30
+ ## Usage
31
+
32
+ ```
33
+ llama-server -m Qwen2.5-7B-RYS-8-12-Q4_K_M.gguf -ngl 99
34
+ ```
35
+
36
+ ## Full sweep data
37
+
38
+ 51 configurations tested. Full sweep data in the v2 corpus dataset. For vLLM-serving with AWQ quantization, see companion repo [`john-broadway/Qwen2.5-7B-RYS-8-12-AWQ`](https://huggingface.co/john-broadway/Qwen2.5-7B-RYS-8-12-AWQ).
39
+
40
+ Part of the RYS Sovereign Collection v1.
41
+
42
+ ---
43
+
44
+ ## Where this sits in the Sovereign Collection
45
+
46
+ **v1 — Qwen2.5 cross-scale + Qwen3-32B headline.** Four sizes from 0.5B to 32B; RYS works at every scale, with the lift size and dimension shifting by baseline:
47
+
48
+ - 0.5B → EQ specialist
49
+ - 1.5B → balanced daily driver
50
+ - 7B → math specialist via (8,12)
51
+ - 32B → the headline "Big Boy"
52
+
53
+ **v2 — cross-architecture extension.** 21 model variants across 10 architecture families. Headline: weak baselines lift more, in their weakest dimension. → [`john-broadway/rys-sovereign-collection-v2`](https://huggingface.co/datasets/john-broadway/rys-sovereign-collection-v2)
54
+
55
+
56
+ **Credit**
57
+
58
+ John Broadway, with collaboration from Claude (Opus 4.6 in April 2026 build; Opus 4.7 in May 2026 analysis and publication). Original RYS method by [David Ng](https://dnhkng.github.io/posts/rys/) on Qwen2-72B; sweep toolkit by [alainnothere](https://github.com/alainnothere/llm-circuit-finder).