File size: 2,507 Bytes
ef8d7f8
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
---
license: apache-2.0
base_model: Qwen/Qwen2.5-7B-Instruct
tags:
  - rys
  - layer-duplication
  - reasoning-circuits
  - gguf
  - sovereign-collection-v1
---

# Qwen2.5-7B-RYS-8-12

Qwen2.5-7B-Instruct with layers 8-12 duplicated. The math circuit runs twice on every forward pass.

28 base layers β†’ 32 after duplication. No training, no merging, no weight changes.

**Math +10% (0.5653 β†’ 0.6645). EQ +1.0. Reasoning held at 94.12%.**

## Results

| Metric | Baseline | RYS (8,12) | Delta |
|--------|----------|------------|-------|
| Math | 0.5653 | 0.6645 | +10% |
| EQ | 89.69 | 90.66 | +0.97 |
| Reasoning | 94.12% | 94.12% | 0.00 |

**The math specialist.** The first sub-14B model the method was extended to. David Ng demonstrated layer-duplication on Qwen2-72B; we found it works just as cleanly here, 10Γ— smaller. Of 51 swept configurations, (8,12) is the one that gets the math circuit alone β€” duplicate that 4-layer block and nothing else trades down. The original GitHub-issue writeup describing exactly this result lives in the v2 corpus: [`docs/github-issue-draft.md`](https://huggingface.co/datasets/john-broadway/rys-sovereign-collection-v2/blob/main/github-issue-draft.md).

## Usage

```
llama-server -m Qwen2.5-7B-RYS-8-12-Q4_K_M.gguf -ngl 99
```

## Full sweep data

51 configurations tested. Full sweep data in the v2 corpus dataset. For vLLM-serving with AWQ quantization, see companion repo [`john-broadway/Qwen2.5-7B-RYS-8-12-AWQ`](https://huggingface.co/john-broadway/Qwen2.5-7B-RYS-8-12-AWQ).

Part of the RYS Sovereign Collection v1.

---

## Where this sits in the Sovereign Collection

**v1 β€” Qwen2.5 cross-scale + Qwen3-32B headline.** Four sizes from 0.5B to 32B; RYS works at every scale, with the lift size and dimension shifting by baseline:

- 0.5B β†’ EQ specialist
- 1.5B β†’ balanced daily driver
- 7B β†’ math specialist via (8,12)
- 32B β†’ the headline "Big Boy"

**v2 β€” cross-architecture extension.** 21 model variants across 10 architecture families. Headline: weak baselines lift more, in their weakest dimension. β†’ [`john-broadway/rys-sovereign-collection-v2`](https://huggingface.co/datasets/john-broadway/rys-sovereign-collection-v2)


**Credit**

John Broadway, with collaboration from Claude (Opus 4.6 in April 2026 build; Opus 4.7 in May 2026 analysis and publication). Original RYS method by [David Ng](https://dnhkng.github.io/posts/rys/) on Qwen2-72B; sweep toolkit by [alainnothere](https://github.com/alainnothere/llm-circuit-finder).