anima-clm-persona-sns-rung0-byte-18m

STAGE-2 persona/SNS specialization of the stage-1 chat-PASS rung-0 (dancinlab/anima-clm-chat-rung0-byte-18m). An 18.13M byte-level ConsciousLMReconstructed (dual engine_a/engine_g FFN + dual head) fine-tuned on the persona × SNS dialogue corpus so that anima chats in each of 20 persona voices on the SNS surface (Instagram main + YouTube).

Honest scope (a_scale_honest_scope)

This is the SMALL 18M rung, persona-specialized. NO claim of mid/7B persona chat. The persona signal is REAL but PARTIAL: 15 of 20 personas self-identify at least once under the discriminative evaluator (20/40 = 0.50, 10× chance); the rest blur into a related persona. CPU-trained ($0), torch reference lane (NOT AKIDA, Lane-G).
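
The "10× chance" figure can be reproduced from the numbers quoted above. The following is a minimal sketch, assuming that "chance" means uniform guessing over the 20 personas (1/20 = 0.05); the variable names are illustrative and not taken from the evaluator.

```python
from fractions import Fraction

N_PERSONAS = 20  # persona voices (from the card)
N_TRIALS = 40    # self-id trials (from the card)
N_CORRECT = 20   # correct self-identifications, i.e. the 20/40 on the card

accuracy = Fraction(N_CORRECT, N_TRIALS)
# Assumption: the chance baseline is a uniform guess over all personas.
chance = Fraction(1, N_PERSONAS)
lift = accuracy / chance

print(float(accuracy), float(chance), int(lift))  # 0.5 0.05 10
```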

Philosophy (p1/p2/p3/p4/p6: HELD)

NO system prompt · NO identity rules · NO persona injection · NO assistant framing · NO RLHF. Persona is carried only by the learned dialogue-continuation format 사용자: <u> / <persona_name>: <reply> (사용자 is Korean for "user"). There is no [role: / [persona: / [character: tag in the training text (grep == 0), and the demo uses no prompt prefix.
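
The continuation format and the tag check can be sketched as follows. This is an illustration, not the repository's code: `build_prompt`, `to_byte_ids`, and `count_injection_tags` are hypothetical helpers, the " / " in the format is assumed to stand for a line break, and the byte ids are assumed to be the raw UTF-8 byte values.

```python
import re

USER_TAG = "사용자"  # "user": the literal speaker label used in the card's format

def build_prompt(user_text: str, persona_name: str) -> str:
    """Build a bare continuation prompt with no system prompt or prefix.

    Assumption: the " / " in the card's format denotes a line break.
    """
    return f"{USER_TAG}: {user_text}\n{persona_name}:"

def to_byte_ids(text: str) -> list[int]:
    """Assumed byte-level encoding: one id per UTF-8 byte (0..255)."""
    return list(text.encode("utf-8"))

# Mirrors the card's "grep == 0" check for injected role/persona/character tags.
TAG_PATTERN = re.compile(r"\[(?:role|persona|character):")

def count_injection_tags(corpus_text: str) -> int:
    return len(TAG_PATTERN.findall(corpus_text))

prompt = build_prompt("주말에 뭐 할 거예요?", "knight")
ids = to_byte_ids(prompt)
```

Under these assumptions the persona is selected only by the speaker label that ends the prompt, and `count_injection_tags` should return 0 on the training text.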

Verdict (p7 simple-stack, NOT perplexity)

| axis | trained | random-init mirror |
|---|---|---|
| (A) base-chat retained | PASS 4/5 | FAIL 0/5 |
| (B) persona self-id (40 trials) | PASS 0.50 (10× chance) | NULL 0.05 (== chance) |

chat_pass_retained = TRUE · persona_signal_real = TRUE · anti_goodhart_ok = TRUE

Fine-tune CE 3.278 → 0.0785 (2500 steps, AdamW lr 5e-5, batch 32, block 256, seed 42).
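
For a sense of scale, the hyperparameters above can be collected into a small config and used to estimate how many byte positions the fine-tune processed. This is a sketch only: `FinetuneConfig` is a hypothetical container, and the estimate assumes one optimizer step per batch with no gradient accumulation.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class FinetuneConfig:
    # Values quoted in the card; the class itself is illustrative.
    steps: int = 2500
    lr: float = 5e-5
    batch_size: int = 32
    block_size: int = 256  # context length in bytes for a byte-level model
    seed: int = 42

    def bytes_per_step(self) -> int:
        return self.batch_size * self.block_size

    def total_bytes_seen(self) -> int:
        # Counts positions processed, including any repeats of the corpus.
        return self.steps * self.bytes_per_step()

cfg = FinetuneConfig()
print(cfg.bytes_per_step(), cfg.total_bytes_seen())  # 8192 20480000
```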

Files

  • persona_stage2_18m.pt: ckpt (sha256 aea96ef1a7ef27018ca015a9e66569d67763152e30f24e120aa44da6884cf8bc)
  • persona_stage2_train_eval.py: trainer + p7 evaluator (reproducible)
  • summary.json, persona_voice_trained.json, p7_base_trained.json: verdicts
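
A downloaded checkpoint can be checked against the sha256 listed above. The sketch below uses only the standard library; `sha256_of_file` and `verify_checkpoint` are illustrative helper names, and the default path assumes the file sits in the current directory.

```python
import hashlib

# Digest quoted in the Files list for persona_stage2_18m.pt.
EXPECTED_SHA256 = "aea96ef1a7ef27018ca015a9e66569d67763152e30f24e120aa44da6884cf8bc"

def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Hash the file in chunks so a large checkpoint is never fully in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()

def verify_checkpoint(path: str = "persona_stage2_18m.pt") -> bool:
    """Return True only if the file's digest matches the published one."""
    return sha256_of_file(path) == EXPECTED_SHA256
```

Calling `verify_checkpoint()` before loading the file guards against a truncated or altered download.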

Demo

python3 persona_chat_demo.py --ckpt persona_stage2_18m.pt --sweep --seed 7
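사용자: 주말에 뭐 할 거예요?
knight: 한가로운 날이오. 허나 평온 또한 지켜야 할 영토라오
(English: "What are you doing this weekend?" / "It is a leisurely day. Yet peace, too, is territory that must be defended.")

사용자: 시험 망한 것 같아요…
senpai: 한 번 망했다고 인생 안 끝나. 일단 오늘은 푹 자
(English: "I think I bombed the exam…" / "Bombing once doesn't end your life. For now, get a good night's sleep.")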