# qwen2.5-7b-ben-franklin-v1-lite-r4-qv ## Summary Custom Benjamin Franklin LoRA adapter for Qwen2.5 7B Instruct 4-bit. ## Paths - Portfolio copy: `adapters/qwen2.5-7b-ben-franklin-v1-lite-r4-qv` - Source: copied adapter artifact in `adapters/qwen2.5-7b-ben-franklin-v1-lite-r4-qv` - Base model: `unsloth/Qwen2.5-7B-Instruct-bnb-4bit` ## Adapter details - Adapter directory size: 0.06 GB - adapter_model.safetensors: 4.8 MB - SHA256(adapter_model.safetensors): `f1d6d49a4ecb8597501d50089688c8ce950c1b7ffa92f0d69099fa62a7a1f72a` - LoRA rank: 4 - LoRA alpha: 8 - Target modules: `['v_proj', 'q_proj']` ## Strengths - Largest Benjamin Franklin LoRA family proven trainable on this RTX 3070 8GB machine. - Best base reasoning/coherence among the local Franklin adapters. - Good short-turn continuity in the coherence benchmark. - Benchmark score: 30 with flags {'base_identity_leak': 1, 'overdenial': 1}. ## Weaknesses / caveats - Qwen2.5 7B base is already highly steerable, so improvements over the prompted base are modest. - q/v-only LoRA is weak for implanting stubborn factual corrections. - Craven Street/Hewson factuality remains unreliable unless retrieval/prompt context is supplied. ## Data mix - franklin_qwen3_8b_chatml_answer_clean.jsonl: 2400 rows ## Performance Score: 30 Flags: `{'base_identity_leak': 1, 'overdenial': 1}` Raw benchmark: benchmarks/franklin_coherence/franklin_coherence_20260623_080144.json HTML: benchmarks/franklin_coherence/franklin_coherence_20260623_080144.html ## Memory / compute requirements Inference: RTX 3070 8GB works in 4-bit with the adapter; expect roughly 6-7GB VRAM. Training proven only with q_proj/v_proj, r=4 or r=8, max_seq_length=512, batch_size=1, gradient_accumulation=16; full-module LoRA is not recommended on 8GB. ## Possible project uses - Conversational Franklin persona in local apps. - A strong baseline for RAG-backed historical character demos. - Best starting point for further 7B experiments.