# qwen3-4b-instruct-2507-ben-franklin-v1-lora ## Summary Custom Benjamin Franklin LoRA adapter for Qwen3 4B Instruct 4-bit. ## Paths - Portfolio copy: `adapters/qwen3-4b-instruct-2507-ben-franklin-v1-lora` - Source: copied adapter artifact in `adapters/qwen3-4b-instruct-2507-ben-franklin-v1-lora` - Base model: `unsloth/Qwen3-4B-Instruct-2507-unsloth-bnb-4bit` ## Adapter details - Adapter directory size: 1.34 GB - adapter_model.safetensors: 126.1 MB - SHA256(adapter_model.safetensors): `e109b9fdb3da837c52d86e58dd2e3ba286dcac9b68d5596b56b28b6dc2d52c2b` - LoRA rank: 16 - LoRA alpha: 32 - Target modules: `['o_proj', 'v_proj', 'k_proj', 'gate_proj', 'up_proj', 'down_proj', 'q_proj']` ## Strengths - Middle-size family: more capable than 1.7B while still comfortable on 8GB VRAM. - Several variants target ChatML/completion formatting, tool-call cleanup, and English-lock behavior. ## Weaknesses / caveats - Some later 4B adapters, especially v5, know targeted facts but emit visible tool_call tags offline. - Can leak base-model identity or policy/meta phrasing depending on prompt path. ## Data mix - franklin_qwen3_4b_answer_only.jsonl: 3253 rows ## Performance Not benchmarked in the current coherence index. ## Memory / compute requirements Inference/training: comfortable on RTX 3070 8GB in 4-bit. Full-module LoRA at r=16-32 was used historically; expect several GB VRAM and slower but practical training. ## Possible project uses - Fast local prototyping and comparison against the 7B family. - Source of good factual repair examples after stripping tool-call pollution.