# qwen3-4b-instruct-2507-ben-franklin-v2-chatml-lora

## Summary

Custom Benjamin Franklin LoRA adapter for Qwen3 4B Instruct 4-bit.

## Paths

- Portfolio copy: `adapters/qwen3-4b-instruct-2507-ben-franklin-v2-chatml-lora`
- Source: copied adapter artifact in `adapters/qwen3-4b-instruct-2507-ben-franklin-v2-chatml-lora`
- Base model: `unsloth/Qwen3-4B-Instruct-2507-unsloth-bnb-4bit`

## Adapter details

- Adapter directory size: 2.19 GB
- adapter_model.safetensors: 252.1 MB
- SHA256(adapter_model.safetensors): `e6a6242d8200b0533c371296ea3c836ca3f4a4401162232bbbd02e572dcfd432`
- LoRA rank: 32
- LoRA alpha: 64
- Target modules: `['down_proj', 'q_proj', 'o_proj', 'v_proj', 'gate_proj', 'k_proj', 'up_proj']`

## Strengths

- Middle-size family: more capable than 1.7B while still comfortable on 8GB VRAM.
- Several variants target ChatML/completion formatting, tool-call cleanup, and English-lock behavior.

## Weaknesses / caveats

- Some later 4B adapters, especially v5, know targeted facts but emit visible tool_call tags offline.
- Can leak base-model identity or policy/meta phrasing depending on prompt path.

## Data mix

- franklin_qwen3_4b_answer_only.jsonl: 3253 rows

## Performance

Not benchmarked in the current coherence index.

## Memory / compute requirements

Inference/training: comfortable on RTX 3070 8GB in 4-bit. Full-module LoRA at r=16-32 was used historically; expect several GB VRAM and slower but practical training.

## Possible project uses

- Fast local prototyping and comparison against the 7B family.
- Source of good factual repair examples after stripping tool-call pollution.