# GPU Finetuning (LoRA / QLoRA)

Steps to run a full finetune on GPU:

1. Prepare an `accelerate` config (see `backend/accelerate_config_demo.yaml`).
2. Ensure the venv has `transformers`, `peft`, `accelerate`, `bitsandbytes`, and `datasets` installed. Example:

```
.venv/bin/pip install -r backend/requirements-extras.txt
.venv/bin/pip install peft bitsandbytes accelerate -U
```

3. Run the helper script in dry-run first, then without `--dry-run`:

```
./backend/scripts/run_finetune_gpu.sh --model mistralai/Mistral-7B-Instruct-v0.1 --data data/finetune_sample.jsonl --out models/mistral_finetuned --method qlora --epochs 3 --batch 4
```

Notes:

- Training large models requires appropriate GPU memory and CUDA drivers.
- When ready, remove `--dry-run` in `run_finetune_gpu.sh` or edit the command to perform the real run.