--- base_model: Hcompany/Holo-3.1-4B library_name: peft pipeline_tag: text-generation tags: - coding - lora - peft - qlora - python --- # Holo-3.1-4B Coding LoRA ## Overview This repository contains a PEFT LoRA adapter for `Hcompany/Holo-3.1-4B` adapted for coding-oriented instruction following and Python problem solving. The adapter is intended to be loaded on top of the base model with PEFT-compatible tooling. ## What Is Included - LoRA adapter weights in `adapter_model.safetensors`. - PEFT configuration in `adapter_config.json`. - Tokenizer and chat template files copied for convenient loading. - Evaluation and provenance artifacts from the release run. ## Training And Evaluation Summary The adapter was produced with supervised fine-tuning on curated coding instruction data, including targeted Python problem-solving examples, broader coding instruction examples, and small external coding-instruction samples. Evaluation used an 80-task held-out greedy decoding probe drawn from HumanEval-style and MBPP-style tasks. Measured result on the held-out probe: - Base model: 24 / 80 tasks passed. - Adapter model: 31 / 80 tasks passed. - Relative lift over the measured base result: 29.17%. These numbers are a compact functional probe, not a complete benchmark suite. ## Intended Use Use this adapter for coding assistance experiments, Python function synthesis, small algorithmic tasks, and research on lightweight coding adaptation. Load it with PEFT on top of `Hcompany/Holo-3.1-4B`. ## Known Limitations - The evaluation probe is small and focused on short Python tasks. - The adapter may still fail hidden edge cases, multi-file tasks, long-context repository work, and non-Python languages. - Outputs should be tested before use in production or security-sensitive environments. - The adapter inherits limitations and licensing terms from the base model and training data sources. ## File List - `adapter_model.safetensors`: LoRA adapter weights. - `adapter_config.json`: PEFT adapter configuration. - `tokenizer.json`, `tokenizer_config.json`, `chat_template.jinja`: tokenizer/chat assets. - `release_summary.json`: run summary and measured evaluation counts. - `dataset_selection.json`: high-level dataset selection record. - `eval_before_after_full_code.csv`: per-task before/after evaluation table. - `trainer_log_history.json`: trainer log history. ## Reproducibility And Provenance The release artifacts include dataset selection, trainer history, and before/after evaluation outputs to support auditability. The adapter was trained as a parameter-efficient LoRA continuation of the public base model and is distributed separately from the base weights.