Instructions to use josephmayo/Holo-3.1-4B-Coder-LoRA with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use josephmayo/Holo-3.1-4B-Coder-LoRA with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("Hcompany/Holo-3.1-4B") model = PeftModel.from_pretrained(base_model, "josephmayo/Holo-3.1-4B-Coder-LoRA") - Notebooks
- Google Colab
- Kaggle
Holo-3.1-4B Coding LoRA
Overview
This repository contains a PEFT LoRA adapter for Hcompany/Holo-3.1-4B adapted for coding-oriented instruction following and Python problem solving. The adapter is intended to be loaded on top of the base model with PEFT-compatible tooling.
What Is Included
- LoRA adapter weights in
adapter_model.safetensors. - PEFT configuration in
adapter_config.json. - Tokenizer and chat template files copied for convenient loading.
- Evaluation and provenance artifacts from the release run.
Training And Evaluation Summary
The adapter was produced with supervised fine-tuning on curated coding instruction data, including targeted Python problem-solving examples, broader coding instruction examples, and small external coding-instruction samples. Evaluation used an 80-task held-out greedy decoding probe drawn from HumanEval-style and MBPP-style tasks.
Measured result on the held-out probe:
- Base model: 24 / 80 tasks passed.
- Adapter model: 31 / 80 tasks passed.
- Relative lift over the measured base result: 29.17%.
These numbers are a compact functional probe, not a complete benchmark suite.
Intended Use
Use this adapter for coding assistance experiments, Python function synthesis, small algorithmic tasks, and research on lightweight coding adaptation. Load it with PEFT on top of Hcompany/Holo-3.1-4B.
Known Limitations
- The evaluation probe is small and focused on short Python tasks.
- The adapter may still fail hidden edge cases, multi-file tasks, long-context repository work, and non-Python languages.
- Outputs should be tested before use in production or security-sensitive environments.
- The adapter inherits limitations and licensing terms from the base model and training data sources.
File List
adapter_model.safetensors: LoRA adapter weights.adapter_config.json: PEFT adapter configuration.tokenizer.json,tokenizer_config.json,chat_template.jinja: tokenizer/chat assets.release_summary.json: run summary and measured evaluation counts.dataset_selection.json: high-level dataset selection record.eval_before_after_full_code.csv: per-task before/after evaluation table.trainer_log_history.json: trainer log history.
Reproducibility And Provenance
The release artifacts include dataset selection, trainer history, and before/after evaluation outputs to support auditability. The adapter was trained as a parameter-efficient LoRA continuation of the public base model and is distributed separately from the base weights.
- Downloads last month
- 24