Holo-3.1-4B Coding LoRA

Overview

This repository contains a PEFT LoRA adapter for Hcompany/Holo-3.1-4B adapted for coding-oriented instruction following and Python problem solving. The adapter is intended to be loaded on top of the base model with PEFT-compatible tooling.

What Is Included

LoRA adapter weights in adapter_model.safetensors.
PEFT configuration in adapter_config.json.
Tokenizer and chat template files copied for convenient loading.
Evaluation and provenance artifacts from the release run.

Training And Evaluation Summary

The adapter was produced with supervised fine-tuning on curated coding instruction data, including targeted Python problem-solving examples, broader coding instruction examples, and small external coding-instruction samples. Evaluation used an 80-task held-out greedy decoding probe drawn from HumanEval-style and MBPP-style tasks.

Measured result on the held-out probe:

Base model: 24 / 80 tasks passed.
Adapter model: 31 / 80 tasks passed.
Relative lift over the measured base result: 29.17%.

These numbers are a compact functional probe, not a complete benchmark suite.

Intended Use

Use this adapter for coding assistance experiments, Python function synthesis, small algorithmic tasks, and research on lightweight coding adaptation. Load it with PEFT on top of Hcompany/Holo-3.1-4B.

Known Limitations

The evaluation probe is small and focused on short Python tasks.
The adapter may still fail hidden edge cases, multi-file tasks, long-context repository work, and non-Python languages.
Outputs should be tested before use in production or security-sensitive environments.
The adapter inherits limitations and licensing terms from the base model and training data sources.

File List

adapter_model.safetensors: LoRA adapter weights.
adapter_config.json: PEFT adapter configuration.
tokenizer.json, tokenizer_config.json, chat_template.jinja: tokenizer/chat assets.
release_summary.json: run summary and measured evaluation counts.
dataset_selection.json: high-level dataset selection record.
eval_before_after_full_code.csv: per-task before/after evaluation table.
trainer_log_history.json: trainer log history.

Reproducibility And Provenance

The release artifacts include dataset selection, trainer history, and before/after evaluation outputs to support auditability. The adapter was trained as a parameter-efficient LoRA continuation of the public base model and is distributed separately from the base weights.

Downloads last month: 24

Model tree for josephmayo/Holo-3.1-4B-Coder-LoRA

Base model

Qwen/Qwen3.5-0.8B-Base

Finetuned

Qwen/Qwen3.5-0.8B

Finetuned

Hcompany/Holo-3.1-4B

Adapter

(1)

this model