Holo-3.1-4B Coding Merged Model

Overview

This repository contains a merged Transformers checkpoint produced from Hcompany/Holo-3.1-4B and the companion coding LoRA adapter. It is intended for users who prefer loading a standard merged model rather than applying a PEFT adapter at runtime.

What Is Included

  • Merged model weights in sharded safetensors format.
  • Model configuration and generation configuration.
  • Tokenizer and chat template files.
  • A model card summarizing the measured coding adaptation result.

Training And Evaluation Summary

The underlying adapter was trained with supervised fine-tuning on curated coding instruction data. Evaluation used an 80-task held-out greedy decoding probe drawn from HumanEval-style and MBPP-style tasks.

Measured result on the held-out probe:

  • Base model: 24 / 80 tasks passed.
  • Adapted model: 31 / 80 tasks passed.
  • Relative lift over the measured base result: 29.17%.

The merged model should match the adapter-applied behavior, subject to normal numerical and runtime differences.

Intended Use

Use this checkpoint for coding assistance experiments, Python function generation, lightweight algorithmic problem solving, and local inference workflows that expect standard Transformers model files.

Known Limitations

  • The evaluation probe is small and does not cover all programming languages or repository-scale workflows.
  • The model can produce incorrect code, incomplete reasoning, or solutions that fail edge cases.
  • Generated code should be reviewed, tested, and sandboxed where appropriate.
  • The checkpoint inherits limitations and licensing terms from the base model and adaptation data sources.

File List

  • model-00001-of-00009.safetensors through model-00009-of-00009.safetensors: merged model shards.
  • model.safetensors.index.json: shard index.
  • config.json, generation_config.json: model configuration files.
  • tokenizer.json, tokenizer_config.json, chat_template.jinja: tokenizer/chat assets.
  • README.md: this model card.

Reproducibility And Provenance

The model was produced by merging a PEFT LoRA coding adapter into Hcompany/Holo-3.1-4B and saving the result as sharded safetensors. Companion evaluation and training provenance artifacts are available in the LoRA repository.

Downloads last month
21
Safetensors
Model size
4B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for josephmayo/Holo-3.1-4B-Coder

Finetuned
(2)
this model
Quantizations
1 model