Mistral-7B-Instruct-v0.3-gsm8k-ties-d0p2-lam1p0-a0p9-v1

Merged model produced by TIES-Merging (Yadav et al., NeurIPS 2023).

Construction

  • Base model: mistralai/Mistral-7B-Instruct-v0.3
  • Task vectors:
    • safety = wvnvwn/Mistral-7B-Instruct-v0.3-hhrlhf-v1, alpha = 0.9
    • downstream = wvnvwn/Mistral-7B-Instruct-v0.3-gsm8k-v1, alpha = 0.09999999999999998

Hyperparameters

  • Method: ties
  • Scaling coefficient (lambda): 1.0
  • TIES Trim density: 0.2
  • dtype: bfloat16

Merge formula

θmerged=θbase+λM(ατsafety+(1α)τdownstream),\theta_\text{merged} = \theta_\text{base} + \lambda \cdot M\bigl(\alpha \cdot \tau_\text{safety} + (1-\alpha) \cdot \tau_\text{downstream}\bigr),

where tau_i = theta_i_sft - theta_base, and M(.) is the identity for TA, the Trim-Elect-Disjoint operator for TIES, and DARE's drop-and-rescale preprocessing (composed with TA) for DARE.

Reproduction

Produced by the GTM merge_cli (see commit history of the GTM repo). Full metadata is shipped as merge_metadata.json alongside the weights.

Intended use

Research only. This checkpoint is part of a sweep designed to probe the safety / downstream-utility Pareto frontier; individual α values are NOT recommended for deployment without further evaluation.

Downloads last month
36
Safetensors
Model size
7B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for wvnvwn/Mistral-7B-Instruct-v0.3-gsm8k-ties-d0p2-lam1p0-a0p9-v1

Finetuned
(503)
this model

Paper for wvnvwn/Mistral-7B-Instruct-v0.3-gsm8k-ties-d0p2-lam1p0-a0p9-v1