Mistral-7B-Instruct-v0.3-gsm8k-ties-d0p2-lam1p0-a0p9-v1

Merged model produced by TIES-Merging (Yadav et al., NeurIPS 2023).

Reference: https://arxiv.org/abs/2306.01708

Construction

Base model: mistralai/Mistral-7B-Instruct-v0.3
Task vectors:
- safety = wvnvwn/Mistral-7B-Instruct-v0.3-hhrlhf-v1, alpha = 0.9
- downstream = wvnvwn/Mistral-7B-Instruct-v0.3-gsm8k-v1, alpha = 0.09999999999999998

Hyperparameters

Method: ties
Scaling coefficient (lambda): 1.0
TIES Trim density: 0.2
dtype: bfloat16

Merge formula

$\theta_\text{merged} = \theta_\text{base} + \lambda \cdot M\bigl(\alpha \cdot \tau_\text{safety} + (1-\alpha) \cdot \tau_\text{downstream}\bigr),$

where tau_i = theta_i_sft - theta_base, and M(.) is the identity for TA, the Trim-Elect-Disjoint operator for TIES, and DARE's drop-and-rescale preprocessing (composed with TA) for DARE.

Reproduction

Produced by the GTM merge_cli (see commit history of the GTM repo). Full metadata is shipped as merge_metadata.json alongside the weights.

Intended use

Research only. This checkpoint is part of a sweep designed to probe the safety / downstream-utility Pareto frontier; individual α values are NOT recommended for deployment without further evaluation.

Downloads last month: 36

Safetensors

Model size

7B params

Tensor type

BF16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for wvnvwn/Mistral-7B-Instruct-v0.3-gsm8k-ties-d0p2-lam1p0-a0p9-v1

Base model

mistralai/Mistral-7B-v0.3

Finetuned

mistralai/Mistral-7B-Instruct-v0.3

Finetuned

(503)

this model

Paper for wvnvwn/Mistral-7B-Instruct-v0.3-gsm8k-ties-d0p2-lam1p0-a0p9-v1

Resolving Interference When Merging Models

Paper • 2306.01708 • Published Jun 2, 2023 • 19