How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="ibivibiv/orthorus-125b-v2")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("ibivibiv/orthorus-125b-v2")
model = AutoModelForCausalLM.from_pretrained("ibivibiv/orthorus-125b-v2")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
Quick Links

img

Model Card for Orthorus 125B v2

Orthorus is a MOE of Fine Tuned Mistral models.

Model Details

Model Description

This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.

Uses

This model is geared toward general knowledge performance and should perform acceptably across mulitple tasks. Refer to the leaderboard evaluation for specific strengths/weaknesses.

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 77.22
AI2 Reasoning Challenge (25-Shot) 73.63
HellaSwag (10-Shot) 89.04
MMLU (5-Shot) 75.99
TruthfulQA (0-shot) 70.19
Winogrande (5-shot) 85.48
GSM8k (5-shot) 68.99
Downloads last month
96
Safetensors
Model size
125B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ibivibiv/orthorus-125b-v2

Quantizations
2 models

Evaluation results