How to use from
SGLang
Install from pip and serve model
# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "ibivibiv/orthorus-125b-v2" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "ibivibiv/orthorus-125b-v2",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'
Use Docker images
docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "ibivibiv/orthorus-125b-v2" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "ibivibiv/orthorus-125b-v2",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'
Quick Links

img

Model Card for Orthorus 125B v2

Orthorus is a MOE of Fine Tuned Mistral models.

Model Details

Model Description

This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.

Uses

This model is geared toward general knowledge performance and should perform acceptably across mulitple tasks. Refer to the leaderboard evaluation for specific strengths/weaknesses.

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 77.22
AI2 Reasoning Challenge (25-Shot) 73.63
HellaSwag (10-Shot) 89.04
MMLU (5-Shot) 75.99
TruthfulQA (0-shot) 70.19
Winogrande (5-shot) 85.48
GSM8k (5-shot) 68.99
Downloads last month
93
Safetensors
Model size
125B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ibivibiv/orthorus-125b-v2

Quantizations
2 models

Evaluation results