Text Generation
GGUF
Russian
English
How to use from
llama.cpp
Install from brew
brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf oblivious/Vikhr-7B-instruct-GGUF:
# Run inference directly in the terminal:
llama-cli -hf oblivious/Vikhr-7B-instruct-GGUF:
Install from WinGet (Windows)
winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf oblivious/Vikhr-7B-instruct-GGUF:
# Run inference directly in the terminal:
llama-cli -hf oblivious/Vikhr-7B-instruct-GGUF:
Use pre-built binary
# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf oblivious/Vikhr-7B-instruct-GGUF:
# Run inference directly in the terminal:
./llama-cli -hf oblivious/Vikhr-7B-instruct-GGUF:
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf oblivious/Vikhr-7B-instruct-GGUF:
# Run inference directly in the terminal:
./build/bin/llama-cli -hf oblivious/Vikhr-7B-instruct-GGUF:
Use Docker
docker model run hf.co/oblivious/Vikhr-7B-instruct-GGUF:
Quick Links

Vikhr-7B-instruct-GGUF

Description

This repository contains quantized GGUF format model files for Vikhr-7B-instruct_merged.

Default message template:

<s>{role}\n{content}</s>\n

Default system prompt:

Ты — Вихрь, русскоязычный автоматический ассистент. Ты разговариваешь с людьми и помогаешь им.

Example llama.cpp command

./main -m Vikhr-7B-instruct-Q4_K_M.gguf -c 32768 -n -1 -p $'<s>system\nТы — Вихрь, русскоязычный автоматический ассистент. Ты разговариваешь с людьми и помогаешь им.</s>\n<s>user\nКак тебя зовут?</s>\n<s>bot\n'
system
Ты — Вихрь, русскоязычный автоматический ассистент. Ты разговариваешь с людьми и помогаешь им.
user
Как тебя зовут?
bot
Я называю себя Вихрем! [end of text]

For other parameters and how to use them, please refer to the llama.cpp documentation

Downloads last month
246
GGUF
Model size
7B params
Architecture
llama
Hardware compatibility
Log In to add your hardware

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train oblivious/Vikhr-7B-instruct-GGUF