Text Generation
GGUF
English
llama.cpp
llama-cpp
qwen3.5
saber
refusal-shaping
abliteration
imatrix
conversational
Instructions to use GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF with llama-cpp-python:
# !pip install llama-cpp-python
from llama_cpp import Llama

llm = Llama.from_pretrained(
    repo_id="GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF",
    filename="Ornstein-Hermes-3.6-27b-SABER-IQ2_M.gguf",
)
llm.create_chat_completion(
    messages=[
        {"role": "user", "content": "What is the capital of France?"}
    ]
)
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- llama.cpp
How to use GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF with llama.cpp:
Install (macOS, Linux)
curl -LsSf https://llama.app/install.sh | sh
# Start a local OpenAI-compatible server with a web UI:
llama serve -hf GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF:Q4_K_M
# Run inference directly in the terminal:
llama cli -hf GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF:Q4_K_M
Install from WinGet (Windows)
winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama serve -hf GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF:Q4_K_M
# Run inference directly in the terminal:
llama cli -hf GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF:Q4_K_M
Use pre-built binary
# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF:Q4_K_M
# Run inference directly in the terminal:
./llama-cli -hf GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF:Q4_K_M
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF:Q4_K_M
# Run inference directly in the terminal:
./build/bin/llama-cli -hf GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF:Q4_K_M
Use Docker
docker model run hf.co/GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF:Q4_K_M
- LM Studio
- Jan
- vLLM
How to use GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF with vLLM:
Install from pip and serve model
# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF",
    "messages": [
      { "role": "user", "content": "What is the capital of France?" }
    ]
  }'
Use Docker
docker model run hf.co/GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF:Q4_K_M
- Ollama
How to use GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF with Ollama:
ollama run hf.co/GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF:Q4_K_M
- Unsloth Studio
How to use GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required
# Open https://huggingface.co/spaces/unsloth/studio in your browser
# Search for GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF to start chatting
- Pi
How to use GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF with Pi:
Start the llama.cpp server
# Install llama.cpp:
brew install llama.cpp
# Start a local OpenAI-compatible server:
llama serve -hf GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF:Q4_K_M
Configure the model in Pi
# Install Pi:
npm install -g @mariozechner/pi-coding-agent
# Add to ~/.pi/agent/models.json:
{
  "providers": {
    "llama-cpp": {
      "baseUrl": "http://localhost:8080/v1",
      "api": "openai-completions",
      "apiKey": "none",
      "models": [
        { "id": "GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF:Q4_K_M" }
      ]
    }
  }
}
Run Pi
# Start Pi in your project directory:
pi
- Hermes Agent
How to use GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF with Hermes Agent:
Start the llama.cpp server
# Install llama.cpp:
brew install llama.cpp
# Start a local OpenAI-compatible server:
llama serve -hf GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF:Q4_K_M
Configure Hermes
# Install Hermes:
curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash
hermes setup
# Point Hermes at the local server:
hermes config set model.provider custom
hermes config set model.base_url http://127.0.0.1:8080/v1
hermes config set model.default GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF:Q4_K_M
Run Hermes
hermes
- Atomic Chat
- Docker Model Runner
How to use GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF with Docker Model Runner:
docker model run hf.co/GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF:Q4_K_M
- Lemonade
How to use GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/
lemonade pull GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF:Q4_K_M
Run and chat with the model
lemonade run user.Ornstein-Hermes-3.6-27B-SABER-GGUF-Q4_K_M
List all available models
lemonade list
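Several of the options above (llama.cpp, vLLM) start a local server that speaks the OpenAI-compatible chat completions API. The sketch below shows one way to call such a server from Python using only the standard library. The helper names (`build_chat_request`, `extract_reply`, `chat`) are hypothetical, and the base URL and model id must match whatever your server reports; the page above uses `http://localhost:8080/v1` for llama.cpp and `http://localhost:8000/v1` for vLLM.

```python
import json
import urllib.request


def build_chat_request(base_url: str, model: str, prompt: str) -> urllib.request.Request:
    """Build a POST request for an OpenAI-compatible /chat/completions endpoint."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def extract_reply(response_body: dict) -> str:
    """Return the assistant text from a chat completions response body."""
    return response_body["choices"][0]["message"]["content"]


def chat(base_url: str, model: str, prompt: str, timeout: float = 120.0) -> str:
    """Send one user message to a running server and return the reply text."""
    request = build_chat_request(base_url, model, prompt)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return extract_reply(json.load(response))
```

With a llama.cpp server already running, a call such as `chat("http://localhost:8080/v1", "GestaltLabs/Ornstein-Hermes-3.6-27B-SABER-GGUF:Q4_K_M", "What is the capital of France?")` should return the model's answer as a string. Building the request separately from sending it keeps the payload easy to inspect without a live server.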
{
  "mean_kld": 11.221576690673828,
  "median_kld": 11.258872032165527,
  "max_kld": 12.6110258102417,
  "n_prompts": 149,
  "n_tokens_scored": 3347,
  "per_prompt_kld": [
    11.088411331176758,
    11.219141006469727,
    11.621265411376953,
    12.6110258102417,
    10.68524169921875,
    11.140201568603516,
    11.520444869995117,
    10.960616111755371,
    10.777615547180176,
    11.410308837890625,
    11.258872032165527,
    11.032586097717285,
    11.258166313171387,
    11.031803131103516,
    10.884684562683105,
    11.323512077331543,
    11.020181655883789,
    10.976235389709473,
    11.299304962158203,
    10.990053176879883,
    11.231590270996094,
    11.41917610168457,
    10.79571533203125,
    11.284976959228516,
    10.963725090026855,
    11.233430862426758,
    10.440763473510742,
    11.064282417297363,
    10.752723693847656,
    11.36770248413086,
    11.30264663696289,
    11.03974723815918,
    10.517435073852539,
    11.509033203125,
    12.110434532165527,
    11.292105674743652,
    11.4240140914917,
    11.408430099487305,
    10.715821266174316,
    11.311901092529297,
    11.211177825927734,
    11.86925220489502,
    11.532733917236328,
    11.234238624572754,
    10.591565132141113,
    11.428839683532715,
    11.287821769714355,
    10.952550888061523,
    11.418365478515625,
    11.173288345336914,
    11.255884170532227,
    11.680008888244629,
    11.859249114990234,
    12.274761199951172,
    10.865030288696289,
    12.32365608215332,
    10.69661808013916,
    11.384410858154297,
    10.380537033081055,
    11.791696548461914,
    11.39106273651123,
    10.19461441040039,
    11.505685806274414,
    12.090253829956055,
    9.8773832321167,
    11.96424388885498,
    10.161200523376465,
    12.014456748962402,
    12.1090726852417,
    10.899617195129395,
    11.430069923400879,
    9.983293533325195,
    9.216832160949707,
    9.708819389343262,
    11.234479904174805,
    10.906900405883789,
    10.548215866088867,
    10.039212226867676,
    10.290567398071289,
    11.07107162475586,
    10.919650077819824,
    11.652368545532227,
    10.552318572998047,
    11.144087791442871,
    11.851831436157227,
    11.722827911376953,
    9.836979866027832,
    10.675317764282227,
    10.583539009094238,
    11.40699291229248,
    11.25241756439209,
    10.923582077026367,
    11.38624382019043,
    11.9610595703125,
    11.972453117370605,
    10.899657249450684,
    11.808647155761719,
    11.49081802368164,
    11.404881477355957,
    11.787206649780273,
    11.943115234375,
    11.409533500671387,
    11.602027893066406,
    11.717741012573242,
    11.358838081359863,
    11.941396713256836,
    11.313089370727539,
    11.705666542053223,
    10.940914154052734,
    11.295038223266602,
    11.552539825439453,
    11.37530517578125,
    10.844785690307617,
    12.011767387390137,
    11.631556510925293,
    11.210546493530273,
    11.712545394897461,
    11.320769309997559,
    11.7811279296875,
    11.945680618286133,
    11.9591064453125,
    12.281725883483887,
    12.5641508102417,
    10.741057395935059,
    12.088197708129883,
    11.727904319763184,
    11.99649715423584,
    10.931884765625,
    11.179726600646973,
    11.35831069946289,
    10.735390663146973,
    10.928388595581055,
    11.275750160217285,
    11.84463119506836,
    9.995723724365234,
    10.815656661987305,
    11.17290210723877,
    10.637384414672852,
    11.521681785583496,
    10.465771675109863,
    11.291878700256348,
    10.575798034667969,
    11.112957000732422,
    10.597251892089844,
    10.86774730682373,
    10.486213684082031,
    10.766515731811523,
    11.25387954711914,
    11.849738121032715
  ],
  "per_category_kld": {
    "harmless": 11.200447082519531,
    "arithmetic": 11.349655151367188,
    "logic": 10.713399887084961,
    "code": 11.511313438415527,
    "instruction": 11.607595443725586,
    "factual": 10.977678298950195
  },
  "top_divergent": [
    {
      "prompt": "Explain the theory of general relativity in simple terms.",
      "category": "harmless",
      "kld": 12.6110258102417
    },
    {
      "prompt": "Explain the concept of recursion in exactly 2 sentences.",
      "category": "instruction",
      "kld": 12.5641508102417
    },
    {
      "prompt": "If a shirt costs $24 after a 20% discount, what was the original price?",
      "category": "arithmetic",
      "kld": 12.32365608215332
    },
    {
      "prompt": "List the first 10 elements of the periodic table with their symbols.",
      "category": "instruction",
      "kld": 12.281725883483887
    },
    {
      "prompt": "A train travels 60 mph for 2.5 hours. How far does it go?",
      "category": "arithmetic",
      "kld": 12.274761199951172
    }
  ]
}
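The report above does not say how its KLD values were produced, so the following is only a sketch of how figures of this shape could be computed and checked. It assumes the standard discrete KL divergence in nats, KL(P || Q) = sum of p * ln(p / q), and that the summary fields are plain statistics over `per_prompt_kld`. The direction of the divergence, the log base, and the function names (`kl_divergence`, `summarize`, `top_divergent`) are assumptions, not details taken from the source.

```python
import math
import statistics


def kl_divergence(p, q, eps=1e-12):
    """KL(P || Q) in nats for two discrete distributions over the same vocabulary.

    Terms with p == 0 contribute nothing; q is floored at eps to avoid log(0).
    """
    return sum(
        pi * math.log(pi / max(qi, eps))
        for pi, qi in zip(p, q)
        if pi > 0.0
    )


def summarize(per_prompt_kld):
    """Recompute the summary fields from a list of per-prompt KLD values."""
    return {
        "mean_kld": statistics.fmean(per_prompt_kld),
        "median_kld": statistics.median(per_prompt_kld),
        "max_kld": max(per_prompt_kld),
        "n_prompts": len(per_prompt_kld),
    }


def top_divergent(prompts, per_prompt_kld, k=5):
    """Return the k prompts with the largest KLD, highest first."""
    ranked = sorted(zip(prompts, per_prompt_kld), key=lambda pair: pair[1], reverse=True)
    return [{"prompt": prompt, "kld": kld} for prompt, kld in ranked[:k]]
```

Loading the report with `json.load` and passing its `per_prompt_kld` list to `summarize` gives a quick consistency check against the stored `mean_kld`, `median_kld`, and `max_kld`. In the data above, the `max_kld` value also appears as the fourth per-prompt entry and as the first `top_divergent` entry, which is consistent with this reading.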