How to use from
Hermes Agent
Start the llama.cpp server
# Install llama.cpp:
brew install llama.cpp
# Start a local OpenAI-compatible server:
llama serve -hf Trilogix1/Anthropics-Fable-finetuned-in-Qwen3.6-35B:IQ4_NL
Configure Hermes
# Install Hermes:
curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash
hermes setup
# Point Hermes at the local server:
hermes config set model.provider custom
hermes config set model.base_url http://127.0.0.1:8080/v1
hermes config set model.default Trilogix1/Anthropics-Fable-finetuned-in-Qwen3.6-35B:IQ4_NL
Run Hermes
hermes
Quick Links

This is a converted and quantized version of Qwen 3.6 35B using Quanta and HugstonOne.


model size = 132219.74 MiB (32.00 BPW)


quant size = 21192.47 MiB (5.13 BPW)

Original weights here: https://huggingface.co/lordx64/Qwable-v1


Credit to Qwen team for the model creation

Credit to https://huggingface.co/lordx64/Qwable-v1 for the finetuning work

Credit to LLama.cpp team for the great contribution

Credit to Hugston Team for Converting, Quantizing, Testing, Benching and other...

Credit to Huggingface for the amazing hosting platform


The quantization in GGUF was made in f32 for better quality quants.
Here we show Quanta our convertor and Quantizer tool.

image

Downloads last month
793
GGUF
Model size
35B params
Architecture
qwen35moe
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Trilogix1/Anthropics-Fable-finetuned-in-Qwen3.6-35B

Dataset used to train Trilogix1/Anthropics-Fable-finetuned-in-Qwen3.6-35B