How to use from
Unsloth Studio
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for Entz/gpt-oss-20b-ck-MXFP4 to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for Entz/gpt-oss-20b-ck-MXFP4 to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required
# Open https://huggingface.co/spaces/unsloth/studio in your browser
# Search for Entz/gpt-oss-20b-ck-MXFP4 to start chatting
Quick Links

gpt-oss-20b-ck-MXFP4

This is a test fine-tune of the base model unsloth/gpt-oss-20b. It was fine-tuned using Unsloth for efficiency on a small dataset as an experimental setup.

Model Details

  • Base Model: unsloth/gpt-oss-20b (MXFP4 quantized)
  • Fine-Tuning Method: QLoRA with rank=64, targeting MoE layers
  • Training Epochs: 6
  • Dataset: Small custom dataset (~4,000 examples)
  • Max Sequence Length: 8192
  • Optimizer: AdamW 8-bit
  • Learning Rate: 1e-4

The model is provided in MXFP4 GGUF format for compatibility with llama.cpp, Ollama, or LM Studio.

Usage

Load with Unsloth or transformers for inference:

from unsloth import FastLanguageModel
model, tokenizer = FastLanguageModel.from_pretrained("Entz/gpt-oss-20b-ck-MXFP4")
Downloads last month
2
GGUF
Model size
21B params
Architecture
gpt-oss
Hardware compatibility
Log In to add your hardware

We're not able to determine the quantization variants.

Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for Entz/gpt-oss-20b-ck-MXFP4

Quantized
(21)
this model