Instructions to use mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF", dtype="auto") - llama-cpp-python
How to use mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF", filename="DistilGPT-OSS-qwen3-4B.i1-IQ1_M.gguf", )
llm.create_chat_completion( messages = "No input example has been defined for this model task." )
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- llama.cpp
How to use mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF:Q4_K_M # Run inference directly in the terminal: llama-cli -hf mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF:Q4_K_M
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF:Q4_K_M # Run inference directly in the terminal: llama-cli -hf mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF:Q4_K_M
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF:Q4_K_M # Run inference directly in the terminal: ./llama-cli -hf mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF:Q4_K_M
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF:Q4_K_M # Run inference directly in the terminal: ./build/bin/llama-cli -hf mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF:Q4_K_M
Use Docker
docker model run hf.co/mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF:Q4_K_M
- LM Studio
- Jan
- Ollama
How to use mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF with Ollama:
ollama run hf.co/mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF:Q4_K_M
- Unsloth Studio
How to use mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF to start chatting
- Pi
How to use mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF with Pi:
Start the llama.cpp server
# Install llama.cpp: brew install llama.cpp # Start a local OpenAI-compatible server: llama-server -hf mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF:Q4_K_M
Configure the model in Pi
# Install Pi: npm install -g @mariozechner/pi-coding-agent # Add to ~/.pi/agent/models.json: { "providers": { "llama-cpp": { "baseUrl": "http://localhost:8080/v1", "api": "openai-completions", "apiKey": "none", "models": [ { "id": "mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF:Q4_K_M" } ] } } }Run Pi
# Start Pi in your project directory: pi
- Hermes Agent new
How to use mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF with Hermes Agent:
Start the llama.cpp server
# Install llama.cpp: brew install llama.cpp # Start a local OpenAI-compatible server: llama-server -hf mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF:Q4_K_M
Configure Hermes
# Install Hermes: curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash hermes setup # Point Hermes at the local server: hermes config set model.provider custom hermes config set model.base_url http://127.0.0.1:8080/v1 hermes config set model.default mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF:Q4_K_M
Run Hermes
hermes
- Docker Model Runner
How to use mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF with Docker Model Runner:
docker model run hf.co/mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF:Q4_K_M
- Lemonade
How to use mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull mradermacher/DistilGPT-OSS-qwen3-4B-i1-GGUF:Q4_K_M
Run and chat with the model
lemonade run user.DistilGPT-OSS-qwen3-4B-i1-GGUF-Q4_K_M
List all available models
lemonade list
uploaded from rich1
Browse files- .gitattributes +24 -0
- DistilGPT-OSS-qwen3-4B.i1-IQ1_M.gguf +3 -0
- DistilGPT-OSS-qwen3-4B.i1-IQ1_S.gguf +3 -0
- DistilGPT-OSS-qwen3-4B.i1-IQ2_M.gguf +3 -0
- DistilGPT-OSS-qwen3-4B.i1-IQ2_S.gguf +3 -0
- DistilGPT-OSS-qwen3-4B.i1-IQ2_XS.gguf +3 -0
- DistilGPT-OSS-qwen3-4B.i1-IQ2_XXS.gguf +3 -0
- DistilGPT-OSS-qwen3-4B.i1-IQ3_M.gguf +3 -0
- DistilGPT-OSS-qwen3-4B.i1-IQ3_S.gguf +3 -0
- DistilGPT-OSS-qwen3-4B.i1-IQ3_XS.gguf +3 -0
- DistilGPT-OSS-qwen3-4B.i1-IQ3_XXS.gguf +3 -0
- DistilGPT-OSS-qwen3-4B.i1-IQ4_NL.gguf +3 -0
- DistilGPT-OSS-qwen3-4B.i1-IQ4_XS.gguf +3 -0
- DistilGPT-OSS-qwen3-4B.i1-Q2_K.gguf +3 -0
- DistilGPT-OSS-qwen3-4B.i1-Q2_K_S.gguf +3 -0
- DistilGPT-OSS-qwen3-4B.i1-Q3_K_L.gguf +3 -0
- DistilGPT-OSS-qwen3-4B.i1-Q3_K_M.gguf +3 -0
- DistilGPT-OSS-qwen3-4B.i1-Q3_K_S.gguf +3 -0
- DistilGPT-OSS-qwen3-4B.i1-Q4_0.gguf +3 -0
- DistilGPT-OSS-qwen3-4B.i1-Q4_1.gguf +3 -0
- DistilGPT-OSS-qwen3-4B.i1-Q4_K_M.gguf +3 -0
- DistilGPT-OSS-qwen3-4B.i1-Q4_K_S.gguf +3 -0
- DistilGPT-OSS-qwen3-4B.i1-Q5_K_M.gguf +3 -0
- DistilGPT-OSS-qwen3-4B.i1-Q5_K_S.gguf +3 -0
- DistilGPT-OSS-qwen3-4B.i1-Q6_K.gguf +3 -0
.gitattributes
CHANGED
|
@@ -34,3 +34,27 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 36 |
DistilGPT-OSS-qwen3-4B.imatrix.gguf filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 36 |
DistilGPT-OSS-qwen3-4B.imatrix.gguf filter=lfs diff=lfs merge=lfs -text
|
| 37 |
+
DistilGPT-OSS-qwen3-4B.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text
|
| 38 |
+
DistilGPT-OSS-qwen3-4B.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
|
| 39 |
+
DistilGPT-OSS-qwen3-4B.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text
|
| 40 |
+
DistilGPT-OSS-qwen3-4B.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
|
| 41 |
+
DistilGPT-OSS-qwen3-4B.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
|
| 42 |
+
DistilGPT-OSS-qwen3-4B.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text
|
| 43 |
+
DistilGPT-OSS-qwen3-4B.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
|
| 44 |
+
DistilGPT-OSS-qwen3-4B.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text
|
| 45 |
+
DistilGPT-OSS-qwen3-4B.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
|
| 46 |
+
DistilGPT-OSS-qwen3-4B.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text
|
| 47 |
+
DistilGPT-OSS-qwen3-4B.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text
|
| 48 |
+
DistilGPT-OSS-qwen3-4B.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
|
| 49 |
+
DistilGPT-OSS-qwen3-4B.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
|
| 50 |
+
DistilGPT-OSS-qwen3-4B.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
| 51 |
+
DistilGPT-OSS-qwen3-4B.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
|
| 52 |
+
DistilGPT-OSS-qwen3-4B.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
| 53 |
+
DistilGPT-OSS-qwen3-4B.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
| 54 |
+
DistilGPT-OSS-qwen3-4B.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
|
| 55 |
+
DistilGPT-OSS-qwen3-4B.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
|
| 56 |
+
DistilGPT-OSS-qwen3-4B.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
| 57 |
+
DistilGPT-OSS-qwen3-4B.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
| 58 |
+
DistilGPT-OSS-qwen3-4B.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
| 59 |
+
DistilGPT-OSS-qwen3-4B.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
| 60 |
+
DistilGPT-OSS-qwen3-4B.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
|
DistilGPT-OSS-qwen3-4B.i1-IQ1_M.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:bde9e9ddd70e4a743c23554f2d53e56115286a413656c9d2568b0109a6af8dfd
|
| 3 |
+
size 1127016704
|
DistilGPT-OSS-qwen3-4B.i1-IQ1_S.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:11221de9755175130725e11d86ed301d87663925cb85186ee8507fe9c80f81d9
|
| 3 |
+
size 1055254784
|
DistilGPT-OSS-qwen3-4B.i1-IQ2_M.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a5af9dc684123f0539179e23531c22f9c42cc3f7191fadb5d29e55a62dbb7744
|
| 3 |
+
size 1512982784
|
DistilGPT-OSS-qwen3-4B.i1-IQ2_S.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:3c8e137e2d011c9be2d5575225ddff7a6c4c586f5a52d4ba244072aa81f59f8c
|
| 3 |
+
size 1417300224
|
DistilGPT-OSS-qwen3-4B.i1-IQ2_XS.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:891b1512c7b13040d5d84c9b67f5c6d8d35b45cd139954b5edaff1b25bf7aa74
|
| 3 |
+
size 1354098944
|
DistilGPT-OSS-qwen3-4B.i1-IQ2_XXS.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:c6c32c1b01b347a115fc69a40cfafb81c207a15a82fca29f18272222c4703381
|
| 3 |
+
size 1246619904
|
DistilGPT-OSS-qwen3-4B.i1-IQ3_M.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a0898d5a6ddf012a59182e0b4199ac24c254dfa1226372fb72e1587e682f2624
|
| 3 |
+
size 1962895104
|
DistilGPT-OSS-qwen3-4B.i1-IQ3_S.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:e5d938929137356dfa3ad6b15aadc87a5d30ae42eff5978a215fd1b5a3770a40
|
| 3 |
+
size 1899529984
|
DistilGPT-OSS-qwen3-4B.i1-IQ3_XS.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:fa4c1fda0a120a76e673f6f94230b993d0f008b34c4c1c44cbc0c24f7e4ae97f
|
| 3 |
+
size 1814374144
|
DistilGPT-OSS-qwen3-4B.i1-IQ3_XXS.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:5eb89443184cf0b85a843161691bacaacf501bf52a3b41f2f8f721ca7e7f19e9
|
| 3 |
+
size 1670187264
|
DistilGPT-OSS-qwen3-4B.i1-IQ4_NL.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:c8e1ce7bebb5e6ecfe8387da81a0f871a07a00757e05f2e55ac4cbe21e8bb685
|
| 3 |
+
size 2381342464
|
DistilGPT-OSS-qwen3-4B.i1-IQ4_XS.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:da687b3e783564562903987d376702e76dcb8aaa63629ff12d40c665c8868486
|
| 3 |
+
size 2270750464
|
DistilGPT-OSS-qwen3-4B.i1-Q2_K.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:eccd1273df7be7545571b4a08f3d45f4d8405d03d4a77c8f5b948c90ffd2e330
|
| 3 |
+
size 1669498624
|
DistilGPT-OSS-qwen3-4B.i1-Q2_K_S.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:dd234dca8a23a5105e6dc1062153c74484b1dc9cf724c56043bf6ab10d4a3db0
|
| 3 |
+
size 1563453184
|
DistilGPT-OSS-qwen3-4B.i1-Q3_K_L.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:0669ec4c309e0d10f8acff57a2d792c045d60b7933bfc89c3f4c4c32cb47973a
|
| 3 |
+
size 2239784704
|
DistilGPT-OSS-qwen3-4B.i1-Q3_K_M.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f3819e52f717c5c58d896056af7c293139502e42689079c14c73da4c2069280f
|
| 3 |
+
size 2075617024
|
DistilGPT-OSS-qwen3-4B.i1-Q3_K_S.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:4517ab646e58de263532a88e589d91e1df5d2a567c6df24c894414aad4f8dd41
|
| 3 |
+
size 1886996224
|
DistilGPT-OSS-qwen3-4B.i1-Q4_0.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:afaad2900fc205d3e52303813ebe628e8145122c10d0163f3c50ff02a738f55a
|
| 3 |
+
size 2375771904
|
DistilGPT-OSS-qwen3-4B.i1-Q4_1.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2bcb6fb67722342cc90268cd02f82c0be5ed9b757b886a1b5eda0222116c5a18
|
| 3 |
+
size 2596628224
|
DistilGPT-OSS-qwen3-4B.i1-Q4_K_M.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:8d89008f9c932075a4efa95317ad8ff22dc9ae77e8d4a6b60cbb47c54c7cae92
|
| 3 |
+
size 2497279744
|
DistilGPT-OSS-qwen3-4B.i1-Q4_K_S.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:bc46c0776221738496c1ffb317267ee26ce1f47cfa4d821efa1febe28f011222
|
| 3 |
+
size 2383308544
|
DistilGPT-OSS-qwen3-4B.i1-Q5_K_M.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2150c3fae144c4af27251e825a4f7bb22215a1fc771df5df94ae5f1ae8d71eab
|
| 3 |
+
size 2889512704
|
DistilGPT-OSS-qwen3-4B.i1-Q5_K_S.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:c9fca800c6ca4f10072471bd5f697315a257bf0a71bd738079c74de39374d66f
|
| 3 |
+
size 2823710464
|
DistilGPT-OSS-qwen3-4B.i1-Q6_K.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f06a67f5a50d57a11175a700f3cbf36be9dd223fa3877495ce751f8e63bdbe6d
|
| 3 |
+
size 3306260224
|