dadmobile committed on
Commit 893b966 · verified · 1 Parent(s): c1380ce

Added custom GGUF loader note to README.md

Files changed (1)
  1. README.md +9 -2
README.md CHANGED
@@ -9,9 +9,16 @@ tags: [text-to-image, diffusion, flow-matching, quantization, gguf, q4_k, ideogr
 
  # Ideogram 4 — GGUF Q4_K (Transformer Lab)
 
- A **GGUF Q4_K** (4.5 bits/weight) quantization of the Ideogram 4 DiT, for consumer GPUs.
+ A **GGUF Q4_K** (4.5 bits/weight) quantization of the Ideogram 4 DiT, sized for consumer GPUs.
 
- > **Note:** this checkpoint is the **quantized DiT only** (both CFG branches). To run it you also need the **Qwen3-VL text encoder and VAE** from the base repo [`ideogram-ai/ideogram-4-fp8`](https://huggingface.co/ideogram-ai/ideogram-4-fp8) and the custom inference code at [`github.com/ideogram-oss/ideogram4`](https://github.com/ideogram-oss/ideogram4). The quantization recipe and loader are included **in this repo** (`recipe-q4_k.json`, `gguf_loader.py`).
+ > :warning: **Not a llama.cpp / stable-diffusion.cpp file.** Despite the `.gguf` extension, this
+ > loads **only** via the included PyTorch `gguf_loader.py` + the `ideogram4` pipeline. It is
+ > **not** compatible with llama.cpp, stable-diffusion.cpp, Ollama, etc.
+
+ > ℹ️ **Quantized DiT only.** This checkpoint is the DiT (both CFG branches). To generate you
+ > also need the **Qwen3-VL text encoder and VAE** from the base repo [`ideogram-ai/ideogram-4-fp8`](https://huggingface.co/ideogram-ai/ideogram-4-fp8)
+ > and the custom inference code at [`github.com/ideogram-oss/ideogram4`](https://github.com/ideogram-oss/ideogram4).
+ > The quantization recipe and loader are included **in this repo** (`recipe-q4_k.json`, `gguf_loader.py`).
 
  ## Why this one
  Q4_K is the **Pareto winner** on the quality-vs-memory frontier: at **10.4 GB** (the same