--- license: cc-by-nc-4.0 language: [multilingual] tags: [embeddings, gguf, ggml, text-embeddings, qwen3, crispembed, ollama] pipeline_tag: feature-extraction base_model: jinaai/jina-embeddings-v5-text-small --- # jina-v5-small GGUF GGUF format of [jinaai/jina-embeddings-v5-text-small](https://huggingface.co/jinaai/jina-embeddings-v5-text-small) for use with [CrispEmbed](https://github.com/CrispStrobe/CrispEmbed) and [Ollama](https://ollama.com). ## Files | File | Quantization | Size | |------|-------------|------| | [jina-v5-small-q4_k.gguf](https://huggingface.co/cstr/jina-v5-small-GGUF/resolve/main/jina-v5-small-q4_k.gguf) | Q4_K | 0 MB | | [jina-v5-small-q8_0.gguf](https://huggingface.co/cstr/jina-v5-small-GGUF/resolve/main/jina-v5-small-q8_0.gguf) | Q8_0 | 0 MB | | [jina-v5-small.gguf](https://huggingface.co/cstr/jina-v5-small-GGUF/resolve/main/jina-v5-small.gguf) | F32 | 0 MB | **Recommended:** Q8_0 for quality (cos vs HF: L2=1.0), Q4_K for size (L2=1.0). ## Quick Start ### CrispEmbed ```bash ./crispembed -m jina-v5-small "Hello world" ./crispembed-server -m jina-v5-small --port 8080 ``` ### Ollama (with [CrispStrobe fork](https://github.com/CrispStrobe/ollama/tree/feat/xlmr-embedding)) ```bash echo "FROM jina-v5-small-q8_0.gguf" > Modelfile ollama create jina-v5-small -f Modelfile curl http://localhost:11434/api/embed -d '{"model":"jina-v5-small","input":["Hello world"]}' ``` ### Python (CrispEmbed) ```python from crispembed import CrispEmbed model = CrispEmbed("jina-v5-small-q8_0.gguf") vectors = model.encode(["Hello world", "Goodbye world"]) ``` ## Model Details | Property | Value | |----------|-------| | Architecture | Qwen3 | | Parameters | 600M | | Embedding Dimension | 1024 | | Layers | 28 | | Pooling | last-token | | Tokenizer | BPE | | Language | multilingual | | Q8_0 vs HuggingFace | L2=1.0 | | Q4_K vs HuggingFace | L2=1.0 | ## Server API CrispEmbed server supports four API dialects: - `POST /embed` -- native - `POST /v1/embeddings` -- OpenAI-compatible - `POST /api/embed` -- Ollama-compatible - `POST /api/embeddings` -- Ollama legacy ## Credits - Original model: [jinaai/jina-embeddings-v5-text-small](https://huggingface.co/jinaai/jina-embeddings-v5-text-small) - Inference: [CrispEmbed](https://github.com/CrispStrobe/CrispEmbed) (MIT, ggml-based)