cstr committed on
Commit 3eabb58 · verified · 1 Parent(s): 3728af7

Add model card for mxbai-rerank-base-v1 GGUF

Files changed (1)
  1. README.md +77 -0
README.md ADDED
@@ -0,0 +1,77 @@
+ ---
+ license: apache-2.0
+ language: [en]
+ tags: [embeddings, gguf, ggml, text-embeddings, bert, crispembed]
+ pipeline_tag: feature-extraction
+ base_model: mixedbread-ai/mxbai-rerank-base-v1
+ ---
+
+ # mxbai-rerank-base-v1 GGUF
+
+ GGUF format of [mixedbread-ai/mxbai-rerank-base-v1](https://huggingface.co/mixedbread-ai/mxbai-rerank-base-v1) for use with [CrispEmbed](https://github.com/CrispStrobe/CrispEmbed).
+
+ MixedBread Rerank Base v1 is an English cross-encoder reranker that balances ranking quality and inference speed.
+
+ ## Files
+
+ | File | Quantization | Size |
+ |------|-------------|------|
+ | [mxbai-rerank-base-v1-q4_k.gguf](https://huggingface.co/cstr/mxbai-rerank-base-v1-GGUF/resolve/main/mxbai-rerank-base-v1-q4_k.gguf) | Q4_K | 149 MB |
+ | [mxbai-rerank-base-v1-q8_0.gguf](https://huggingface.co/cstr/mxbai-rerank-base-v1-GGUF/resolve/main/mxbai-rerank-base-v1-q8_0.gguf) | Q8_0 | 190 MB |
+ | [mxbai-rerank-base-v1.gguf](https://huggingface.co/cstr/mxbai-rerank-base-v1-GGUF/resolve/main/mxbai-rerank-base-v1.gguf) | F32 | 706 MB |
+
+
+ ## Quick Start
+
+ ```bash
+ # Download
+ huggingface-cli download cstr/mxbai-rerank-base-v1-GGUF mxbai-rerank-base-v1-q4_k.gguf --local-dir .
+
+ # Run with CrispEmbed
+ ./crispembed -m mxbai-rerank-base-v1-q4_k.gguf "Hello world"
+
+ # Or with auto-download
+ ./crispembed -m mxbai-rerank-base-v1 "Hello world"
+ ```
+
+ ## Model Details
+
+ | Property | Value |
+ |----------|-------|
+ | Architecture | BERT |
+ | Parameters | 86M |
+ | Embedding Dimension | 768 |
+ | Layers | 12 |
+ | Pooling | CLS |
+ | Tokenizer | WordPiece |
+ | Base Model | [mixedbread-ai/mxbai-rerank-base-v1](https://huggingface.co/mixedbread-ai/mxbai-rerank-base-v1) |
+
+ ## Verification
+
+ Outputs verified against HuggingFace sentence-transformers (cosine similarity >= 0.999 on test texts).
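
A check of this kind can be sketched as follows. This is a minimal illustration, not the script used for this model card: the two vectors are made-up placeholders standing in for a CrispEmbed output and a sentence-transformers reference for the same text, and `cosine_similarity` and `THRESHOLD` are hypothetical names.

```python
import math

THRESHOLD = 0.999  # tolerance quoted in the Verification section


def cosine_similarity(a, b):
    """Return the cosine similarity of two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Placeholder values only (assumption): real inputs would be the GGUF
# output and the reference output for the same test text.
gguf_vec = [0.12, -0.45, 0.88, 0.03]
ref_vec = [0.12, -0.45, 0.88, 0.031]

similarity = cosine_similarity(gguf_vec, ref_vec)
passed = similarity >= THRESHOLD
print(f"cosine similarity = {similarity:.6f}, pass = {passed}")
```

A threshold on cosine similarity tolerates small numerical differences, which is why it suits comparisons involving the quantized Q4_K and Q8_0 files.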
+
+ ## Usage with CrispEmbed
+
+ CrispEmbed is a lightweight C/C++ text embedding inference engine using ggml.
+ No Python runtime, no ONNX. Supports BERT, XLM-R, Qwen3, and Gemma3 architectures.
+
+ ```bash
+ # Build CrispEmbed
+ git clone https://github.com/CrispStrobe/CrispEmbed
+ cd CrispEmbed
+ cmake -S . -B build && cmake --build build -j
+
+ # Encode
+ ./build/crispembed -m mxbai-rerank-base-v1-q4_k.gguf "query text"
+
+ # Server mode
+ ./build/crispembed-server -m mxbai-rerank-base-v1-q4_k.gguf --port 8080
+ curl -X POST http://localhost:8080/v1/embeddings -H "Content-Type: application/json" \
+   -d '{"input": ["Hello world"], "model": "mxbai-rerank-base-v1"}'
+ ```
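
The same server request can be issued from Python using only the standard library. The sketch below mirrors the URL and JSON body of the curl call above; the function names `build_embeddings_request` and `embed` are hypothetical, the explicit `Content-Type` header is an assumption about what the server accepts, and the response is returned as decoded JSON without assuming any particular schema.

```python
import json
import urllib.request


def build_embeddings_request(texts, model="mxbai-rerank-base-v1",
                             base_url="http://localhost:8080"):
    """Build a POST request with the same URL and body as the curl call."""
    payload = json.dumps({"input": list(texts), "model": model}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/v1/embeddings",
        data=payload,
        headers={"Content-Type": "application/json"},  # assumed to be accepted
        method="POST",
    )


def embed(texts, **kwargs):
    """Send the request and return the decoded JSON body unchanged."""
    request = build_embeddings_request(texts, **kwargs)
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.loads(response.read().decode("utf-8"))


# Example (requires crispembed-server listening on port 8080):
# result = embed(["Hello world"])
```

Keeping request construction separate from the network call makes the payload easy to inspect or log before anything is sent.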
+
+ ## Credits
+
+ - Original model: [mixedbread-ai/mxbai-rerank-base-v1](https://huggingface.co/mixedbread-ai/mxbai-rerank-base-v1)
+ - Inference engine: [CrispEmbed](https://github.com/CrispStrobe/CrispEmbed) (ggml-based)
+ - Conversion: `convert-bert-embed-to-gguf.py`