How to use from
Lemonade
Pull the model
# Download Lemonade from https://lemonade-server.ai/
lemonade pull gghfez/MiMo-V2.5-ikllama-GGUF:IQ3_S
Run and chat with the model
lemonade run user.MiMo-V2.5-ikllama-GGUF-IQ3_S
List all available models
lemonade list
Quick Links

Model

This is a text-only GGUF quantization of XiaomiMiMo/MiMo-V2.5 with unfused atten q,k,v for compatibility with ik_llama.cpp

Re-uploaded from AesSedai/MiMo-V2.5-GGUF prior to this change

Available quants: IQ4_XS and IQ3_S

From AesSedai's model card:

Quant Size Mixture PPL 1-(Mean PPL(Q)/PPL(base)) KLD
Q8_0 306.66 GiB (8.50 BPW) Unknown / TBD 5.134769 ยฑ 0.030261 +0.1230% 0.012010 ยฑ 0.000150
Q5_K_M 213.39 GiB (5.92 BPW) Q8_0 / Q5_K / Q5_K / Q6_K 5.147654 ยฑ 0.030377 +0.3743% 0.014752 ยฑ 0.000240
Q4_K_M 177.68 GiB (4.93 BPW) Q8_0 / Q4_K / Q4_K / Q5_K 5.202785 ยฑ 0.030828 +1.4493% 0.020631 ยฑ 0.000251
IQ4_XS 137.75 GiB (3.82 BPW) Q8_0 / IQ3_S / IQ3_S / IQ4_XS 5.272594 ยฑ 0.031193 +2.8105% 0.041508 ยฑ 0.000343
IQ3_S 106.31 GiB (2.95 BPW) Q6_K / IQ2_S / IQ2_S / IQ3_S 5.545001 ยฑ 0.033188 +8.1221% 0.092415 ยฑ 0.000600
Downloads last month
1,004
GGUF
Model size
309B params
Architecture
mimo2
Hardware compatibility
Log In to add your hardware

3-bit

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for gghfez/MiMo-V2.5-ikllama-GGUF

Quantized
(21)
this model