---
base_model:
- openbmb/MiniCPM5-1B
license: apache-2.0
language:
- en
tags:
- minicpm
- minicpm5
- llama
- text-generation
- long-context
- tool-calling
- on-device
- edge-ai
- gguf
---

# MiniCPM5-1B-i1-GGUF

This repository contains GGUF format quantizations of the model openbmb/MiniCPM5-1B.

These models were generated using llama.cpp with an Importance Matrix (imatrix) - https://gist.github.com/bartowski1182/82ae9b520227f57d79ba04add13d0d0d

## Models Available

The collection includes a total of 26 files:

* Baseline Formats: BF16, FP16, and FP32
* Quantization Types: 23 different levels including IQ1_S, IQ2_XXS, Q4_K_M, and Q8_0
* Importance Matrix: `imatrix.gguf` is included for reference or custom use

# NOTE

Quantization below 4 bits may result in unacceptably low quality, particularly for a 1B parameter model, which is inherently more sensitive to such compression.
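
As a rough illustration of the note on sub-4-bit quantization, the sketch below filters quantization type names by the nominal bit width embedded in the name. The helper names `nominal_bits` and `at_least_bits` are hypothetical and not part of llama.cpp or this repository, and the nominal number is only a coarse proxy for the actual bits per weight.

```python
import re
from typing import List, Optional

# Hypothetical helper (not a llama.cpp API): matches names such as
# "IQ1_S", "Q4_K_M", or "Q8_0" and captures the number after "Q".
_QUANT_RE = re.compile(r"^I?Q(\d+)_")


def nominal_bits(quant_name: str) -> Optional[int]:
    """Return the nominal bit width in a quantization name, or None.

    Baseline formats (BF16, FP16, FP32) do not match the pattern and
    return None, because they are not quantization types in this sense.
    """
    match = _QUANT_RE.match(quant_name.upper())
    return int(match.group(1)) if match else None


def at_least_bits(quant_names: List[str], min_bits: int = 4) -> List[str]:
    """Keep only the names whose nominal bit width is >= min_bits."""
    kept = []
    for name in quant_names:
        bits = nominal_bits(name)
        if bits is not None and bits >= min_bits:
            kept.append(name)
    return kept


# The four quantization types named in this README.
names = ["IQ1_S", "IQ2_XXS", "Q4_K_M", "Q8_0"]
recommended = at_least_bits(names)
print(recommended)
```

The 4-bit default mirrors the threshold in the note; lowering `min_bits` is possible when file size matters more than output quality.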