--- license: apache-2.0 base_model: mistralai/Magistral-Small-2509 tags: - magistral - mistral - reasoning - chat - text-only ---
Send me your support to help me feed the data beast! also taking comissions for universe specific models
Support on Ko-fiThis model is a specialized, text-only version of Mistral AI's powerful Magistral Small 1.2. It was derived from the official release by carefully removing the vision encoder and related multimodal layers. The result is a more streamlined and efficient 24B parameter model that excels at text-based tasks, retaining the exceptional reasoning capabilities of its progenitor.
Built upon Mistral Small 3.2 (2506), this model underwent Supervised Fine-Tuning (SFT) from Magistral Medium traces and Reinforcement Learning (RL) on top, inheriting a deep capacity for logical deduction and problem-solving. By focusing exclusively on text, this version offers a smaller footprint and potentially faster inference for applications where vision is not required.
This model uses a special reasoning format. There are two methods to enable it: the official format designed by MistralAI, and a legacy format that works due to the base model's pre-training. The correct method depends on your backend software (e.g., llama.cpp, Kobold.cpp).
[THINK] (Recommended for llama.cpp)This is the official instruction format from MistralAI and is the recommended method. It is confirmed to work with backends like llama.cpp (with specific flags) and mistral-common.
--special and --jinja arguments enabled.[THINK] and [/THINK] tags.[THINK]./think anywhere in your system prompt to activate the reasoning module.Add the following to your system prompt to guide the model's output structure:
First draft your thinking process (inner monologue) until you arrive at a response. You must use the /think keyword. Format your response using Markdown, and use LaTeX for any mathematical equations. Write both your thoughts and the response in the same language as the input. Your thinking process must follow the template below:[THINK]Your thoughts or/and draft, like working through an exercise on scratch paper. Be as casual and as long as you want until you are confident to generate the response. Use the same language as the input.[/THINK]Here, provide a self-contained response.
For a complete SillyTavern configuration, you can download and import this JSON file:
Download SillyTavern JSON →<think> (For Kobold.cpp & TabbyAPI)This format is not official but is highly effective with backends like Kobold.cpp and TabbyAPI. It works because the model's predecessor was trained on these angle-bracket tags, and the model inherits this behavior.
<think> and </think> tags.<think>.For best results, you must use the recommended system prompt and response format. The model uses special [THINK] and [/THINK] tokens to encapsulate its reasoning process before delivering the final answer.
First draft your thinking process (inner monologue) until you arrive at a response. Format your response using Markdown, and use LaTeX for any mathematical equations. Write both your thoughts and the response in the same language as the input.
Your thinking process must follow the template below:[THINK]Your thoughts or/and draft, like working through an exercise on scratch paper. Be as casual and as long as you want until you are confident to generate the response. Use the same language as the input.[/THINK]Here, provide a self-contained response.Important: The [THINK] and [/THINK] tags are special tokens and must be encoded as such. Please ensure you are using a recent version of a library that supports the Magistral chat template, such as mistral-common.
The following benchmarks are from the official release of the original multimodal Magistral Small 1.2. As this version is a direct derivative with only vision components removed, performance on these text-based reasoning benchmarks is expected to be identical.
| Model | AIME24 pass@1 | AIME25 pass@1 | GPQA Diamond | Livecodebench (v5) |
|---|---|---|---|---|
| Magistral Medium 1.2 | 91.82% | 83.48% | 76.26% | 75.00% |
| Magistral Small 1.2 | 86.14% | 77.34% | 70.07% | 70.88% |
| Magistral Small 1.1 | 70.52% | 62.03% | 65.78% | 59.17% |