--- license: apache-2.0 base_model: OrionLLM/GRM-2.6-Plus tags: - gguf - conversational - reasoning - qwen3_5 pipeline_tag: text-generation --- # GRM-2.6-Plus (27B) - GGUF ## Overview This repository contains the **GGUF** quantized files for **[OrionLLM/GRM-2.6-Plus](https://huggingface.co/OrionLLM/GRM-2.6-Plus)**. GRM-2.6-Plus is a highly capable **27B-parameter reasoning model** built on the Qwen3.6 architecture. It is specifically engineered for **general-purpose AI** and optimized for **difficult, high-complexity tasks**. - **Original Model:** [OrionLLM/GRM-2.6-Plus](https://huggingface.co/OrionLLM/GRM-2.6-Plus) - **Architecture:** Qwen3.6-27B - **License:** Apache 2.0 ## Key Capabilities - **Elite-Level Reasoning for Hard Tasks:** GRM-2.6-Plus is optimized to handle difficult reasoning workloads with clarity, consistency, and strong step-by-step problem-solving ability. - **High Performance for Its Size:** With **27B parameters**, the model is designed to deliver excellent capability relative to its scale, balancing strong intelligence with practical deployment. - **Advanced Coding and Agentic Use:** GRM-2.6-Plus is well suited for code generation, structured problem-solving, tool-style workflows, and local agentic applications. - **Optimized for Practical Deployment:** The model aims to remain efficient and usable across capable consumer and workstation hardware while offering strong performance for advanced tasks. ## How to Use These GGUF files are fully compatible with [llama.cpp](https://github.com/ggml-org/llama.cpp) and popular graphical interfaces like **LM Studio**, **Ollama**. ### Example using `llama.cpp` CLI: ```bash ./llama-cli -m GRM-2.6-Plus-Q8_0.gguf \ -p "System: You are a helpful assistant.\nUser: Create a calculator in a single HTML file backwards.\nAssistant:" \ -n 2048 -c 8192