---
license: apache-2.0
base_model: OrionLLM/GRM-2.6-Plus
tags:
- gguf
- conversational
- reasoning
- qwen3_5
pipeline_tag: text-generation
---

# GRM-2.6-Plus (27B) - GGUF

## Overview
This repository contains the **GGUF** quantized files for **[OrionLLM/GRM-2.6-Plus](https://huggingface.co/OrionLLM/GRM-2.6-Plus)**. 

GRM-2.6-Plus is a highly capable **27B-parameter reasoning model** built on the Qwen3.6 architecture. It is specifically engineered for **general-purpose AI** and optimized for **difficult, high-complexity tasks**.

- **Original Model:** [OrionLLM/GRM-2.6-Plus](https://huggingface.co/OrionLLM/GRM-2.6-Plus)
- **Architecture:** Qwen3.6-27B
- **License:** Apache 2.0

## Key Capabilities

- **Elite-Level Reasoning for Hard Tasks:** GRM-2.6-Plus is optimized to handle difficult reasoning workloads with clarity, consistency, and strong step-by-step problem-solving ability.
- **High Performance for Its Size:** With **27B parameters**, the model is designed to deliver excellent capability relative to its scale, balancing strong intelligence with practical deployment.
- **Advanced Coding and Agentic Use:** GRM-2.6-Plus is well suited for code generation, structured problem-solving, tool-style workflows, and local agentic applications.
- **Optimized for Practical Deployment:** The model aims to remain efficient and usable across capable consumer and workstation hardware while offering strong performance for advanced tasks.

## How to Use
These GGUF files are fully compatible with [llama.cpp](https://github.com/ggml-org/llama.cpp) and popular graphical interfaces like **LM Studio**, **Ollama**.

### Example using `llama.cpp` CLI:
```bash
./llama-cli -m GRM-2.6-Plus-Q8_0.gguf \
  -p "System: You are a helpful assistant.\nUser: Create a calculator in a single HTML file backwards.\nAssistant:" \
  -n 2048 -c 8192