GRM-2.6-Plus-GGUF / README.md
morikomorizz's picture
Update README.md
4ac4626
|
Raw
History Blame
1.85 kB
---
license: apache-2.0
base_model: OrionLLM/GRM-2.6-Plus
tags:
- gguf
- conversational
- reasoning
- qwen3_5
pipeline_tag: text-generation
---
# GRM-2.6-Plus (27B) - GGUF
## Overview
This repository contains the **GGUF** quantized files for **[OrionLLM/GRM-2.6-Plus](https://huggingface.co/OrionLLM/GRM-2.6-Plus)**.
GRM-2.6-Plus is a highly capable **27B-parameter reasoning model** built on the Qwen3.6 architecture. It is specifically engineered for **general-purpose AI** and optimized for **difficult, high-complexity tasks**.
- **Original Model:** [OrionLLM/GRM-2.6-Plus](https://huggingface.co/OrionLLM/GRM-2.6-Plus)
- **Architecture:** Qwen3.6-27B
- **License:** Apache 2.0
## Key Capabilities
- **Elite-Level Reasoning for Hard Tasks:** GRM-2.6-Plus is optimized to handle difficult reasoning workloads with clarity, consistency, and strong step-by-step problem-solving ability.
- **High Performance for Its Size:** With **27B parameters**, the model is designed to deliver excellent capability relative to its scale, balancing strong intelligence with practical deployment.
- **Advanced Coding and Agentic Use:** GRM-2.6-Plus is well suited for code generation, structured problem-solving, tool-style workflows, and local agentic applications.
- **Optimized for Practical Deployment:** The model aims to remain efficient and usable across capable consumer and workstation hardware while offering strong performance for advanced tasks.
## How to Use
These GGUF files are fully compatible with [llama.cpp](https://github.com/ggml-org/llama.cpp) and popular graphical interfaces like **LM Studio**, **Ollama**.
### Example using `llama.cpp` CLI:
```bash
./llama-cli -m GRM-2.6-Plus-Q8_0.gguf \
-p "System: You are a helpful assistant.\nUser: Create a calculator in a single HTML file backwards.\nAssistant:" \
-n 2048 -c 8192