openai/gsm8k
Benchmark • Updated • 17.6k • 913k • 1.38k
Model: XformAI-india/qwen-0.6b-reasoning
Base Model: Qwen/Qwen3-0.6B
Architecture: Transformer decoder (GPT-style)
Fine-Tuned By: XformAI
Release Date: May 2025
License: MIT
qwen-0.6b-reasoning is a compact transformer model fine-tuned for reasoning, logic, and analytical thinking.
Despite its size, it demonstrates strong performance across:
Fine-tuned on a curated instruction-style dataset focused on multi-step reasoning.
| Category | Detail |
|---|---|
| Base Model | Qwen 0.6B |
| Target Objective | Reasoning, logic, CoT |
| Fine-Tuning Type | Instruction |
| Optimizer | AdamW (LoRA tuning) |
| Precision | bfloat16 |
| Epochs | 2 |
| Max Tokens | 2048 |
from transformers import AutoTokenizer, AutoModelForCausalLM
model = AutoModelForCausalLM.from_pretrained("XformAI-india/qwen-0.6b-reasoning")
tokenizer = AutoTokenizer.from_pretrained("XformAI-india/qwen-0.6b-reasoning")
prompt = "A farmer has 17 sheep. All but 9 run away. How many are left?"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=100)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))