---
license: mit
library_name: transformers
base_model: unsloth/gpt-oss-20b
tags:
- gpt-oss
- lora
- unsloth
- text-generation
- instruction-following
- multilingual
datasets:
- HuggingFaceH4/Multilingual-Thinking
pipeline_tag: text-generation
language:
- en
---

# GPT-OSS-20B Fine-Tuned

A fine-tuned **gpt-oss-20b** model optimized for *efficient text generation, multilingual conversational tasks, and instruction-following*.

---

## Overview

| Item | Details |
|---|---|
| **Base checkpoint** | `unsloth/gpt-oss-20b` |
| **Fine-tune method** | LoRA (PEFT) with Unsloth |
| **Training run** | 30 steps • Multilingual-Thinking dataset |
| **Trainable params** | [To be calculated, if available] |
| **Loss** | [Loss metrics unavailable] |
| **Hardware** | [Hardware details unavailable] |
| **License** | MIT License (Base model: Refer to gpt-oss-20b license) |
| **Intended use** | Educational, research, and chat-based applications |

---

## Datasets

| Dataset | Size | Focus |
|---|---|---|
| `HuggingFaceH4/Multilingual-Thinking` | [Size unavailable] | Multilingual reasoning and conversational tasks |

The dataset was wrapped with the **chat template** before training.

---

## Installation

To use this model, install the required dependencies:

```bash
pip install torch>=2.8.0 triton>=3.4.0 transformers>=4.55.3 bitsandbytes unsloth
```

## Usage

### Loading the Model

```python
from unsloth import FastLanguageModel
import torch

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/gpt-oss-20b",
    max_seq_length=1024,
    dtype=torch.float16,
    load_in_4bit=True,
)
```

### Fine-Tuning with LoRA

```python
model = FastLanguageModel.get_peft_model(
    model,
    r=8,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj", "gate_proj", "up_proj", "down_proj"],
    lora_alpha=16,
    lora_dropout=0,
    bias="none",
    use_gradient_checkpointing="unsloth",
)
```

### Inference

```python
from transformers import TextStreamer

messages = [
    {"role": "user", "content": "Solve x^5 + 3x^4 - 10 = 3."},
]
inputs = tokenizer.apply_chat_template(messages, return_tensors="pt", return_dict=True).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=512, streamer=TextStreamer(tokenizer))
```

---

## Training Details

### Training Configuration

- **Batch Size**: 1
- **Gradient Accumulation Steps**: 4
- **Learning Rate**: 2e-4
- **Optimizer**: adamw_8bit
- **Warmup Steps**: 5
- **Max Steps**: 30

---

## Responsible Use

- **Bias**: The model may reflect biases in the training data. Users should evaluate outputs for fairness.
- **Misuse**: Avoid using for harmful or misleading content generation.
- **Limitations**: Optimized for efficiency with 4-bit quantization, which may introduce minor accuracy trade-offs. Limited to 1024-token sequences.
- **Disclaimer**: Not intended for critical decision-making. The author and base-model creators accept no liability for misuse or errors.

---

## Acknowledgements

- The unsloth library for enabling efficient fine-tuning.
- Hugging Face for providing the base model and training infrastructure.

---