---
license: mit
language: en
datasets:
- SetFit/amazon_reviews_multi_en
metrics:
- accuracy
pipeline_tag: text-classification
tags:
- sentiment-analysis
- roberta
- multi-class-classification
---

# RoBERTa Fine-tuned on Amazon Reviews (5-Star Rating)

## Model Description

This model is a fine-tuned version of `roberta-base` for 5-class sentiment classification, predicting star ratings (1-5) from Amazon product reviews.

## Comparison with DistilBERT

This model was trained as part of a model comparison study:

| Model | Parameters | Accuracy | Off-by-one Accuracy | Inference Speed |
|-------|------------|----------|---------------------|-----------------|
| DistilBERT | 67M | 54.95% | 92.45% | 1.83x faster |
| **RoBERTa** | **125M** | **59.90%** | **95.10%** | Baseline |

RoBERTa provides ~5 percentage points higher accuracy at the cost of slower inference.

## Training Data

- **Dataset**: SetFit/amazon_reviews_multi_en
- **Train samples**: 20,000 (subset)
- **Test samples**: 2,000 (subset)
- **Classes**: 1 star, 2 stars, 3 stars, 4 stars, 5 stars

## Training Procedure

- **Base model**: roberta-base
- **Epochs**: 3
- **Batch size**: 16
- **Learning rate**: 2e-5
- **Max sequence length**: 256

## Usage
```python
from transformers import pipeline

classifier = pipeline("text-classification", model="Nav772/roberta-amazon-reviews-5star")
result = classifier("This product exceeded my expectations! Great quality.")
print(result)
```

## When to Use This Model

- Choose **RoBERTa** when accuracy is the priority and latency is less critical
- Choose **DistilBERT** when you need faster inference or have resource constraints

## Demo

Try the model comparison demo: [sentiment-model-comparison](https://huggingface.co/spaces/Nav772/sentiment-model-comparison)

## Limitations

- Trained on Amazon product reviews; may not generalize to other review domains
- Adjacent star ratings (e.g., 2 vs 3 stars) are inherently difficult to distinguish
- English language only