Update README.md

9bf1df8 verified 7 months ago

1.58 kB

license: apache-2.0
datasets:
  - prometheus-eval/Preference-Collection
language:
  - en
base_model:
  - unsloth/gemma-3-4b-it

📘 Model Summary

This model is a fine-tuned preference evaluation model based on unsloth/gemma-3-4b-it, trained on the prometheus-eval/Preference-Collection dataset.
It is designed to perform pairwise preference comparison and alignment evaluation tasks, inspired by the Prometheus framework (Kim et al., 2023).

🧮 Performance Benchmark

Model	Benchmark	Accuracy (%) (Pairwise)
🟦 This model	Preference Bench	95.6
🟨 Prometheus 2 (8×7B) (Kim et al., 2024)	Preference Bench	90.65

Highlights:

Outperforms Prometheus 2 (8×7B) by +4.95%, while being smaller in size.
Optimized for efficiency, alignment scoring, and feedback consistency.

🧾 License

This model is released under the Apache 2.0 License.
However, because it is derived from Google’s Gemma 3, your use of this model must also comply with the Gemma Terms of Use.

By using this model, you agree to:

Follow Google’s Gemma Model Terms of Use, including restrictions on misuse and redistribution.
Attribute Google as the original provider of the Gemma 3 base model.

For full details, see: https://ai.google.dev/gemma/terms