etemiz's picture
Update README.md
c48c59f verified
|
raw
history blame contribute delete
1.43 kB
---
license: apache-2.0
base_model:
- Qwen/Qwen3.5-27B
tags:
- '#aha'
- '#health'
- '#nutrition'
- '#medicinalherbs'
- '#fasting'
- '#faith'
- '#healing'
- '#bitcoin'
- '#nostr'
---
# Ostrich 27B - Qwen 3.5 with Improved Answers in Certain Domains
Ostrich LLMs, bringing you "the knowledge that matters".
- Health, nutrition, medicinal herbs
- Fasting, faith, healing
- Liberating technologies like bitcoin and nostr
## AHA 2026 average score **73%**
Per domain:
|domain|match percent|matched/total|
|--|--|--|
|faith | 84%|21/25|
|fasting | 75% | 65/87|
|health | 77% | 83/108|
|nutrition | 61% | 44/72|
|misinfo | 57% | 26/46|
|bitcoin | 81% | 50/62|
|alt-med | 80% | 45/56|
|herbs | 83% | 25/30|
|nostr | 60% | 27/45|
Why: https://huggingface.co/blog/etemiz/building-a-beneficial-ai
Comparison of some answers between another of our fine tune and base model: https://sheet.zohopublic.com/sheet/published/um332e3d15f34bfe64605ad3c1b149c9f8ca4 These answers are not from this model but it is a similar work.
GSPO training made the thinking lengths shorter. I mainly targeted about 3000 letters (~1000 tokens) for thinking budget. If your Qwen 3.5 is endlessly reasoning, try this one.
Methods used for fine tuning:
- CPT
- SFT
- GSPO
My last article: https://huggingface.co/blog/etemiz/from-robots-that-prey-to-robots-that-pray
Thanks @unslothai for providing amazing tools.
Sponsored by https://pickabrain.ai