--- license: apache-2.0 tags: - 256k context - Qwen3 - Mixture of Experts - MOE - MOE Dense - 2 experts - 4Bx6 - All use cases - bfloat16 - merge - creative - creative writing - fiction writing - plot generation - sub-plot generation - fiction writing - story generation - scene continue - storytelling - fiction story - science fiction - romance - all genres - story - writing - vivid prosing - vivid writing - fiction pipeline_tag: text-generation language: - en library_name: transformers ---

Qwen3-24B-MOE-6x-4B-Star-Trek-AwayTeam-Instruct

A fully gated INSTRUCT MOE (Mixture of Experts) model of 24B compressed into 18B "model size". This is a "Colab" between myself and Nightmedia. Gating is based on Star Trek characters NAME(s) that each model said it was closest to during testing (no quotes): - "Q Continuum" - "[Q]" - "Enterprise Computer" - "Quark" - "Picard" - "Sisko" - "Janeway" - "Garak" - "Martok" - "Spock" - "Sarek" - "Data" - "Seven of Nine" - "Kira" - "Odo" - "Dr Crusher" - "Bashir" - "Worf" - "Klingons" Use like: "Sisko, [prompt here]" or "Sisko, Kira and Worf [prompt here]" You can use: "Away-Team" to address all experts. Each model is isolated from one another and controlled using prompts and/or activation of additional experts. You can set experts from 1 to 6 with default of 2. Features: - Six of the top Qwen3 4B models (each benchmarked) in one package. - 2 experts activated (adjustable) - "programmable" model which features gating instructions embedded in prompts and/or system prompts. - 256k context. [more coming soon...]