--- license: apache-2.0 base_model: Qwen/Qwen3.5-27B tags: - qwen - qwen3.5 - reasoning - chat - text-only - 40b - upscale ---
NOT TO BE USED AS IS, AND REQUIRES FINE-TUNING.
This upscaled model produces gibberish out of the box and currently has a default PPL of 500k.
Send me your support to help me feed the data beast! also taking comissions for universe specific models
Support on Ko-fiThis model is an interleaved upscale of Qwen3.5-27B to 40B. It expands the base architecture from 64 to 96 layers using an interleaved copying technique.
Upscaling Details:
o_proj, down_proj, and out_proj were mapped with σ = 0.000625.