--- license: apache-2.0 language: - en base_model: - LiquidAI/LFM2-8B-A1B pipeline_tag: text-generation library_name: transformers tags: - distill - finetune - unsloth - moe - mixture of experts ---

LFM2-8B-A1B-Instruct-Quantum-IQ1C-Qwen3.6-35B-A3B-DISTILL2

A LFM 8B-1A8 (MOE, 32 experts) trained on a custom built DISTILL dataset using Qwen 3.6 35B-A3B (MOE, 256 experts, in thinking mode, with 24k context) as a teacher via LMSTUDIO "server" (using custom python scripts) using IQ4_XS NEO-CODE-Dimatrix (Imatrix) custom quant. Model training via Unsloth on local hardware via custom training scripts. Training notes: - Model intelligence has jumped. - Serious improvements in formatting, style, and prose. - Rep pen of 1, (off) can now be used (whereas the root model required 1.05 to 1.1). - Model maintained as "instruct" rather than converting to "thinking". This is an INSTRUCT model that runs at over 500 t/s on a 5090 GPU, 50-100 t/s on CPU. SETTINGS/MODEL INFO: - 128k max context, suggest 8k min context window. - Rep pen 1. [off] ; note model maker suggests: rep pen 1.05 - Temp .2 to 1.2 ; temp of 1 used during testing. - 4 of 32 experts activated by default. - Model is moderately uncensored. - Suggest quant IQXxs (imatrix) or Q5/Q6 non imatrix ggufs. ``` IN HOUSE BENCHMARKS [by Nightmedia]: arc-c arc/e boolq hswag obkqa piqa wino LFM2-8B-A1B-Instruct-Quantum-IQ1C-Qwen3.6-35B-A3B-DISTILL2 q8-hi 0.484,0.702,0.796,0.649,0.414,0.755,0.616 LFM2-8B-A1B-Instruct-Quantum-IQ1C-Qwen3.6-35B-A3B-DISTILL q8-hi 0.471,0.677,0.783,0.647,0.390,0.740,0.616 --- BASE UNTUNED MODEL: LFM2-8B-A1B mxfp8 0.460,0.575,0.829,0.624,0.394,0.711,0.567 ``` ---

Example Generations:

Q6_K, rep pen: 1 [off], temp:1 non imatrix NOTE: Some formatting may be lost. --- EXAMPLE #1 --- [coming soon]