BitNet b1.58 (MIT License) + Llama3-8B-1.58 (Llama 3 Community License). ATLAS TQ1.0. CPU inference; no GPU needed.
Note: 2B (SubLN, ReLU²)
Note: 8B (Llama3 arch, GQA, QK-Norm)
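
Sketch: what "1.58" and "ReLU²" refer to, as a minimal illustration. It assumes the absmean ternary scheme described for BitNet b1.58 (scale by mean absolute weight, round, clip to {-1, 0, +1}) and takes ReLU² to mean squared ReLU. The function names are hypothetical and this is not the ATLAS or TQ1.0 code path.

```python
import math


def absmean_ternary(weights, eps=1e-8):
    """Quantize a flat list of floats to {-1, 0, +1} with one shared scale.

    Assumption: absmean scaling as described for BitNet b1.58.
    """
    scale = sum(abs(w) for w in weights) / len(weights) + eps
    # Round each scaled weight to the nearest integer, then clip to [-1, 1].
    quantized = [max(-1, min(1, round(w / scale))) for w in weights]
    return quantized, scale


def relu_squared(x):
    """Squared ReLU activation: max(0, x) ** 2."""
    return max(0.0, x) ** 2


# Three possible values per weight carry log2(3) bits, hence "1.58".
BITS_PER_TERNARY_WEIGHT = math.log2(3)

if __name__ == "__main__":
    q, s = absmean_ternary([0.9, -0.05, -1.2, 0.3])
    print(q, round(s, 4), round(BITS_PER_TERNARY_WEIGHT, 2))
```

Dequantizing is `q_i * scale`, so a matrix-vector product over ternary weights needs only additions and subtractions plus one final multiply by the scale. That property is what makes CPU-only inference practical.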