--- language: [en] license: mit tags: - education - pedagogy - edtech - e-learning - slm - llama-style - rope - 5m-context - from-scratch pipeline_tag: text-generation --- # Education-SLM: Education Small Language Model A **LLaMA-style transformer** (~33.9M params) trained from scratch on Education domain data. Supports up to **5M token context** via RoPE. ## Architecture | Component | Value | |-----------|-------| | Architecture | LLaMA-style (RoPE + RMSNorm + SwiGLU) | | Parameters | ~33.9M | | Layers | 8 | | Heads | 8 | | Embedding | 512 | | Max Context | 5,000,000 tokens | | Vocab | 16,000 BPE | | Best Loss | 0.7721403505653143 | ## License MIT — Built from scratch.