--- title: TAF Agent emoji: ๐Ÿ”ฌ colorFrom: blue colorTo: green sdk: static pinned: true license: apache-2.0 short_description: Test ANY transformer LLM before you spend GPU/$. Free. Auditable. tags: - transformer - llm - diagnostic - rope - kv-cache - long-context - viability - thermodynamics - free - browser - webgpu language: - en - es - fr - zh --- # ๐Ÿ”ฌ TAF Agent > **Test ANY transformer LLM before you spend GPU/$.** > Free. Unlimited. Auditable. Runs entirely in your browser. --- ## What it does Predicts practical viability of any transformer LLM from its config alone: - Will Llama-3-8B serve 32K context with NIAH retrieval? - Should I train a custom 7B model or use GPT-4o for 50M tokens/month? - Cheapest GPU to serve Llama-70B at 100M tokens/day? - Which KV compression strategy fits my model's ฮณ profile? 5 cross-section recipes, 5 modes, 4 languages (EN/ES/FR/ZH), 100% in-browser. ## Why it's different - **Truly free**: no server, no auth, no rate limits. Compute runs in YOUR browser. - **Auditable**: every number is deterministic Python (TAF formulas, see paper). No hallucination โ€” the LLM only synthesises, doesn't invent values. - **Falsifiable**: 23 paper predictions tracked publicly with verification status. - **Community-first**: submit your analyses to a public registry; debate them. ## Architecture coverage โœ“ RoPE-MHA ยท โœ“ RoPE-GQA ยท โœ“ ALiBi ยท โœ“ AbsPE ยท โœ“ SWA ยท โœ“ SSM ยท โœ“ Any HuggingFace public model ## Modes - **๐Ÿ“‡ Profile**: paste model id โ†’ all 5 recipes scored at once = TAF Card - **๐Ÿ†š Compare**: 2-3 models side-by-side on same recipe - **๐Ÿ” Inspector**: paste raw config.json (private/in-development models) - **๐Ÿ’ฌ Ask**: free-form question, in-browser LLM picks the recipe - **๐Ÿ“‹ Recipe**: manual selection with full form control ## Underlying paper [Marin 2026 โ€” Predicting How Transformers Attend](https://zenodo.org/records/19826343) ## Source [github.com/karlesmarin/tafagent](https://github.com/karlesmarin/tafagent) ## Public registry [tafagent-registry](https://github.com/karlesmarin/tafagent-registry) โ€” community-submitted analyses ## Citation ```bibtex @misc{marin2026tafagent, author = {Marin, Carles}, title = {{TAF Agent}: Browser-Based Transformer Diagnostic Tool}, year = {2026}, url = {https://huggingface.co/spaces/karlexmarin/taf-agent}, } ``` ## Acknowledgements Built by an independent researcher with the help of LLMs as research instruments. Not affiliated with any model vendor. The tool would not exist without the open-weights commons (Meta, Mistral, Qwen, EleutherAI, AI2, BigScience, TII, DeepSeek, Microsoft, Google DeepMind, Anthropic), the Pyodide + WebLLM projects, and HuggingFace for hosting models, datasets, and now this Space.