taf-agent / hf-space-readme.md
karlexmarin's picture
docs: rename paper "Transformer Thermodynamics" → "Predicting How Transformers Attend"
1a6c909
|
raw
history blame
2.82 kB
---
title: TAF Agent
emoji: 🔬
colorFrom: blue
colorTo: green
sdk: static
pinned: true
license: apache-2.0
short_description: Test ANY transformer LLM before you spend GPU/$. Free. Auditable.
tags:
- transformer
- llm
- diagnostic
- rope
- kv-cache
- long-context
- viability
- thermodynamics
- free
- browser
- webgpu
language:
- en
- es
- fr
- zh
---
# 🔬 TAF Agent
> **Test ANY transformer LLM before you spend GPU/$.**
> Free. Unlimited. Auditable. Runs entirely in your browser.
---
## What it does
Predicts practical viability of any transformer LLM from its config alone:
- Will Llama-3-8B serve 32K context with NIAH retrieval?
- Should I train a custom 7B model or use GPT-4o for 50M tokens/month?
- Cheapest GPU to serve Llama-70B at 100M tokens/day?
- Which KV compression strategy fits my model's γ profile?
5 cross-section recipes, 5 modes, 4 languages (EN/ES/FR/ZH), 100% in-browser.
## Why it's different
- **Truly free**: no server, no auth, no rate limits. Compute runs in YOUR browser.
- **Auditable**: every number is deterministic Python (TAF formulas, see paper).
No hallucination — the LLM only synthesises, doesn't invent values.
- **Falsifiable**: 23 paper predictions tracked publicly with verification status.
- **Community-first**: submit your analyses to a public registry; debate them.
## Architecture coverage
✓ RoPE-MHA · ✓ RoPE-GQA · ✓ ALiBi · ✓ AbsPE · ✓ SWA · ✓ SSM · ✓ Any HuggingFace public model
## Modes
- **📇 Profile**: paste model id → all 5 recipes scored at once = TAF Card
- **🆚 Compare**: 2-3 models side-by-side on same recipe
- **🔍 Inspector**: paste raw config.json (private/in-development models)
- **💬 Ask**: free-form question, in-browser LLM picks the recipe
- **📋 Recipe**: manual selection with full form control
## Underlying paper
[Marin 2026 — Predicting How Transformers Attend](https://zenodo.org/records/19826343)
## Source
[github.com/karlesmarin/tafagent](https://github.com/karlesmarin/tafagent)
## Public registry
[tafagent-registry](https://github.com/karlesmarin/tafagent-registry) — community-submitted analyses
## Citation
```bibtex
@misc{marin2026tafagent,
author = {Marin, Carles},
title = {{TAF Agent}: Browser-Based Transformer Diagnostic Tool},
year = {2026},
url = {https://huggingface.co/spaces/karlexmarin/taf-agent},
}
```
## Acknowledgements
Built by an independent researcher with the help of LLMs as research instruments.
Not affiliated with any model vendor.
The tool would not exist without the open-weights commons (Meta, Mistral, Qwen,
EleutherAI, AI2, BigScience, TII, DeepSeek, Microsoft, Google DeepMind, Anthropic),
the Pyodide + WebLLM projects, and HuggingFace for hosting models, datasets,
and now this Space.