taf-agent / hf-space-readme.md
karlexmarin's picture
docs: rename paper "Transformer Thermodynamics" → "Predicting How Transformers Attend"
1a6c909
|
raw
history blame
2.82 kB
metadata
title: TAF Agent
emoji: 🔬
colorFrom: blue
colorTo: green
sdk: static
pinned: true
license: apache-2.0
short_description: Test ANY transformer LLM before you spend GPU/$. Free. Auditable.
tags:
  - transformer
  - llm
  - diagnostic
  - rope
  - kv-cache
  - long-context
  - viability
  - thermodynamics
  - free
  - browser
  - webgpu
language:
  - en
  - es
  - fr
  - zh

🔬 TAF Agent

Test ANY transformer LLM before you spend GPU/$. Free. Unlimited. Auditable. Runs entirely in your browser.


What it does

Predicts practical viability of any transformer LLM from its config alone:

  • Will Llama-3-8B serve 32K context with NIAH retrieval?
  • Should I train a custom 7B model or use GPT-4o for 50M tokens/month?
  • Cheapest GPU to serve Llama-70B at 100M tokens/day?
  • Which KV compression strategy fits my model's γ profile?

5 cross-section recipes, 5 modes, 4 languages (EN/ES/FR/ZH), 100% in-browser.

Why it's different

  • Truly free: no server, no auth, no rate limits. Compute runs in YOUR browser.
  • Auditable: every number is deterministic Python (TAF formulas, see paper). No hallucination — the LLM only synthesises, doesn't invent values.
  • Falsifiable: 23 paper predictions tracked publicly with verification status.
  • Community-first: submit your analyses to a public registry; debate them.

Architecture coverage

✓ RoPE-MHA · ✓ RoPE-GQA · ✓ ALiBi · ✓ AbsPE · ✓ SWA · ✓ SSM · ✓ Any HuggingFace public model

Modes

  • 📇 Profile: paste model id → all 5 recipes scored at once = TAF Card
  • 🆚 Compare: 2-3 models side-by-side on same recipe
  • 🔍 Inspector: paste raw config.json (private/in-development models)
  • 💬 Ask: free-form question, in-browser LLM picks the recipe
  • 📋 Recipe: manual selection with full form control

Underlying paper

Marin 2026 — Predicting How Transformers Attend

Source

github.com/karlesmarin/tafagent

Public registry

tafagent-registry — community-submitted analyses

Citation

@misc{marin2026tafagent,
  author = {Marin, Carles},
  title  = {{TAF Agent}: Browser-Based Transformer Diagnostic Tool},
  year   = {2026},
  url    = {https://huggingface.co/spaces/karlexmarin/taf-agent},
}

Acknowledgements

Built by an independent researcher with the help of LLMs as research instruments. Not affiliated with any model vendor.

The tool would not exist without the open-weights commons (Meta, Mistral, Qwen, EleutherAI, AI2, BigScience, TII, DeepSeek, Microsoft, Google DeepMind, Anthropic), the Pyodide + WebLLM projects, and HuggingFace for hosting models, datasets, and now this Space.