Spaces:

build-small-hackathon
/

duel-of-albion

Sleeping

App Files Files Community

Sathvik0101 commited on 12 days ago

Commit

8ee4863

verified ·

1 Parent(s): 460851a

Deploy exact mirror from Sathvik0101/cyberpunk-duel-ai (Gemma 3 4B)

Browse files

Files changed (10) hide show

3d_scene.html +0 -0
Dockerfile +4 -0
README.md +9 -78
app.py +392 -696
static/assets/index-DUDqKLMt.css +1 -0
static/assets/index-Dp4pBtge.js +0 -0
static/favicon.svg +1 -0
static/icons.svg +24 -0
static/index.html +30 -0
three.min.js +0 -0

3d_scene.html ADDED Viewed

The diff for this file is too large to render. See raw diff

Dockerfile CHANGED Viewed

@@ -2,10 +2,14 @@ FROM pytorch/pytorch:2.4.0-cuda12.1-cudnn9-runtime
 WORKDIR /app
 COPY requirements.txt .
 RUN pip install --no-cache-dir -r requirements.txt --extra-index-url https://download.pytorch.org/whl/cu121
 COPY app.py .
 ENV PORT=7860
 EXPOSE 7860

 WORKDIR /app
+# Install dependencies
 COPY requirements.txt .
 RUN pip install --no-cache-dir -r requirements.txt --extra-index-url https://download.pytorch.org/whl/cu121
+# Copy app files
 COPY app.py .
+COPY 3d_scene.html .
+COPY three.min.js .
 ENV PORT=7860
 EXPOSE 7860

README.md CHANGED Viewed

@@ -1,85 +1,16 @@
 ---
-title: Cyber Duel Tiny
-emoji: ⚡
-colorFrom: yellow
-colorTo: red
 sdk: docker
-hardware: a10g-small
-app_file: app.py
 pinned: false
 ---
-# Cyber Duel Tiny
-A 270M-parameter combat advisor that replaces the Gemma 3 4B base model
-in [Duel of Albion](https://huggingface.co/spaces/Sathvik0101/cyberpunk-duel-ai).
-## What it does
-Given the player's last 5 moves, the model recommends a counter-move from
-the 9 legal options (jab, cross, low_kick, roundhouse, uppercut, parry,
-backstep, clinch, throw), conditioned on both fighters' stats
-(speed, power, range, weight, stance), stamina, distance, and round.
-## API
-`POST /predict` — counter-move recommendation.
-```http
-POST /predict
-Content-Type: application/json
-{
-  "sequence":      "jab,cross,low_kick,jab,cross",
-  "player":        {"name": "monk", "speed": 5, "power": 2, "range": 3, "weight": 0.8, "stance": "low", "stamina": 100, "hp": 100},
-  "npc":           {"name": "brute","speed": 1, "power": 5, "range": 2, "weight": 1.4, "stance": "hunched", "stamina": 100, "hp": 100},
-  "round":         3,
-  "distance":      "close",
-  "playerId":      "ab12-...",          // optional, enables online RL
-  "playerPrevMove": "jab"                // optional, back-fills the previous log row
-}
-```
-```json
-{
-  "reasoning":    "The player is alternating jab and cross before finishing low...",
-  "counterMove":  "throw",
-  "sequence":     "jab,cross,low_kick,jab,cross",
-  "adapterScope": "user"                  // "user" once you have a personalised adapter
-}
-```
-`GET  /health`   — `{ready, has_token, online_rl_enabled, user_adapters_cached, buffered_users}`
-`GET  /me?playerId=...`   — `{rounds_logged, next_retrain_in, cooldown_left_sec, adapter_scope, online_rl_enabled}`
-`POST /forget`  body `{"playerId": "..."}` — deletes the user's adapter + log (privacy / GDPR).
-### Online RL
-If the request includes a `playerId` and the Space was started with the
-`MODAL_WEBHOOK_URL` and `MODAL_WEBHOOK_SECRET` env vars set, the Space
-will:
-1. Log `(state, model_move, player_next_move)` to the
-   `cyber-duel-tiny-logs` Hugging Face dataset (private).
-2. After 25 fresh rows have been flushed, POST to the Modal webhook to
-   trigger a per-user DPO retrain.
-3. The new LoRA delta is uploaded to `cyber-duel-tiny-users/<uid>/` and
-   loaded on the next `/predict` for that player.
-The default base adapter (`Sathvik0101/cyber-duel-tiny-adapter`) is used
-as the starting point for every per-user delta, and the global base
-gets refreshed weekly by Modal's `retrain_global_base` job.
-## Training
-Trained on Modal with LoRA + DPO (verifiable rewards from the in-game
-combat resolver). See `modal/app.py` in the [training repo](https://huggingface.co/Sathvik0101/cyber-duel-tiny-adapter).
-## How to redeploy
-1. Add `HF_TOKEN` as a Space Secret so the gated `gemma-3-270m-it` weights can be downloaded.
-2. (Optional) Add `MODAL_WEBHOOK_URL` and `MODAL_WEBHOOK_SECRET` to enable
-   online per-user RL.
-3. Update `ADAPTER_MODEL` env var to point to the latest adapter release.
-4. The Space will hot-reload on push (you may need a manual restart to
-   pick up the new code if the env vars change).

 ---
+title: Duel of Albion
+emoji: ⚔️
+colorFrom: purple
+colorTo: blue
 sdk: docker
 pinned: false
 ---
+# Duel of Albion
+A 3D AI-powered cyberpunk fighting game built with FastAPI.
+The AI opponent uses a fine-tuned Gemma 3 4B model to counter your moves in real time.
+## Setup
+Add your HF_TOKEN as a Secret in Space settings so the gated base model (google/gemma-3-4b-it) can be downloaded. The fine-tuned adapter is public at Sathvik0101/gemma-3-combat-npc-adapter.

app.py CHANGED Viewed

@@ -1,696 +1,392 @@
-"""Cyber Duel Tiny -- FastAPI service for the HF Space.
-Drop-in replacement for the 4B Gemma advisor, with optional per-user
-online RL: each /predict can carry a `playerId` (UUID minted by the
-client), and the Space will lazily load that user's LoRA delta adapter
-from the `cyber-duel-tiny-users` Hub repo, log the (state, model_move,
-player_next_move) triple to the `cyber-duel-tiny-logs` dataset repo, and
-trigger a Modal retrain job when the user has accumulated enough new
-clean pairs.  Clients without `playerId` keep using the frozen global
-adapter exactly as before.
-API
----
-POST /predict
-  Legacy form (still supported, no RL):
-    {"sequence": "jab,cross,low_kick,roundhouse,uppercut"}
-  Full state form (recommended):
-    {
-      "sequence":  "jab,cross,low_kick,roundhouse,uppercut",
-      "player": {...}, "npc": {...},
-      "round": 3, "distance": "close",
-      "playerId":      "ab12-...",        # optional
-      "playerPrevMove": "jab",            # optional, required for online RL
-    }
-GET  /health                          # {ready, has_token, online_rl_enabled}
-GET  /me?playerId=...                 # {rounds_logged, retrains_done, ...}
-POST /forget     {"playerId": "..."}  # delete user's adapter + log
-"""
-import hashlib
-import hmac
-import json
-import logging
-import os
-import re
-import secrets
-import threading
-import time
-from collections import OrderedDict, defaultdict
-from pathlib import Path
-from typing import Any, Dict, List, Optional, Tuple
-import torch
-from transformers import AutoTokenizer, AutoModelForCausalLM, AutoConfig
-from peft import PeftModel
-from huggingface_hub import (
-    HfApi, snapshot_download, hf_hub_download, create_repo,
-)
-from fastapi import FastAPI, Request
-from fastapi.middleware.cors import CORSMiddleware
-from fastapi.responses import JSONResponse
-import gradio as gr
-logging.basicConfig(level=os.environ.get("LOG_LEVEL", "INFO"),
-                    format="%(asctime)s | %(levelname)s | %(message)s")
-log = logging.getLogger("cyber-duel-tiny")
-BASE_MODEL = os.environ.get("BASE_MODEL", "google/gemma-3-270m-it")
-ADAPTER_MODEL = os.environ.get("ADAPTER_MODEL", "Sathvik0101/cyber-duel-tiny-adapter")
-USERS_REPO = os.environ.get("USERS_REPO", "Sathvik0101/cyber-duel-tiny-users")
-LOGS_REPO = os.environ.get("LOGS_REPO", "Sathvik0101/cyber-duel-tiny-logs")
-MODAL_WEBHOOK_URL = os.environ.get("MODAL_WEBHOOK_URL", "").strip()
-MODAL_WEBHOOK_SECRET = os.environ.get("MODAL_WEBHOOK_SECRET", "").strip()
-SKIP_MODEL_LOAD = os.environ.get("SKIP_MODEL_LOAD", "0") == "1"
-ONLINE_RL_ENABLED = MODAL_WEBHOOK_URL != "" and MODAL_WEBHOOK_SECRET != ""
-# Online-RL tunables
-RETRAIN_THRESHOLD = int(os.environ.get("RETRAIN_THRESHOLD", "25"))
-LOG_BUFFER_FLUSH_SEC = int(os.environ.get("LOG_BUFFER_FLUSH_SEC", "15"))
-LOG_BUFFER_FLUSH_ROWS = int(os.environ.get("LOG_BUFFER_FLUSH_ROWS", "10"))
-USER_ADAPTER_CACHE_SIZE = int(os.environ.get("USER_ADAPTER_CACHE_SIZE", "32"))
-RETRAIN_COOLDOWN_SEC = int(os.environ.get("RETRAIN_COOLDOWN_SEC", "600"))  # 10 min
-# Track retrain-request timestamps per uid in RAM to avoid spamming Modal
-RETRAIN_INFLIGHT: Dict[str, float] = {}
-# Per-uid last flush time (RAM-side; not a Space secret)
-LAST_FLUSH_AT: Dict[str, float] = {}
-# ---- Global model state ---------------------------------------------------
-HAS_MODEL = False
-base_model = None  # the underlying base, shared by all PEFT deltas
-global_adapter = None  # PEFT model wrapping base_model with the global DPO adapter
-tokenizer = None
-# ---- Per-user state -------------------------------------------------------
-# LRU cache of PeftModel objects, keyed by playerId
-USER_ADAPTER_CACHE: "OrderedDict[str, PeftModel]" = OrderedDict()
-# Per-uid pending log rows (not yet flushed to Hub)
-LOG_BUFFER: Dict[str, List[Dict[str, Any]]] = defaultdict(list)
-# Last row for this uid (so the *next* /predict can fill in player_next_move)
-LAST_ROW: Dict[str, Dict[str, Any]] = {}
-# uid -> total flushed rows (for /me + retrain threshold)
-FLUSHED_COUNT: Dict[str, int] = defaultdict(int)
-# uid -> last time we POSTed /retrain to Modal (RAM-side cooldown)
-LAST_RETRAIN_REQUEST: Dict[str, float] = {}
-# Single lock so concurrent /predict calls don't double-flush
-state_lock = threading.Lock()
-LEGAL_MOVES = ("jab", "cross", "low_kick", "roundhouse", "uppercut",
-               "parry", "backstep", "clinch", "throw")
-# ---- Prompt schema (mirrors train/common.py + generate_data_v2.py) -------
-DEFAULT_PLAYER = {"name": "fighter", "speed": 3, "power": 3, "range": 3,
-                  "weight": 1.0, "stance": "neutral", "stamina": 100, "hp": 100}
-DEFAULT_NPC = {"name": "fighter", "speed": 3, "power": 3, "range": 3,
-               "weight": 1.0, "stance": "neutral", "stamina": 100, "hp": 100}
-SYSTEM_PROMPT = (
-    "You are an expert NPC AI for Duel of Albion, a 3D fighting game.\n"
-    "Read the round, distance, both fighters' stats and stances, and the "
-    "player's last 5 moves. Choose the single best counter-move from the 9 "
-    "legal moves. Always end your reply with `counter_move: <move>` on its "
-    "own line."
-)
-def get_hf_token() -> Optional[str]:
-    tok = os.environ.get("HF_TOKEN") or os.environ.get("HUGGINGFACE_TOKEN")
-    if tok:
-        return tok
-    cache = Path.home() / ".cache" / "huggingface" / "token"
-    if cache.exists():
-        return cache.read_text(encoding="utf-8").strip() or None
-    return None
-def load_global_model():
-    """Load the base + global DPO adapter once, share base_model across
-    all per-user PEFT deltas."""
-    global HAS_MODEL, base_model, global_adapter, tokenizer
-    if SKIP_MODEL_LOAD:
-        log.info("SKIP_MODEL_LOAD=1 -- model is not loaded")
-        return
-    try:
-        hf_token = get_hf_token()
-        log.info(f"Loading base {BASE_MODEL} + global adapter {ADAPTER_MODEL}...")
-        tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL, token=hf_token)
-        if tokenizer.pad_token is None:
-            tokenizer.pad_token = tokenizer.eos_token
-        tokenizer.padding_side = "left"
-        device_arg = "auto" if torch.cuda.is_available() else None
-        dtype = torch.bfloat16 if torch.cuda.is_available() else torch.float32
-        config = AutoConfig.from_pretrained(BASE_MODEL, token=hf_token)
-        if hasattr(config, "vision_config") and config.vision_config is not None:
-            config.vision_config = None
-        base_model = AutoModelForCausalLM.from_pretrained(
-            BASE_MODEL, config=config, token=hf_token,
-            torch_dtype=dtype, device_map=device_arg,
-        )
-        adapter_path = snapshot_download(repo_id=ADAPTER_MODEL, token=hf_token)
-        global_adapter = PeftModel.from_pretrained(base_model, adapter_path)
-        global_adapter.eval()
-        # Warmup
-        warmup_state = {
-            "player": DEFAULT_PLAYER, "npc": DEFAULT_NPC,
-            "round": 1, "distance": "close",
-            "sequence": "jab,cross,low_kick,roundhouse,uppercut",
-        }
-        warmup_prompt = build_prompt(warmup_state)
-        warmup_inputs = tokenizer(warmup_prompt, return_tensors="pt").to(base_model.device)
-        with torch.no_grad():
-            _ = global_adapter.generate(
-                **warmup_inputs, max_new_tokens=20, do_sample=False,
-                pad_token_id=tokenizer.eos_token_id,
-            )
-        HAS_MODEL = True
-        log.info("Global model loaded and warmed up on %s", base_model.device)
-    except Exception as e:
-        log.exception("Global model load failed: %s", e)
-        base_model = None
-        global_adapter = None
-        tokenizer = None
-        HAS_MODEL = False
-# ---- Prompt + parser ------------------------------------------------------
-def _format_fighter(f: dict) -> str:
-    return (
-        f"{f.get('name', 'fighter')}"
-        f" (speed={f.get('speed', 3)}, power={f.get('power', 3)}, "
-        f"range={f.get('range', 3)}, weight={f.get('weight', 1.0)}, "
-        f"stance={f.get('stance', 'neutral')}, "
-        f"stamina={f.get('stamina', 100)}, hp={f.get('hp', 100)})"
-    )
-def build_prompt(state: dict) -> str:
-    player = state.get("player") or DEFAULT_PLAYER
-    npc = state.get("npc") or DEFAULT_NPC
-    round_ = state.get("round", 1)
-    dist = state.get("distance", "close")
-    sequence = state.get("sequence", "jab,cross,low_kick,roundhouse,uppercut")
-    user_msg = (
-        f"Round {round_} | Distance: {dist}\n"
-        f"Player: {_format_fighter(player)}\n"
-        f"NPC   : {_format_fighter(npc)}\n"
-        f"Player last 5 moves: {sequence}\n"
-        f"Decide the best counter-move from: "
-        f"{', '.join(LEGAL_MOVES)}."
-    )
-    return (
-        f"<start_of_turn>user\n{SYSTEM_PROMPT}\n\n{user_msg}<end_of_turn>\n"
-        f"<start_of_turn>model\n"
-    )
-def parse_counter(text: str) -> str:
-    text_low = text.lower()
-    if "counter_move:" in text_low:
-        tail = text_low.split("counter_move:", 1)[1].strip()
-        first = tail.split()[0].strip(".,!?;:'\"")
-        first = first.rstrip(",.;:?!")
-        if first in LEGAL_MOVES:
-            return first
-    for m in LEGAL_MOVES:
-        if m in text_low:
-            return m
-    return "jab"
-# ---- Inference ------------------------------------------------------------
-def _generate_with_model(model, state: dict) -> Tuple[str, str]:
-    """Returns (full_text, counter_move)."""
-    prompt = build_prompt(state)
-    inputs = tokenizer(prompt, return_tensors="pt").to(base_model.device)
-    with torch.no_grad():
-        out = model.generate(
-            **inputs, max_new_tokens=200, do_sample=False,
-            pad_token_id=tokenizer.eos_token_id,
-        )
-    text = tokenizer.decode(
-        out[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True,
-    )
-    return text, parse_counter(text)
-def get_model_for(uid: Optional[str]):
-    """Return the PeftModel to use for this request.
-    LRU-cache per-user deltas; lazy-download from Hub on first request for
-    a new uid.  Falls back to the global adapter if the user has none yet.
-    """
-    if not uid or not ONLINE_RL_ENABLED:
-        return global_adapter, "global"
-    with state_lock:
-        if uid in USER_ADAPTER_CACHE:
-            USER_ADAPTER_CACHE.move_to_end(uid)
-            return USER_ADAPTER_CACHE[uid], "user"
-    # Lazy download outside the lock.  We use snapshot_download to grab
-    # the entire <uid>/ folder (adapter weights + tokenizer files).
-    try:
-        adapter_dir = snapshot_download(
-            repo_id=USERS_REPO, allow_patterns=[f"{uid}/*"],
-            token=get_hf_token(),
-        )
-        # snapshot_download returns the local root that contains
-        # `<uid>/adapter_model.safetensors`.  PEFT's from_pretrained
-        # expects the folder containing adapter_config.json.
-        per_user_dir = str(Path(adapter_dir) / uid)
-        if not (Path(per_user_dir) / "adapter_config.json").exists():
-            raise FileNotFoundError(f"adapter_config.json not in {per_user_dir}")
-        delta = PeftModel.from_pretrained(base_model, per_user_dir)
-        delta.eval()
-        with state_lock:
-            USER_ADAPTER_CACHE[uid] = delta
-            USER_ADAPTER_CACHE.move_to_end(uid)
-            while len(USER_ADAPTER_CACHE) > USER_ADAPTER_CACHE_SIZE:
-                USER_ADAPTER_CACHE.popitem(last=False)
-        return delta, "user"
-    except Exception as e:
-        log.info("No per-user adapter for %s (%s) -- using global", uid, e)
-        return global_adapter, "global"
-def evict_user_cache(uid: str):
-    with state_lock:
-        USER_ADAPTER_CACHE.pop(uid, None)
-# ---- Online-RL logging ----------------------------------------------------
-def _state_to_log(state: dict, model_move: str) -> Dict[str, Any]:
-    """Build a log row from the /predict state and the model's response."""
-    player = state.get("player") or DEFAULT_PLAYER
-    npc = state.get("npc") or DEFAULT_NPC
-    last5 = (state.get("sequence", "").split(",") + ["jab"] * 5)[:5]
-    dist = state.get("distance", "close")
-    distance_m = {"close": 1.5, "mid": 3.0, "far": 4.5}.get(dist, 3.0)
-    return {
-        "ts": int(time.time()),
-        "uid": state.get("playerId", ""),
-        "state": {
-            "player_char_id": player.get("name", "fighter"),
-            "npc_char_id": npc.get("name", "fighter"),
-            "player_speed": int(player.get("speed", 3)),
-            "player_power": int(player.get("power", 3)),
-            "player_range": int(player.get("range", 3)),
-            "player_weight": float(player.get("weight", 1.0)),
-            "player_stance": player.get("stance", "neutral"),
-            "npc_speed": int(npc.get("speed", 3)),
-            "npc_power": int(npc.get("power", 3)),
-            "npc_range": int(npc.get("range", 3)),
-            "npc_weight": float(npc.get("weight", 1.0)),
-            "npc_stance": npc.get("stance", "neutral"),
-            "distance_bucket": dist,
-            "distance": distance_m,
-            "player_stamina": int(player.get("stamina", 100)),
-            "npc_stamina": int(npc.get("stamina", 100)),
-            "round": int(state.get("round", 1)),
-            "last5": last5,
-        },
-        "model_move": model_move,
-        "model_adapter_scope": "user" if state.get("playerId") else "global",
-        "player_next_move": None,  # filled in by next /predict or on flush
-    }
-def _flush_user_log(uid: str) -> int:
-    """Atomically upload the buffered log rows for `uid` to the logs repo."""
-    with state_lock:
-        rows = LOG_BUFFER.pop(uid, [])
-    if not rows:
-        return 0
-    n = len(rows)
-    try:
-        api = HfApi()
-        # Ensure repo exists
-        create_repo(LOGS_REPO, repo_type="dataset", private=True, exist_ok=True)
-        # Download existing, append, re-upload (simple & correct)
-        existing_lines: List[str] = []
-        try:
-            existing = hf_hub_download(
-                repo_id=LOGS_REPO, repo_type="dataset",
-                filename=f"users/{uid}.jsonl", token=get_hf_token(),
-            )
-            with open(existing, "r", encoding="utf-8") as f:
-                existing_lines = f.readlines()
-        except Exception:
-            pass
-        new_lines = [json.dumps(r, ensure_ascii=False) + "\n" for r in rows]
-        with state_lock:
-            FLUSHED_COUNT[uid] += n
-        api.upload_file(
-            path_or_fileobj="".join(existing_lines + new_lines).encode("utf-8"),
-            path_in_repo=f"users/{uid}.jsonl",
-            repo_id=LOGS_REPO, repo_type="dataset",
-            commit_message=f"Append {n} rows for {uid}",
-            token=get_hf_token(),
-        )
-        log.info("Flushed %d rows for %s (total %d)",
-                 n, uid, FLUSHED_COUNT[uid])
-    except Exception as e:
-        log.warning("Log flush failed for %s: %s", uid, e)
-        # Put them back so we don't lose data
-        with state_lock:
-            LOG_BUFFER[uid] = rows + LOG_BUFFER.get(uid, [])
-    return n
-def _post_webhook(path: str, payload: Dict[str, Any]) -> bool:
-    if not (MODAL_WEBHOOK_URL and MODAL_WEBHOOK_SECRET):
-        return False
-    body = json.dumps(payload, separators=(",", ":")).encode("utf-8")
-    sig = hmac.new(MODAL_WEBHOOK_SECRET.encode(), body, hashlib.sha256).hexdigest()
-    try:
-        import urllib.request
-        req = urllib.request.Request(
-            MODAL_WEBHOOK_URL.rstrip("/") + path,
-            data=body, method="POST",
-            headers={
-                "Content-Type": "application/json",
-                "X-Signature": sig,
-                "X-Timestamp": str(int(time.time())),
-            },
-        )
-        with urllib.request.urlopen(req, timeout=5) as resp:
-            log.info("Webhook %s -> %s", path, resp.status)
-            return 200 <= resp.status < 300
-    except Exception as e:
-        log.warning("Webhook %s failed: %s", path, e)
-        return False
-def _maybe_trigger_retrain(uid: str):
-    if not ONLINE_RL_ENABLED:
-        return
-    now = time.time()
-    with state_lock:
-        last_ts = LAST_RETRAIN_REQUEST.get(uid, 0)
-        flushed = FLUSHED_COUNT.get(uid, 0)
-    if now - last_ts < RETRAIN_COOLDOWN_SEC:
-        return
-    if flushed < RETRAIN_THRESHOLD:
-        return
-    with state_lock:
-        LAST_RETRAIN_REQUEST[uid] = now
-    log.info("Triggering retrain for %s (flushed=%d)", uid, flushed)
-    _post_webhook("/retrain", {"uid": uid})
-# ---- FastAPI app ----------------------------------------------------------
-app = FastAPI(title="cyber-duel-tiny")
-app.add_middleware(
-    CORSMiddleware, allow_origins=["*"], allow_methods=["*"], allow_headers=["*"],
-)
-@app.on_event("startup")
-def _startup():
-    load_global_model()
-    if ONLINE_RL_ENABLED:
-        log.info("Online RL enabled (webhook=%s, logs_repo=%s, users_repo=%s)",
-                 MODAL_WEBHOOK_URL, LOGS_REPO, USERS_REPO)
-    else:
-        log.info("Online RL disabled (no MODAL_WEBHOOK_URL/SECRET set)")
-@app.get("/health")
-def health():
-    return {
-        "ready": HAS_MODEL,
-        "has_token": get_hf_token() is not None,
-        "online_rl_enabled": ONLINE_RL_ENABLED,
-        "user_adapters_cached": len(USER_ADAPTER_CACHE),
-        "buffered_users": len(LOG_BUFFER),
-    }
-@app.get("/me")
-def me(playerId: str = ""):
-    if not ONLINE_RL_ENABLED:
-        return {"online_rl_enabled": False}
-    if not playerId:
-        return {"error": "missing playerId"}
-    with state_lock:
-        flushed = FLUSHED_COUNT.get(playerId, 0)
-        last_ts = LAST_RETRAIN_REQUEST.get(playerId, 0)
-        adapter_scope = "user" if playerId in USER_ADAPTER_CACHE else "global"
-    next_at = max(0, RETRAIN_THRESHOLD - flushed)
-    cooldown_left = max(0, int(RETRAIN_COOLDOWN_SEC - (time.time() - last_ts)))
-    return {
-        "playerId": playerId,
-        "rounds_logged": flushed,
-        "next_retrain_in": next_at,
-        "cooldown_left_sec": cooldown_left if last_ts else 0,
-        "adapter_scope": adapter_scope,
-        "online_rl_enabled": True,
-    }
-@app.post("/forget")
-async def forget(request: Request):
-    if not ONLINE_RL_ENABLED:
-        return JSONResponse(content={"ok": False, "reason": "online_rl_disabled"},
-                            status_code=400)
-    try:
-        data = await request.json()
-    except Exception:
-        data = {}
-    uid = (data or {}).get("playerId", "")
-    if not uid or len(uid) < 4 or len(uid) > 64:
-        return JSONResponse(content={"ok": False, "reason": "bad playerId"},
-                            status_code=400)
-    evict_user_cache(uid)
-    with state_lock:
-        LOG_BUFFER.pop(uid, None)
-        LAST_ROW.pop(uid, None)
-        FLUSHED_COUNT.pop(uid, None)
-        LAST_RETRAIN_REQUEST.pop(uid, None)
-        LAST_FLUSH_AT.pop(uid, None)
-    _post_webhook("/forget", {"uid": uid})
-    return {"ok": True, "uid": uid}
-@app.post("/predict")
-async def predict(request: Request):
-    sequence = ""
-    try:
-        try:
-            data = await request.json()
-        except Exception:
-            data = {}
-        sequence = (data or {}).get("sequence", "")
-        state = {
-            "sequence":  sequence,
-            "player":    (data or {}).get("player"),
-            "npc":       (data or {}).get("npc"),
-            "round":     (data or {}).get("round", 1),
-            "distance":  (data or {}).get("distance", "close"),
-            "playerId":  (data or {}).get("playerId", "") or "",
-        }
-        player_prev_move = (data or {}).get("playerPrevMove", "") or ""
-        if not HAS_MODEL:
-            return JSONResponse(
-                content={"reasoning": "(model not loaded)", "counterMove": "jab", "sequence": sequence},
-                status_code=503,
-            )
-        model, scope = get_model_for(state["playerId"] or None)
-        text, counter_move = _generate_with_model(model, state)
-        reasoning = text.split("counter_move:")[0].strip() if "counter_move:" in text else text.strip()
-        # ---- Online RL bookkeeping (only if a playerId was sent) ----
-        if ONLINE_RL_ENABLED and state["playerId"]:
-            uid = state["playerId"]
-            with state_lock:
-                # Backfill the previous row's player_next_move
-                if uid in LAST_ROW and player_prev_move in LEGAL_MOVES:
-                    LAST_ROW[uid]["player_next_move"] = player_prev_move
-                # Save THIS row for the next call to backfill
-                LAST_ROW[uid] = _state_to_log(state, counter_move)
-                # Add a placeholder row carrying the player's own next move (will be
-                # overwritten when the next /predict arrives) so we still log even
-                # if the user never comes back.
-                LOG_BUFFER[uid].append(LAST_ROW[uid])
-                buf_size = len(LOG_BUFFER[uid])
-                flushed = FLUSHED_COUNT[uid]
-            # Flush if buffer is large or stale
-            now = time.time()
-            with state_lock:
-                last_flush = LAST_FLUSH_AT.get(uid, 0)
-            if buf_size >= LOG_BUFFER_FLUSH_ROWS or (
-                buf_size > 0 and now - last_flush > LOG_BUFFER_FLUSH_SEC
-            ):
-                with state_lock:
-                    LAST_FLUSH_AT[uid] = now
-                # Flush in a background thread so /predict isn't blocked
-                threading.Thread(target=_flush_user_log,
-                                 args=(uid,), daemon=True).start()
-            _maybe_trigger_retrain(uid)
-        return JSONResponse(content={
-            "reasoning": reasoning,
-            "counterMove": counter_move,
-            "sequence": sequence,
-            "adapterScope": scope,
-        })
-    except Exception as e:
-        log.exception("predict failed: %s", e)
-        return JSONResponse(
-            content={"reasoning": f"(error: {type(e).__name__}: {e})",
-                     "counterMove": "jab", "sequence": sequence},
-            status_code=500,
-        )
-# ---- Gradio UI (mounted on top of FastAPI for the Build-with-Gradio hackathon)
-def _gradio_predict(sequence: str, round_n: float, distance: str):
-    """Gradio-friendly wrapper that reuses the exact same inference path as
-    /predict — no double HTTP hop, single model instance, single LRU cache."""
-    if not HAS_MODEL:
-        return (
-            "⚠️ **Model not loaded yet.** Hit *Counter* again in a few seconds — "
-            "the 270M Gemma + LoRA adapter is warming up on first call.",
-            "—",
-            "model-not-loaded",
-        )
-    sequence = (sequence or "").strip()
-    if not sequence:
-        return "_No sequence provided._", "—", "no-input"
-    state = {
-        "sequence":  sequence,
-        "player":    dict(DEFAULT_PLAYER),
-        "npc":       dict(DEFAULT_NPC),
-        "round":     int(round_n) if round_n else 1,
-        "distance":  distance or "close",
-        "playerId":  "",
-    }
-    try:
-        model, scope = get_model_for(None)
-        text, counter = _generate_with_model(model, state)
-        reasoning = (
-            text.split("counter_move:")[0].strip()
-            if "counter_move:" in text
-            else text.strip()
-        )
-        if not reasoning:
-            reasoning = "_(no reasoning emitted)_"
-        return reasoning, counter, scope
-    except Exception as e:
-        log.exception("gradio predict failed: %s", e)
-        return (
-            f"⚠️ Inference error: `{type(e).__name__}: {e}`",
-            "jab",
-            "error",
-        )
-def _service_status():
-    return {
-        "ready":        HAS_MODEL,
-        "base":         BASE_MODEL,
-        "adapter":      ADAPTER_MODEL,
-        "online_rl":    ONLINE_RL_ENABLED,
-        "legal_moves":  list(LEGAL_MOVES),
-    }
-with gr.Blocks(
-    title="Cyber Duel Tiny — Combat Advisor",
-    theme=gr.themes.Soft(primary_hue="purple", secondary_hue="blue"),
-    css="""
-    .counter-badge {font-size:1.6em;font-weight:800;letter-spacing:.04em;
-                   color:#fff;
-                   background:linear-gradient(135deg,#7c3aed,#2563eb);
-                   padding:14px 18px;border-radius:12px;text-align:center;
-                   text-transform:uppercase;}
-    .reasoning-box{font-family:ui-monospace,SFMono-Regular,Menlo,monospace;
-                   font-size:0.95em;}
-    .legal-chip{display:inline-block;margin:2px 4px;padding:4px 10px;
-                background:#1f2937;color:#c4b5fd;border-radius:999px;
-                font-size:0.85em;font-family:ui-monospace,monospace;}
-    """,
-) as demo:
-    gr.Markdown(
-        f"""
-        # ⚔️ Cyber Duel Tiny — Combat Advisor
-        Fine-tuned **Gemma 3 270M + LoRA** (`{ADAPTER_MODEL}`) trained on procedural
-        rollouts from the in-game combat resolver. Given the player's last 5 moves,
-        the model recommends one of the 9 legal counter-moves.
-        """
-    )
-    gr.Markdown(
-        "<sub>Legal moves: "
-        + " ".join(f'<span class="legal-chip">{m}</span>' for m in LEGAL_MOVES)
-        + "</sub>"
-    )
-    with gr.Row():
-        status_box = gr.JSON(label="Service status", scale=2)
-    with gr.Row():
-        with gr.Column(scale=1):
-            sequence_in = gr.Textbox(
-                label="Player's last 5 moves (comma-separated)",
-                value="jab,cross,low_kick,roundhouse,uppercut",
-                placeholder="e.g. jab,cross,jab,cross,jab",
-            )
-            with gr.Row():
-                round_in = gr.Slider(1, 5, value=1, step=1, label="Round")
-                distance_in = gr.Radio(
-                    ["close", "mid", "far"], value="close", label="Distance"
-                )
-            run_btn = gr.Button("Counter ⚡", variant="primary", size="lg")
-            gr.Examples(
-                examples=[
-                    ["jab,jab,jab,jab,jab", 1, "close"],
-                    ["uppercut,uppercut,uppercut,uppercut,uppercut", 3, "close"],
-                    ["low_kick,low_kick,roundhouse,roundhouse,uppercut", 2, "mid"],
-                    ["parry,parry,backstep,parry,parry", 4, "mid"],
-                    ["clinch,clinch,clinch,clinch,clinch", 5, "close"],
-                ],
-                inputs=[sequence_in, round_in, distance_in],
-                label="Try these patterns",
-            )
-        with gr.Column(scale=1):
-            move_out = gr.Textbox(
-                label="Counter move",
-                value="—",
-                interactive=False,
-                elem_classes=["counter-badge"],
-            )
-            scope_out = gr.Textbox(
-                label="Adapter scope", value="—", interactive=False
-            )
-            reasoning_out = gr.Markdown(
-                value="_Press *Counter ⚡* to see the model's reasoning._",
-                label="Reasoning",
-                elem_classes=["reasoning-box"],
-            )
-    run_btn.click(
-        _gradio_predict,
-        inputs=[sequence_in, round_in, distance_in],
-        outputs=[reasoning_out, move_out, scope_out],
-    ).then(_service_status, outputs=status_box)
-    demo.load(_service_status, outputs=status_box)
-# Mount Gradio at / on top of the existing FastAPI app.
-# All FastAPI routes (/predict, /health, /me, /forget) keep their original paths
-# — the 3D-game client in the parent project doesn't have to change anything.
-app = gr.mount_gradio_app(app, demo, path="/")
-if __name__ == "__main__":
-    import uvicorn
-    uvicorn.run(app, host="0.0.0.0", port=int(os.environ.get("PORT", 7860)))

+import os
+import sys
+import json
+import logging
+import gradio as gr
+from fastapi import FastAPI, Request
+from fastapi.responses import JSONResponse, HTMLResponse, FileResponse, RedirectResponse
+from fastapi.staticfiles import StaticFiles
+from fastapi.middleware.cors import CORSMiddleware
+# ------------------------------------------------------------------
+# ML stack (optional â€” UI works fine in mock mode without it)
+# ------------------------------------------------------------------
+HAS_ML = False
+try:
+    import torch  # noqa: F401
+    from transformers import AutoTokenizer, AutoModelForCausalLM, AutoConfig  # noqa: F401
+    from peft import PeftModel  # noqa: F401
+    HAS_ML = True
+except Exception:
+    HAS_ML = False
+BASE_MODEL = "google/gemma-3-4b-it"
+ADAPTER_MODEL = "Sathvik0101/gemma-3-combat-npc-adapter"
+# ------------------------------------------------------------------
+# Configuration
+# ------------------------------------------------------------------
+HOST = os.environ.get("HOST", "0.0.0.0")
+PORT = int(os.environ.get("PORT", "7860"))
+SKIP_MODEL_LOAD = os.environ.get("SKIP_MODEL_LOAD", "0") == "1"
+logging.basicConfig(
+    level=os.environ.get("LOG_LEVEL", "INFO"),
+    format="%(asctime)s | %(levelname)s | %(message)s",
+)
+log = logging.getLogger("duel-of-albion")
+# ------------------------------------------------------------------
+# Token / model state
+# ------------------------------------------------------------------
+def get_hf_token():
+    token = os.environ.get("HF_TOKEN") or os.environ.get("HUGGINGFACE_TOKEN")
+    if token:
+        return token
+    token_path = os.path.expanduser("~/.cache/huggingface/token")
+    if os.path.exists(token_path):
+        try:
+            with open(token_path, "r") as f:
+                return f.read().strip()
+        except Exception:
+            pass
+    return None
+hf_token = get_hf_token()
+HAS_MODEL = False
+MODEL_ERROR = ""
+model = None
+tokenizer = None
+# ------------------------------------------------------------------
+# Model loading (skipped when SKIP_MODEL_LOAD=1, when transformers/peft
+# is missing, or when there is no HF token â€” UI testing never needs it)
+# ------------------------------------------------------------------
+if not HAS_ML:
+    log.info("ML stack not installed â€” running in MOCK MODE (UI only).")
+elif SKIP_MODEL_LOAD:
+    log.info("SKIP_MODEL_LOAD=1 â€” skipping model load (UI only).")
+elif not hf_token:
+    log.info("No HF_TOKEN found â€” running in MOCK MODE. Set HF_TOKEN to enable the AI model.")
+else:
+    log.info("Loading tokenizer and base model...")
+    try:
+        from huggingface_hub import snapshot_download
+        import torch
+        tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL, token=hf_token)
+        device_arg = "auto" if torch.cuda.is_available() else None
+        dtype = torch.bfloat16 if torch.cuda.is_available() else torch.float32
+        config = AutoConfig.from_pretrained(BASE_MODEL, token=hf_token)
+        if hasattr(config, "vision_config") and config.vision_config is not None:
+            config.vision_config = None
+            log.info("Stripped vision_config to force text-only load path.")
+        base_model = AutoModelForCausalLM.from_pretrained(
+            BASE_MODEL,
+            config=config,
+            token=hf_token,
+            torch_dtype=dtype,
+            device_map=device_arg,
+        )
+        adapter_path = snapshot_download(repo_id=ADAPTER_MODEL, token=hf_token, force_download=True)
+        model = PeftModel.from_pretrained(base_model, adapter_path)
+        model.eval()
+        warmup_prompt = (
+            "<start_of_turn>user\nYou are an expert fighting game NPC AI. "
+            "The user has performed this sequence of 5 moves: jab,cross,low_kick,roundhouse,uppercut.\n"
+            "Decide on the best counter-move from: jab, cross, low_kick, roundhouse, uppercut, parry, backstep, clinch, throw.\n"
+            "Respond in this format:\n[reasoning]\ncounter_move: [move]"
+            "<end_of_turn>\n<start_of_turn>model\n"
+        )
+        warmup_inputs = tokenizer(warmup_prompt, return_tensors="pt").to(model.device)
+        with torch.no_grad():
+            _ = model.generate(
+                **warmup_inputs,
+                max_new_tokens=20,
+                do_sample=False,
+                pad_token_id=tokenizer.eos_token_id,
+            )
+        HAS_MODEL = True
+        log.info("Model loaded and warmed up.")
+    except Exception as e:
+        import traceback
+        MODEL_ERROR = f"{type(e).__name__}: {e}"
+        log.warning("Model load failed: %s", MODEL_ERROR)
+        log.debug(traceback.format_exc())
+        model = None
+        tokenizer = None
+# ------------------------------------------------------------------
+# Inference helper
+# ------------------------------------------------------------------
+MOCK_COUNTERS = ["jab", "cross", "low_kick", "roundhouse", "uppercut", "parry", "backstep", "clinch", "throw"]
+def run_gemma(moves_sequence: str) -> str:
+    if not HAS_MODEL:
+        import time, random
+        time.sleep(0.25)
+        # Offline mode: return a clean scripted counter without leaking any
+        # raw model-loader diagnostics to the player UI.
+        reasoning = (
+            f"Mock Analysis: player performed {moves_sequence}. "
+            "AI opponent is offline â€” using scripted counters."
+        )
+        return json.dumps({
+            "reasoning": reasoning,
+            "counterMove": random.choice(MOCK_COUNTERS),
+            "sequence": moves_sequence,
+        })
+    prompt = (
+        f"<start_of_turn>user\n"
+        f"You are an expert fighting game NPC AI. "
+        f"The user has performed this sequence of 5 moves: {moves_sequence}.\n"
+        f"Observe the pattern and decide on the best counter-move from: "
+        f"jab, cross, low_kick, roundhouse, uppercut, parry, backstep, clinch, throw.\n"
+        f"Respond in this format:\n"
+        f"[Your reasoning about the player's pattern and tendencies]\n"
+        f"counter_move: [your chosen counter move]"
+        f"<end_of_turn>\n<start_of_turn>model\n"
+    )
+    import torch
+    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
+    with torch.no_grad():
+        outputs = model.generate(
+            **inputs,
+            max_new_tokens=80,
+            temperature=0.2,
+            do_sample=True,
+            pad_token_id=tokenizer.eos_token_id,
+        )
+    text = tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True)
+    reasoning = "Unable to process reasoning."
+    counter_move = "jab"
+    if "counter_move:" in text:
+        parts = text.split("counter_move:")
+        reasoning = parts[0].strip()
+        counter_move = parts[1].strip()
+    else:
+        reasoning = text.strip()
+    return json.dumps({"reasoning": reasoning, "counterMove": counter_move, "sequence": moves_sequence})
+# ------------------------------------------------------------------
+# FastAPI app setup
+# ------------------------------------------------------------------
+app = FastAPI(title="Duel of Albion - Gemma AI Fighter")
+app.add_middleware(
+    CORSMiddleware,
+    allow_origins=["*"],
+    allow_credentials=True,
+    allow_methods=["*"],
+    allow_headers=["*"],
+)
+# ------------------------------------------------------------------
+# Static files: the built React game
+# ------------------------------------------------------------------
+PROJECT_ROOT = os.path.dirname(os.path.abspath(__file__))
+STATIC_DIR = os.path.join(PROJECT_ROOT, "3d-game", "dist")
+STATIC_DIR_EXISTS = os.path.isdir(STATIC_DIR)
+if STATIC_DIR_EXISTS:
+    # IMPORTANT: do NOT mount anything at /assets.
+    # Gradio serves its own JS/CSS bundles at /assets/* (e.g.
+    # /assets/index-DputZZxm.js). Mounting at /assets would shadow
+    # Gradio's handler and return 404 for those, leaving the Gradio
+    # shell blank. The React build's own assets are served via the
+    # /game/{path:path} catch-all below.
+    @app.get("/favicon.ico", include_in_schema=False)
+    async def favicon_ico():
+        return FileResponse(
+            os.path.join(STATIC_DIR, "favicon.svg"),
+            media_type="image/svg+xml",
+        )
+    @app.get("/favicon.svg", include_in_schema=False)
+    async def favicon_svg():
+        return FileResponse(os.path.join(STATIC_DIR, "favicon.svg"))
+    @app.get("/manifest.json", include_in_schema=False)
+    async def manifest():
+        return JSONResponse(content={
+            "name": "Duel of Albion",
+            "short_name": "Duel of Albion",
+            "start_url": "/game/",
+            "display": "fullscreen",
+            "background_color": "#05040a",
+            "theme_color": "#05040a",
+            "icons": [],
+        })
+    NO_CACHE = {
+        "Cache-Control": "no-cache, no-store, must-revalidate",
+        "Pragma": "no-cache",
+        "Expires": "0",
+    }
+    @app.get("/models/{path:path}", include_in_schema=False)
+    async def game_models(path: str):
+        static_root = os.path.normpath(STATIC_DIR)
+        candidate = os.path.normpath(os.path.join(STATIC_DIR, "models", path))
+        if not candidate.startswith(static_root):
+            return HTMLResponse("Forbidden", status_code=403)
+        if os.path.isfile(candidate):
+            return FileResponse(candidate)
+        return HTMLResponse("Not Found", status_code=404)
+    @app.get("/game", include_in_schema=False)
+    @app.get("/game/", include_in_schema=False)
+    async def game_index():
+        return FileResponse(
+            os.path.join(STATIC_DIR, "index.html"),
+            headers=NO_CACHE,
+        )
+    @app.get("/game/{path:path}", include_in_schema=False)
+    async def game_spa(path: str):
+        # Path-traversal guard
+        static_root = os.path.normpath(STATIC_DIR)
+        candidate = os.path.normpath(os.path.join(STATIC_DIR, path))
+        if not candidate.startswith(static_root):
+            return HTMLResponse("Forbidden", status_code=403)
+        if os.path.isfile(candidate):
+            return FileResponse(candidate)
+        # SPA fallback â€” unknown path -> index.html
+        return FileResponse(
+            os.path.join(STATIC_DIR, "index.html"),
+            headers=NO_CACHE,
+        )
+else:
+    @app.get("/game", include_in_schema=False)
+    @app.get("/game/", include_in_schema=False)
+    async def game_not_built():
+        return HTMLResponse(
+            "<h1 style='font-family:sans-serif;color:#f0e6d2;background:#05040a;"
+            "padding:40px;'>React game not built</h1>"
+            "<p style='font-family:sans-serif;color:#b8a88a;background:#05040a;"
+            "padding:0 40px 40px;'>Run <code>npm run build</code> in "
+            "<code>3d-game/</code> to produce the dist directory.</p>",
+            status_code=404,
+        )
+# ------------------------------------------------------------------
+# API endpoints used by the React game
+# ------------------------------------------------------------------
+@app.get("/health")
+async def health():
+    return JSONResponse(content={
+        "ready": HAS_MODEL,
+        "has_ml": HAS_ML,
+        "skip_model_load": SKIP_MODEL_LOAD,
+        "has_token": bool(hf_token),
+    })
+@app.post("/predict")
+async def predict(request: Request):
+    try:
+        data = await request.json()
+    except Exception:
+        data = {}
+    sequence = data.get("sequence", "")
+    result_str = run_gemma(sequence)
+    try:
+        result = json.loads(result_str)
+    except Exception:
+        result = {"reasoning": result_str, "counterMove": "jab", "sequence": sequence}
+    return JSONResponse(content=result)
+# ------------------------------------------------------------------
+# Gradio UI â€” full-screen game shell
+# ------------------------------------------------------------------
+# The game is always served through a Gradio container. The React app is
+# embedded full-screen in an iframe so it keeps its own rendering / input
+# logic, while Gradio provides the hosting layer (HF Spaces, local, etc.).
+css = """
+/* Kill every Gradio container from html down â€” nothing should add
+   padding, margin, gaps, borders, or constrained height. */
+html,body,#root,.app,.gradio-container,
+.gradio-container>.main,.gradio-container>.main>.wrap,
+.gradio-container .column,.gradio-container .column>.form,
+.gradio-container [class*="container"],
+.gradio-container [class*="panel"],
+.gradio-container [class*="gap"] {
+  background:#05040a!important;
+  padding:0!important;margin:0!important;
+  max-width:none!important;width:100%!important;height:100%!important;
+  min-height:100vh!important;
+  border:none!important;box-shadow:none!important;gap:0!important;
+  overflow:hidden!important;
+}
+/* Hide all Gradio chrome: splash, footer, loader, status bar */
+#app_splash,.splash,.loading,.loader,
+.progress,.progress-bar,.meta-loader,
+footer,.footer,.gradio-footer,.built-with,
+#component-status,.meta,[class*="built-with"],
+[class*="splash"],[class*="loader"],
+.svelte-1ipelgc {
+  display:none!important;visibility:hidden!important;opacity:0!important;
+  height:0!important;width:0!important;overflow:hidden!important;
+}
+/* The Column with class game-wrap fills the viewport */
+.game-wrap,
+.game-wrap>.form {
+  position:fixed!important;inset:0!important;
+  width:100vw!important;height:100vh!important;
+  padding:0!important;margin:0!important;
+  overflow:hidden!important;background:#05040a!important;
+  z-index:2147483647;
+}
+/* The iframe itself is also fixed so it ignores any intermediate wrappers
+   Gradio may insert around gr.HTML */
+#game-iframe,.game-wrap iframe {
+  position:fixed!important;top:0!important;left:0!important;
+  width:100vw!important;height:100vh!important;
+  border:none!important;display:block!important;
+  background:#05040a!important;z-index:2147483647;
+}
+"""
+with gr.Blocks(title="Duel of Albion", css=css, theme=gr.themes.Soft()) as demo:
+    if STATIC_DIR_EXISTS:
+        game_url = f"/game/?v={os.environ.get('BUILD_ID', int.from_bytes(os.urandom(2), 'big'))}"
+    else:
+        game_url = "/game"
+    with gr.Column(elem_classes="game-wrap"):
+        gr.HTML(
+            f'<iframe id="game-iframe" src="{game_url}" allowfullscreen '
+            'allow="autoplay; fullscreen; gamepad; xr-spatial-tracking" '
+            'sandbox="allow-scripts allow-same-origin allow-forms allow-popups" '
+            'style="background:#05040a;"></iframe>'
+        )
+app = gr.mount_gradio_app(app, demo, path="/")
+log.info("Mounted Gradio shell at /.")
+if __name__ == "__main__":
+    log.info("=" * 60)
+    log.info("Duel of Albion â€” Gradio server")
+    log.info("  Project root : %s", PROJECT_ROOT)
+    log.info("  Static dir  : %s (exists=%s)", STATIC_DIR, STATIC_DIR_EXISTS)
+    log.info("  Has ML stack: %s", HAS_ML)
+    log.info("  Has model   : %s", HAS_MODEL)
+    log.info("  HF token    : %s", "yes" if hf_token else "no")
+    log.info("  URL         : http://%s:%s/", "localhost" if HOST == "0.0.0.0" else HOST, PORT)
+    log.info("=" * 60)
+    import uvicorn
+    uvicorn.run(app, host=HOST, port=PORT, log_level="info")

static/assets/index-DUDqKLMt.css ADDED Viewed

	@@ -0,0 +1 @@

+ body{-webkit-font-smoothing:antialiased;-moz-osx-font-smoothing:grayscale;background-color:#050510;width:100vw;height:100vh;margin:0;font-family:Segoe UI,system-ui,-apple-system,sans-serif;overflow:hidden}#root{width:100%;height:100%}*{box-sizing:border-box;margin:0;padding:0}html,body,#root{color:#fff;background:#050510;width:100%;height:100%;font-family:Segoe UI,system-ui,-apple-system,sans-serif;overflow:hidden}.game-container{width:100vw;height:100vh;position:relative}.game-container canvas{display:block;width:100%!important;height:100%!important}.hud{pointer-events:none;z-index:10;justify-content:space-between;align-items:flex-start;gap:16px;width:100%;padding:16px 24px;display:flex;position:absolute;top:0;left:0}.hp-section{width:35%}.hp-label{text-transform:uppercase;letter-spacing:2px;margin-bottom:4px;font-size:11px;font-weight:700}.hp-label.player{color:#00d4ff;text-shadow:0 0 10px #00d4ff66}.hp-label.npc{color:#f34;text-align:right;text-shadow:0 0 10px #f346}.hp-bar-bg{background:#1a1a2e;border:1px solid #333;border-radius:3px;width:100%;height:14px;overflow:hidden}.hp-bar{border-radius:2px;height:100%;transition:width .3s}.hp-bar.player{background:linear-gradient(90deg,#00d4ff,#08f);box-shadow:0 0 10px #00d4ff44}.hp-bar.npc{float:right;background:linear-gradient(90deg,#f34,#f64);box-shadow:0 0 10px #f344}.center-info{text-align:center;pointer-events:none;z-index:10;position:absolute;top:12px;left:50%;transform:translate(-50%)}.round-info{color:#555;letter-spacing:2px;text-transform:uppercase;font-size:10px}.score{letter-spacing:4px;margin-top:4px;font-size:22px;font-weight:900}.score .player{color:#00d4ff}.score .npc{color:#f34}.score .dash{color:#444}.hit-splash{pointer-events:none;z-index:20;text-shadow:0 0 20px #ffffff80;font-size:30px;font-weight:900;animation:.5s ease-out forwards hitFade;position:absolute;top:38%;left:50%;transform:translate(-50%,-50%)}@keyframes hitFade{0%{opacity:1;transform:translate(-50%,-50%)scale(1.3)}to{opacity:0;transform:translate(-50%,-70%)scale(1)}}.controls-hint{color:#444;pointer-events:none;z-index:10;font-size:10px;line-height:1.8;position:absolute;bottom:14px;left:16px}.controls-hint kbd{color:#666;background:#1a1a2e;border:1px solid #333;border-radius:3px;margin:0 1px;padding:2px 5px;font-family:inherit;font-size:9px;display:inline-block}.combo-info{color:#444;pointer-events:none;z-index:10;text-align:right;font-size:10px;line-height:1.6;position:absolute;bottom:14px;right:16px}.overlay{z-index:100;cursor:pointer;background:#050510eb;flex-direction:column;justify-content:center;align-items:center;width:100%;height:100%;display:flex;position:absolute;top:0;left:0}.overlay h1{letter-spacing:6px;text-transform:uppercase;background:linear-gradient(135deg,#00d4ff,#a4f,#f34);-webkit-text-fill-color:transparent;text-align:center;-webkit-background-clip:text;background-clip:text;margin-bottom:20px;font-size:42px;font-weight:900}.overlay .subtitle{color:#666;text-align:center;max-width:420px;margin-bottom:24px;font-size:14px;line-height:1.7}.overlay .start-hint{color:#555;font-size:12px;animation:2s ease-in-out infinite pulse}@keyframes pulse{0%,to{opacity:.5}50%{opacity:1}}.game-over{z-index:50;background:#050510f0;flex-direction:column;justify-content:center;align-items:center;width:100%;height:100%;display:flex;position:absolute;top:0;left:0}.game-over h1{letter-spacing:4px;margin-bottom:12px;font-size:48px;font-weight:900}.game-over .subtitle{color:#888;margin-bottom:32px;font-size:15px}.game-over button{text-transform:uppercase;letter-spacing:2px;cursor:pointer;color:#fff;pointer-events:all;background:linear-gradient(135deg,#00d4ff,#06f);border:none;border-radius:4px;padding:14px 44px;font-size:14px;font-weight:700;transition:filter .2s,transform .2s}.game-over button:hover{filter:brightness(1.2);transform:scale(1.05)}.round-overlay{z-index:40;pointer-events:none;background:#000000bf;flex-direction:column;justify-content:center;align-items:center;width:100%;height:100%;animation:.3s ease-out roundFadeIn;display:flex;position:absolute;top:0;left:0}@keyframes roundFadeIn{0%{opacity:0}to{opacity:1}}.round-overlay h2{letter-spacing:3px;font-size:40px;font-weight:900}.round-overlay p{color:#888;margin-top:8px;font-size:16px}

static/assets/index-Dp4pBtge.js ADDED Viewed

The diff for this file is too large to render. See raw diff

static/favicon.svg ADDED Viewed

static/icons.svg ADDED Viewed

static/index.html ADDED Viewed

	@@ -0,0 +1,30 @@

+<!doctype html>
+<html lang="en">
+  <head>
+    <meta charset="UTF-8" />
+    <link rel="icon" type="image/svg+xml" href="/favicon.svg" />
+    <meta name="viewport" content="width=device-width, initial-scale=1.0" />
+    <title>Shadow Duel AI</title>
+    <script type="module" crossorigin src="/assets/index-Dp4pBtge.js"></script>
+    <link rel="stylesheet" crossorigin href="/assets/index-DUDqKLMt.css">
+    <script>
+    // AI Bridge: receives AI_RESPONSE from Gradio parent and forwards to game
+    window.addEventListener("message", function(e) {
+        if (e.data && e.data.type === "AI_RESPONSE") {
+            window.dispatchEvent(new CustomEvent("ai-response", { detail: e.data }));
+        }
+    });
+    window.sendAIRequest = function(sequence) {
+        try {
+            window.parent.postMessage({
+                type: 'AI_REQUEST',
+                sequence: sequence
+            }, '*');
+        } catch(e) {}
+    };
+    </script>
+  </head>
+  <body>
+    <div id="root"></div>
+  </body>
+</html>

three.min.js ADDED Viewed

The diff for this file is too large to render. See raw diff