fix: query budget 500, sync all hardcoded refs across codebase 65f7de1 Itachi-1824 commited on Apr 11
fix: UI polish β 9 scenario count, styled empty state, richer Try It tab with How It Works + curl examples 0246380 Itachi-1824 commited on Apr 10
feat: playground all-fields-visible grid, loop detection, 200 query budget 2e85e72 Itachi-1824 commited on Apr 10
feat: playground overhaul β tool-specific fields, scenario picker, live status dashboard c58042f Itachi-1824 commited on Apr 10
fix: brutal audit β reset tool_call_counts, date dedup, unused vars, playground overhaul with scenario picker + status dashboard b4d7ce3 Itachi-1824 commited on Apr 10
fix: python 3.11 f-string compat, inference OPENENV_BASE_URL fix, README action/observation spaces 3ca879f Itachi-1824 commited on Apr 10
feat: /grader endpoint, validate-submission.sh, session cleanup, final polish 2dc4e1e Itachi-1824 commited on Apr 10
fix: DEBUG to stderr, scores (0.001,0.999), OPENENV_BASE_URL support, clean imports 1f75c3f Itachi-1824 commited on Apr 10
fix: 3 showstoppers (dockerignore, END format, efficiency gaming) + license + unused imports 87bdc12 Itachi-1824 commited on Apr 10
fix: all critical audit issues β [END] format, finding matching, methodology scoring, LLM timeout, error handling 682f6be Itachi-1824 commited on Apr 10
fix: typed Action model, OPENAI_API_KEY support, proper spec compliance 9c88cc5 Itachi-1824 commited on Apr 10
fix: auto-grade on budget exhaustion + max steps, no more 0.01 dead scores e834bf5 Itachi-1824 commited on Apr 10
fix: readable state graphs (TD layout, short labels, progress-only) f44e662 Itachi-1824 commited on Apr 10
feat: test suite (9 tests), dynamic leaderboard, mermaid graphs 5a7cd93 Itachi-1824 commited on Apr 10
fix: mount gradio at /web (hf iframe path), disable openenv default ui 696bc46 Itachi-1824 commited on Apr 10
feat: custom gradio dashboard (6 tabs, charcoal+gold, leaderboard, playground, architecture) b2cf922 Itachi-1824 commited on Apr 10
feat: premium ui (charcoal+gold authority palette), multi-model eval, /web mount fix 51a14c0 Itachi-1824 commited on Apr 10
feat: eu ai act compliance auditor β mcp-based openenv environment 5d5e37e Itachi-1824 commited on Apr 10