Commit History

fix: query budget 500, sync all hardcoded refs across codebase
65f7de1

Itachi-1824 commited on

fix: UI polish β€” 9 scenario count, styled empty state, richer Try It tab with How It Works + curl examples
0246380

Itachi-1824 commited on

feat: playground all-fields-visible grid, loop detection, 200 query budget
2e85e72

Itachi-1824 commited on

feat: playground overhaul β€” tool-specific fields, scenario picker, live status dashboard
c58042f

Itachi-1824 commited on

fix: brutal audit β€” reset tool_call_counts, date dedup, unused vars, playground overhaul with scenario picker + status dashboard
b4d7ce3

Itachi-1824 commited on

fix: render placeholders in reset observation message
93016c9

Itachi-1824 commited on

fix: python 3.11 f-string compat, inference OPENENV_BASE_URL fix, README action/observation spaces
3ca879f

Itachi-1824 commited on

feat: investigation-grade overhaul + procedural generation
107f92d

Itachi-1824 commited on

feat: /grader endpoint, validate-submission.sh, session cleanup, final polish
2dc4e1e

Itachi-1824 commited on

fix: DEBUG to stderr, scores (0.001,0.999), OPENENV_BASE_URL support, clean imports
1f75c3f

Itachi-1824 commited on

fix: 3 showstoppers (dockerignore, END format, efficiency gaming) + license + unused imports
87bdc12

Itachi-1824 commited on

fix: all critical audit issues β€” [END] format, finding matching, methodology scoring, LLM timeout, error handling
682f6be

Itachi-1824 commited on

fix: typed Action model, OPENAI_API_KEY support, proper spec compliance
9c88cc5

Itachi-1824 commited on

fix: auto-grade on budget exhaustion + max steps, no more 0.01 dead scores
e834bf5

Itachi-1824 commited on

fix: replace oversized mermaid with clean text flow diagrams
372e4aa

Itachi-1824 commited on

fix: readable state graphs (TD layout, short labels, progress-only)
f44e662

Itachi-1824 commited on

feat: test suite (9 tests), dynamic leaderboard, mermaid graphs
5a7cd93

Itachi-1824 commited on

feat: mermaid state-graph diagrams in scenarios tab
a07df20

Itachi-1824 commited on

fix: mount gradio at / (maverick pattern)
a0d8e70

Itachi-1824 commited on

fix: mount gradio at /web (hf iframe path), disable openenv default ui
696bc46

Itachi-1824 commited on

feat: custom gradio dashboard (6 tabs, charcoal+gold, leaderboard, playground, architecture)
b2cf922

Itachi-1824 commited on

fix: enable web interface, mount custom ui at /dashboard
2406619

Itachi-1824 commited on

feat: premium ui (charcoal+gold authority palette), multi-model eval, /web mount fix
51a14c0

Itachi-1824 commited on

fix: robust gradio mount with html fallback for hf spaces
a2bdc87

Itachi-1824 commited on

feat: eu ai act compliance auditor β€” mcp-based openenv environment
5d5e37e

Itachi-1824 commited on