Commit History

eval with expanded GT and leaf metrics
7995e65

Food Desert committed on

Normalize GT annotations: expand implications, exclude non-evaluable tags
14e5c38

Claude committed on

eval with min_why=explicit + implications
6fc4b56

Food Desert committed on

eval results with min_why=explicit bug fix
de8b5a3

Food Desert committed on

Add latest eval results
096cdd3

Food Desert committed on

Add n=10 eval results for analysis
df66964

Food Desert committed on

Add alias-based character tag filtering for Stage 3
c6be992

Food Desert committed on