Spaces:

FoodDesert
/

Prompt_Squirrel_RAG

Running

App Files Files Community

Prompt_Squirrel_RAG

Commit History

Add structural tag inference (Stage 3s) and compact eval output

a16e111

Claude commited on Feb 12

Default min_why to strong_implied; add retrieval gap analysis script

4968635

Claude commited on Feb 11

eval with expanded GT and leaf metrics

7995e65

Food Desert commited on Feb 11

Normalize GT annotations: expand implications, exclude non-evaluable tags

14e5c38

Claude commited on Feb 11

eval with min_why=explicit + implications

6fc4b56

Food Desert commited on Feb 10

Add tag implication expansion (fox→canine→canid→mammal)

eeada1d

Claude commited on Feb 10

eval results with min_why=explicit bug fix

de8b5a3

Food Desert commited on Feb 10

Remove data/eval_results/ from .gitignore so eval results are tracked

3edd051

Claude commited on Feb 10

Fix min_why not passed to workers in parallel eval mode

054dd0f

Claude commited on Feb 10

Add latest eval results

096cdd3

Food Desert commited on Feb 10

Add --min-why threshold to filter Stage 3 selections by confidence level

09a248d

Claude commited on Feb 10

adding tag implications file

962e2b4

Food Desert commited on Feb 10

Add diagnostic eval metrics, why-distribution tracking, and generic character filter

349b999

Claude commited on Feb 10

Add n=10 eval results for analysis

df66964

Food Desert commited on Feb 10

Add parallel processing to eval pipeline with ThreadPoolExecutor

12dfa28

Claude commited on Feb 10

Add independent character tag metrics to eval pipeline

f1b4da2

Claude commited on Feb 10

Improve eval harness: shuffle samples, always write results

133d74c

Claude commited on Feb 10

Add end-to-end evaluation harness for pipeline metrics

6909d06

Claude commited on Feb 10

Expand alias filter tests with real CSV data and pipeline tests

ea9e11c

Claude commited on Feb 9

Add alias-based character tag filtering for Stage 3

c6be992

Food Desert commited on Mar 7

initial commit

4fdda86
unverified

FoodDesert commited on Feb 8

Commit History

Add structural tag inference (Stage 3s) and compact eval output a16e111

Default min_why to strong_implied; add retrieval gap analysis script 4968635

eval with expanded GT and leaf metrics 7995e65

Normalize GT annotations: expand implications, exclude non-evaluable tags 14e5c38

eval with min_why=explicit + implications 6fc4b56

Add tag implication expansion (fox→canine→canid→mammal) eeada1d

eval results with min_why=explicit bug fix de8b5a3

Remove data/eval_results/ from .gitignore so eval results are tracked 3edd051

Fix min_why not passed to workers in parallel eval mode 054dd0f

Add latest eval results 096cdd3

Add --min-why threshold to filter Stage 3 selections by confidence level 09a248d

adding tag implications file 962e2b4

Add diagnostic eval metrics, why-distribution tracking, and generic character filter 349b999

Add n=10 eval results for analysis df66964

Add parallel processing to eval pipeline with ThreadPoolExecutor 12dfa28

Add independent character tag metrics to eval pipeline f1b4da2

Improve eval harness: shuffle samples, always write results 133d74c

Add end-to-end evaluation harness for pipeline metrics 6909d06

Expand alias filter tests with real CSV data and pipeline tests ea9e11c

Add alias-based character tag filtering for Stage 3 c6be992

initial commit 4fdda86 unverified

Add structural tag inference (Stage 3s) and compact eval output

a16e111

Default min_why to strong_implied; add retrieval gap analysis script

4968635

eval with expanded GT and leaf metrics

7995e65

Normalize GT annotations: expand implications, exclude non-evaluable tags

14e5c38

eval with min_why=explicit + implications

6fc4b56

Add tag implication expansion (fox→canine→canid→mammal)

eeada1d

eval results with min_why=explicit bug fix

de8b5a3

Remove data/eval_results/ from .gitignore so eval results are tracked

3edd051

Fix min_why not passed to workers in parallel eval mode

054dd0f

Add latest eval results

096cdd3

Add --min-why threshold to filter Stage 3 selections by confidence level

09a248d

adding tag implications file

962e2b4

Add diagnostic eval metrics, why-distribution tracking, and generic character filter

349b999

Add n=10 eval results for analysis

df66964

Add parallel processing to eval pipeline with ThreadPoolExecutor

12dfa28

Add independent character tag metrics to eval pipeline

f1b4da2

Improve eval harness: shuffle samples, always write results

133d74c

Add end-to-end evaluation harness for pipeline metrics

6909d06

Expand alias filter tests with real CSV data and pipeline tests

ea9e11c

Add alias-based character tag filtering for Stage 3

c6be992

initial commit

4fdda86
unverified