Prompt_Squirrel_RAG / scripts /eval_pipeline.py

Commit History

Add diagnostic eval metrics, why-distribution tracking, and generic character filter
349b999

Claude commited on

Add parallel processing to eval pipeline with ThreadPoolExecutor
12dfa28

Claude commited on

Add independent character tag metrics to eval pipeline
f1b4da2

Claude commited on

Improve eval harness: shuffle samples, always write results
133d74c

Claude commited on

Add end-to-end evaluation harness for pipeline metrics
6909d06

Claude commited on