oncall-guide-ai / evaluation /direct_llm_evaluator.py

Commit History

Update query file references for full evaluation and correct typo in pre_user_query_evaluate.txt for pre-test.
e84171b

YanBoChen commited on

Enhance Direct LLM Evaluator and Judge Evaluator:
40d39ed

YanBoChen commited on

Add multi-system evaluation support for clinical actionability and evidence quality metrics
16a2990

YanBoChen commited on

feat: Add Extraction, LLM Judge, and Relevance Chart Generators
17613c8

YanBoChen commited on