oncall-guide-ai / evaluation /direct_llm_evaluator.py

Commit History

Enhance direct LLM evaluation with retry mechanism for 504 timeouts and improved guidance format
3edd46d

YanBoChen commited on

Update query file references for full evaluation and correct typo in pre_user_query_evaluate.txt for pre-test.
e84171b

YanBoChen commited on

Enhance Direct LLM Evaluator and Judge Evaluator:
40d39ed

YanBoChen commited on

Add multi-system evaluation support for clinical actionability and evidence quality metrics
16a2990

YanBoChen commited on

feat: Add Extraction, LLM Judge, and Relevance Chart Generators
17613c8

YanBoChen commited on