Add comprehensive evaluation reports and execution time breakdown for Hospital Customization System
24f6a16
YanBoChencommited on
Update query file references for full evaluation and correct typo in pre_user_query_evaluate.txt for pre-test.
e84171b
YanBoChencommited on
Merge branch 'newbranchYB-newest' into Merged20250805
abbc1cd
YanBoChencommited on
Add adaptive relevance thresholds for query complexity in PrecisionMRRAnalyzer; fix typo in condition mapping for postpartum hemorrhage
7620d26
YanBoChencommited on
Update threshold values in latency evaluator and coverage chart generator; enhance precision and MRR analysis with corrected thresholds and new chart generator for detailed metrics visualization.
5d4792a
YanBoChencommited on
Refactor relevance calculation and update thresholds in latency evaluator; enhance precision and MRR analyzer with angular distance metrics; increase timeout for primary generation in fallback configuration.
b0f56ec
YanBoChencommited on
Enhance Direct LLM Evaluator and Judge Evaluator:
40d39ed
YanBoChencommited on
feat(evaluation): add visualization generators for generating png files