Spaces:

m42-health
/

MEDIC-Benchmark

Running

tathagataraha commited on May 16

Commit

dc56e08

1 Parent(s): fb84311

[ADD] Healthbench Citation

Files changed (1) hide show

src/about.py CHANGED Viewed

@@ -269,7 +269,10 @@ CROSS_EVALUATION_METRICS = """
 """
 HEALTHBENCH_METRICS = """
-OpenAI HealthBench
 """
 CITATION_BUTTON_LABEL = "Copy the following snippet to cite these results"

 """
 HEALTHBENCH_METRICS = """
+HealthBench consists of 5,000 multi-turn conversations between users (patients or clinicians) and AI models, covering a wide range of medical topics and scenarios. Each conversation is accompanied by a set of physician-created rubric criteria, totaling over 48,562 unique items, to grade model responses based on accuracy, relevance, and safety.
+For more information, refer to the [HealthBench paper](https://cdn.openai.com/pdf/bd7a39d5-9e9f-47b3-903c-8b847ca650c7/healthbench_paper.pdf) and the [OpenAI blog post](https://openai.com/index/healthbench/).
+**Judge Used**: [meta-llama/Llama-3.1-70B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-70B-Instruct)
 """
 CITATION_BUTTON_LABEL = "Copy the following snippet to cite these results"