Spaces:
Running
Running
Commit
·
dc56e08
1
Parent(s):
fb84311
[ADD] Healthbench Citation
Browse files- src/about.py +4 -1
src/about.py
CHANGED
@@ -269,7 +269,10 @@ CROSS_EVALUATION_METRICS = """
|
|
269 |
"""
|
270 |
|
271 |
HEALTHBENCH_METRICS = """
|
272 |
-
|
|
|
|
|
|
|
273 |
"""
|
274 |
|
275 |
CITATION_BUTTON_LABEL = "Copy the following snippet to cite these results"
|
|
|
269 |
"""
|
270 |
|
271 |
HEALTHBENCH_METRICS = """
|
272 |
+
HealthBench consists of 5,000 multi-turn conversations between users (patients or clinicians) and AI models, covering a wide range of medical topics and scenarios. Each conversation is accompanied by a set of physician-created rubric criteria, totaling over 48,562 unique items, to grade model responses based on accuracy, relevance, and safety.
|
273 |
+
For more information, refer to the [HealthBench paper](https://cdn.openai.com/pdf/bd7a39d5-9e9f-47b3-903c-8b847ca650c7/healthbench_paper.pdf) and the [OpenAI blog post](https://openai.com/index/healthbench/).
|
274 |
+
|
275 |
+
**Judge Used**: [meta-llama/Llama-3.1-70B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-70B-Instruct)
|
276 |
"""
|
277 |
|
278 |
CITATION_BUTTON_LABEL = "Copy the following snippet to cite these results"
|