Update Space (evaluate main: 05209ece)
Browse files
README.md
CHANGED
|
@@ -10,6 +10,10 @@ pinned: false
|
|
| 10 |
tags:
|
| 11 |
- evaluate
|
| 12 |
- metric
|
|
|
|
|
|
|
|
|
|
|
|
|
| 13 |
---
|
| 14 |
|
| 15 |
# Metric Card for Code Eval
|
|
|
|
| 10 |
tags:
|
| 11 |
- evaluate
|
| 12 |
- metric
|
| 13 |
+
description: >-
|
| 14 |
+
This metric implements the evaluation harness for the HumanEval problem solving dataset
|
| 15 |
+
described in the paper "Evaluating Large Language Models Trained on Code"
|
| 16 |
+
(https://arxiv.org/abs/2107.03374).
|
| 17 |
---
|
| 18 |
|
| 19 |
# Metric Card for Code Eval
|