tohid.abedini
commited on
Commit
·
984c68a
1
Parent(s):
7c72599
test
Browse files
utils.py
CHANGED
@@ -130,7 +130,7 @@ LLM_BENCHMARKS_ABOUT_TEXT = f"""
|
|
130 |
> - **GSM8k Persian**
|
131 |
> - **Multiple Choice Persian**
|
132 |
>
|
133 |
-
> Each dataset is available in Persian, providing a robust testing ground for models in a non-English setting.
|
134 |
>
|
135 |
> 3. **Open-Source Dataset Sample**
|
136 |
> A sample of the evaluation dataset is hosted on [Hugging Face Datasets](https://huggingface.co/datasets/PartAI/llm-leaderboard-datasets-sample), offering the AI community a glimpse of the benchmark content and format. This sample allows developers to pre-assess their models against representative data before a full leaderboard evaluation.
|
|
|
130 |
> - **GSM8k Persian**
|
131 |
> - **Multiple Choice Persian**
|
132 |
>
|
133 |
+
> Each dataset is available in Persian, providing a robust testing ground for models in a non-English setting. The datasets collectively contain over **40k samples** across various categories such as **Common Knowledge**, **Reasoning**, **Summarization**, **Math**, and **Specialized Examinations**, offering comprehensive coverage of diverse linguistic and technical challenges.
|
134 |
>
|
135 |
> 3. **Open-Source Dataset Sample**
|
136 |
> A sample of the evaluation dataset is hosted on [Hugging Face Datasets](https://huggingface.co/datasets/PartAI/llm-leaderboard-datasets-sample), offering the AI community a glimpse of the benchmark content and format. This sample allows developers to pre-assess their models against representative data before a full leaderboard evaluation.
|