Spaces:

opll-org
/

Open-Persian-LLM-Leaderboard

Running

tohid.abedini commited on Nov 16, 2024

Commit

984c68a

1 Parent(s): 7c72599

test

Files changed (1) hide show

utils.py CHANGED Viewed

@@ -130,7 +130,7 @@ LLM_BENCHMARKS_ABOUT_TEXT = f"""
 >    - **GSM8k Persian**
 >    - **Multiple Choice Persian**
 >
->    Each dataset is available in Persian, providing a robust testing ground for models in a non-English setting.
 >
 > 3. **Open-Source Dataset Sample**
 >    A sample of the evaluation dataset is hosted on [Hugging Face Datasets](https://huggingface.co/datasets/PartAI/llm-leaderboard-datasets-sample), offering the AI community a glimpse of the benchmark content and format. This sample allows developers to pre-assess their models against representative data before a full leaderboard evaluation.

 >    - **GSM8k Persian**
 >    - **Multiple Choice Persian**
 >
+>    Each dataset is available in Persian, providing a robust testing ground for models in a non-English setting. The datasets collectively contain over **40k samples** across various categories such as **Common Knowledge**, **Reasoning**, **Summarization**, **Math**, and **Specialized Examinations**, offering comprehensive coverage of diverse linguistic and technical challenges.
 >
 > 3. **Open-Source Dataset Sample**
 >    A sample of the evaluation dataset is hosted on [Hugging Face Datasets](https://huggingface.co/datasets/PartAI/llm-leaderboard-datasets-sample), offering the AI community a glimpse of the benchmark content and format. This sample allows developers to pre-assess their models against representative data before a full leaderboard evaluation.