Spaces:

UlrickBL
/

benchmark_overview

Running

UlrickBL commited on Jan 19

Commit

777ab08

verified ·

1 Parent(s): 99f7969

Update index.html

Files changed (1) hide show

index.html CHANGED Viewed

@@ -108,6 +108,9 @@
 </head>
 <body>
     <h1>LLM Benchmark overview</h1>
     <div class="filter">
         <label for="metricFilter">Filter by Evaluated task:</label>
         <select id="metricFilter">

 </head>
 <body>
     <h1>LLM Benchmark overview</h1>
+    <div>Overview of Benchmarks for LLM Evaluation
+As the development and evaluation of large language models (LLMs) continue to evolve, I conducted an overview of the principal benchmarks commonly found in research papers. My goal is to create a clear and comprehensive resource that summarizes what is being tested in LLMs, with concrete examples, key metrics, and direct links to related papers and repositories. This document serves as a centralized matrix that will be continuously updated with insights from future papers I review.</div>
     <div class="filter">
         <label for="metricFilter">Filter by Evaluated task:</label>
         <select id="metricFilter">