Spaces:

UlrickBL
/

benchmark_overview

Running

UlrickBL commited on Jan 19

Commit

192167f

verified ·

1 Parent(s): 777ab08

Update index.html

Files changed (1) hide show

index.html CHANGED Viewed

@@ -108,9 +108,7 @@
 </head>
 <body>
     <h1>LLM Benchmark overview</h1>
-    <div>Overview of Benchmarks for LLM Evaluation
-As the development and evaluation of large language models (LLMs) continue to evolve, I conducted an overview of the principal benchmarks commonly found in research papers. My goal is to create a clear and comprehensive resource that summarizes what is being tested in LLMs, with concrete examples, key metrics, and direct links to related papers and repositories. This document serves as a centralized matrix that will be continuously updated with insights from future papers I review.</div>
     <div class="filter">
         <label for="metricFilter">Filter by Evaluated task:</label>
         <select id="metricFilter">

 </head>
 <body>
     <h1>LLM Benchmark overview</h1>
+    <div>As the development and evaluation of large language models (LLMs) continue to evolve, I conducted an overview of the principal benchmarks commonly found in research papers. My goal is to create a clear and comprehensive resource that summarizes what is being tested in LLMs, with concrete examples, key metrics, and direct links to related papers and repositories. This document serves as a centralized matrix that will be continuously updated with insights from future papers I review.</div>
     <div class="filter">
         <label for="metricFilter">Filter by Evaluated task:</label>
         <select id="metricFilter">