Spaces:

UlrickBL
/

benchmark_overview

Running

UlrickBL commited on Apr 8

Commit

80c20a7

verified ·

1 Parent(s): 6bc3b5e

Update index.html

Files changed (1) hide show

index.html CHANGED Viewed

@@ -107,7 +107,7 @@
     </style>
 </head>
 <body>
-    <h1>LLM Benchmark overview</h1>
     <div>As the development and evaluation of large language models (LLMs) continue to evolve, I conducted an overview of the principal benchmarks commonly found in research papers. My goal is to create a clear and comprehensive resource that summarizes what is being tested in LLMs, with concrete examples, key metrics, and direct links to related papers and repositories. This document serves as a centralized matrix that will be continuously updated with insights from future papers I review.</div>
     <div class="filter">
         <label for="metricFilter">Filter by Evaluated task:</label>

     </style>
 </head>
 <body>
+    <h1>LLM Benchmark overview (update ongoing) </h1>
     <div>As the development and evaluation of large language models (LLMs) continue to evolve, I conducted an overview of the principal benchmarks commonly found in research papers. My goal is to create a clear and comprehensive resource that summarizes what is being tested in LLMs, with concrete examples, key metrics, and direct links to related papers and repositories. This document serves as a centralized matrix that will be continuously updated with insights from future papers I review.</div>
     <div class="filter">
         <label for="metricFilter">Filter by Evaluated task:</label>