Spaces:
Running
Running
Update index.html
Browse files- index.html +1 -1
index.html
CHANGED
@@ -107,7 +107,7 @@
|
|
107 |
</style>
|
108 |
</head>
|
109 |
<body>
|
110 |
-
<h1>LLM Benchmark overview</h1>
|
111 |
<div>As the development and evaluation of large language models (LLMs) continue to evolve, I conducted an overview of the principal benchmarks commonly found in research papers. My goal is to create a clear and comprehensive resource that summarizes what is being tested in LLMs, with concrete examples, key metrics, and direct links to related papers and repositories. This document serves as a centralized matrix that will be continuously updated with insights from future papers I review.</div>
|
112 |
<div class="filter">
|
113 |
<label for="metricFilter">Filter by Evaluated task:</label>
|
|
|
107 |
</style>
|
108 |
</head>
|
109 |
<body>
|
110 |
+
<h1>LLM Benchmark overview (update ongoing) </h1>
|
111 |
<div>As the development and evaluation of large language models (LLMs) continue to evolve, I conducted an overview of the principal benchmarks commonly found in research papers. My goal is to create a clear and comprehensive resource that summarizes what is being tested in LLMs, with concrete examples, key metrics, and direct links to related papers and repositories. This document serves as a centralized matrix that will be continuously updated with insights from future papers I review.</div>
|
112 |
<div class="filter">
|
113 |
<label for="metricFilter">Filter by Evaluated task:</label>
|