File size: 2,139 Bytes
783ea25
1d31670
 
 
783ea25
1d31670
 
 
 
 
 
 
 
 
 
 
783ea25
 
0c88e20
1d31670
 
 
 
 
 
 
 
 
 
 
8c70e8b
1d31670
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
b1a3d67
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
---
title: Icelandic LLM Leaderboard
colorFrom: blue
colorTo: green
sdk: docker
hf_oauth: true
pinned: true
license: apache-2.0
tags:
- leaderboard
- modality:text
- submission:automatic
- test:public
- language:icelandic
- eval:language
short_description: Track, rank and evaluate LLMs on Icelandic language tasks
---

# Icelandic LLM Leaderboard

A comprehensive leaderboard for evaluating Large Language Models (LLMs) on Icelandic language tasks. This leaderboard tracks model performance across various Icelandic benchmarks including WinoGrande-IS, GED, Inflection, Belebele-IS, ARC-Challenge-IS, and WikiQA-IS.

## Features

- 📊 Interactive table with advanced sorting and filtering
- 🔍 Semantic model search with regex support
- 📌 Pin models for easy comparison
- 📱 Responsive and modern React interface
- 🎨 Dark/Light mode support
- ⚡️ Optimized performance with virtualization
-   Specialized for Icelandic language evaluation

## Benchmarks

### Core Icelandic Tasks
- **WinoGrande-IS (3-shot)**: Icelandic common sense reasoning
- **GED**: Grammatical error detection in Icelandic
- **Inflection (1-shot)**: Icelandic morphological inflection
- **Belebele-IS**: Icelandic reading comprehension
- **ARC-Challenge-IS**: Icelandic science questions
- **WikiQA-IS**: Icelandic question answering

## Architecture

The leaderboard uses a modern React frontend with a FastAPI backend, containerized with Docker for seamless deployment on Hugging Face Spaces.

### Frontend (React)
- Material-UI components
- TanStack Table for advanced data handling
- Real-time filtering and search capabilities

### Backend (FastAPI)  
- Integration with Hugging Face repositories
- Automatic data synchronization
- RESTful API endpoints

## Data Sources

The leaderboard pulls evaluation results from:
- **Results Repository**: `mideind/icelandic-llm-leaderboard-results`
- **Requests Repository**: `mideind/icelandic-llm-leaderboard-requests`

## Contributing

To submit a model for evaluation, please follow the submission guidelines in the leaderboard interface.

## License

Apache 2.0 License - see LICENSE file for details.