---
title: Icelandic LLM Leaderboard
colorFrom: blue
colorTo: green
sdk: docker
hf_oauth: true
pinned: true
license: apache-2.0
tags:
- leaderboard
- modality:text
- submission:automatic
- test:public
- language:icelandic
- eval:language
short_description: Track, rank and evaluate LLMs on Icelandic language tasks
---
# Icelandic LLM Leaderboard
A leaderboard for evaluating Large Language Models (LLMs) on Icelandic language tasks. It tracks model performance on six Icelandic benchmarks: WinoGrande-IS, GED, Inflection, Belebele-IS, ARC-Challenge-IS, and WikiQA-IS.
## Features
- 📊 Interactive table with advanced sorting and filtering
- 🔍 Semantic model search with regex support
- 📌 Pin models for easy comparison
- 📱 Responsive and modern React interface
- 🎨 Dark/Light mode support
- ⚡️ Optimized performance with virtualization
- Specialized for Icelandic language evaluation
## Benchmarks
### Core Icelandic Tasks
- **WinoGrande-IS (3-shot)**: Icelandic commonsense reasoning
- **GED**: Grammatical error detection in Icelandic
- **Inflection (1-shot)**: Icelandic morphological inflection
- **Belebele-IS**: Icelandic reading comprehension
- **ARC-Challenge-IS**: Icelandic science questions
- **WikiQA-IS**: Icelandic question answering
## Architecture
The leaderboard pairs a React frontend with a FastAPI backend. Both are containerized with Docker for deployment on Hugging Face Spaces.
### Frontend (React)
- Material-UI components
- TanStack Table for advanced data handling
- Real-time filtering and search capabilities
### Backend (FastAPI)
- Integration with Hugging Face repositories
- Automatic data synchronization
- RESTful API endpoints
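
As a rough illustration of what one RESTful endpoint might look like, here is a minimal sketch. The route `/api/leaderboard`, the helper names `sort_rows` and `create_app`, the column names, and the placeholder rows are all hypothetical and are not taken from this project's code. The FastAPI import is deferred so the sorting helper can be used without FastAPI installed.

```python
from typing import Any

# Hypothetical placeholder rows, not real evaluation results.
ROWS: list[dict[str, Any]] = [
    {"model": "org/model-a", "winogrande_is": 61.2, "belebele_is": 70.0},
    {"model": "org/model-b", "winogrande_is": 74.5, "belebele_is": 66.3},
]


def sort_rows(
    rows: list[dict[str, Any]], sort_by: str, descending: bool = True
) -> list[dict[str, Any]]:
    """Sort rows by one score column; rows without that column go last."""
    scored = [r for r in rows if r.get(sort_by) is not None]
    unscored = [r for r in rows if r.get(sort_by) is None]
    return sorted(scored, key=lambda r: r[sort_by], reverse=descending) + unscored


def create_app(rows: list[dict[str, Any]]):
    """Build a FastAPI app exposing the rows (requires `fastapi`)."""
    from fastapi import FastAPI  # imported here so sort_rows works without it

    app = FastAPI(title="Leaderboard API sketch")

    # Hypothetical route; the real backend's paths may differ.
    @app.get("/api/leaderboard")
    def leaderboard(sort_by: str = "winogrande_is", descending: bool = True):
        return sort_rows(rows, sort_by, descending)

    return app
```

Calling `create_app(ROWS)` returns an app object that any ASGI server could serve. Keeping the sorting logic in a plain function makes it easy to test without starting a server.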
## Data Sources
The leaderboard pulls evaluation results from:
- **Results Repository**: `mideind/icelandic-llm-leaderboard-results`
- **Requests Repository**: `mideind/icelandic-llm-leaderboard-requests`
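
The snippet below is a minimal sketch of how the results repository could be read outside the leaderboard. It assumes the repository is hosted as a Hugging Face dataset and stores results as JSON files; neither the repository type nor the file layout is confirmed here, and the helper names `download_results` and `load_results` are hypothetical.

```python
import json
from pathlib import Path

RESULTS_REPO = "mideind/icelandic-llm-leaderboard-results"


def download_results(local_dir: str = "eval-results") -> Path:
    """Fetch a snapshot of the results repo (needs `huggingface_hub` and network access)."""
    from huggingface_hub import snapshot_download

    # repo_type="dataset" is an assumption about how the repository is hosted.
    path = snapshot_download(
        repo_id=RESULTS_REPO, repo_type="dataset", local_dir=local_dir
    )
    return Path(path)


def load_results(root: Path) -> list[dict]:
    """Read every JSON object under `root`, skipping files that do not parse."""
    rows = []
    for file in sorted(root.rglob("*.json")):
        try:
            data = json.loads(file.read_text(encoding="utf-8"))
        except json.JSONDecodeError:
            continue  # not valid JSON; ignore rather than fail the whole load
        if isinstance(data, dict):
            # Record where each row came from; the remaining keys are kept as-is.
            rows.append({"source_file": file.relative_to(root).as_posix(), **data})
    return rows
```

A typical call would be `load_results(download_results())`. The loader makes no assumption about which keys each result file contains, so it should be adapted once the actual schema is known.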
## Contributing
To submit a model for evaluation, please follow the submission guidelines in the leaderboard interface.
## License
Apache License 2.0. See the LICENSE file for details.