
OpenCompass
community
Organization Card
👋 Join us on Discord and WeChat
Follow us on GitHub
OpenCompass is a platform focused on the evaluation of AGI, including large language models and multimodal models. We aim to:
- develop high-quality libraries that reduce the difficulty of evaluation
- provide convincing leaderboards that improve the understanding of large models
- create powerful toolchains targeting a variety of abilities and tasks
- build solid benchmarks to support large model research
Collections: 1
Spaces: 14
- Openvlm Subjective Leaderboard 🌎 • VLMEvalKit Subjective Benchmark Results • pinned • Running • 25
- CompassAcademic Leaderboard Full Version 🦀 • Compass Academic Leaderboard Full Version • pinned • Running • 2
- Open LMM Reasoning Leaderboard 🥇 • A leaderboard that demonstrates LMM reasoning capabilities • pinned • Running • 37
- Compass Academic Leaderboard 🦀 • Compass Academic Leaderboard • pinned • Running • 6
- Open VLM Leaderboard 🌎 • VLMEvalKit Evaluation Results Collection • pinned • Running on CPU Upgrade • 778
- MMBench Leaderboard 🚀 • View and filter MMBench leaderboard data • pinned • Running • 21
Models: 8
- opencompass/anah-7b • Text Classification • 29
- opencompass/anah-20b • Text Classification • 14
- opencompass/anah-v2 • Text Classification • 70 • 4
- opencompass/CompassJudger-1-14B-Instruct • Text Generation • 103 • 2
- opencompass/CompassJudger-1-32B-Instruct • Text Generation • 126 • 15
- opencompass/CompassJudger-1-1.5B-Instruct • 51 • 1
- opencompass/CompassJudger-1-7B-Instruct • 398 • 9
- opencompass/mixtral-8x7b-32k • 1
Datasets: 11
- opencompass/NeedleBench • Viewer • 6.8k • 262 • 5
- opencompass/compass_academic_predictions • Viewer • 4.42M • 7
- opencompass/LiveMathBench • Viewer • 283 • 1.49k • 7
- opencompass/Creation-MMBench • Viewer • 765 • 124 • 2
- opencompass/anah • Viewer • 783 • 75 • 3
- opencompass/AIME2025 • Viewer • 30 • 5.24k • 19
- opencompass/mmmlu_lite • Viewer • 20k • 34 • 2
- opencompass/MMBench-Video • Preview • 175 • 7
- opencompass/flames • Viewer • 537 • 36
- opencompass/CriticBench • 146 • 4