In-the-wild Interactions with Search-LLMs w/ Human Preferences
LMArena
community
Verified
AI & ML interests
None defined yet.
Recent Activity
View all activity
Organization Card
LMArena is an open platform for crowdsourced AI benchmarking, originally created by researchers from UC Berkeley SkyLab.
We have officially graduated from LMSYS.org!
Free chat with the best AI models at lmarena.ai, and see rankings at lmarena.ai/leaderboard.
spaces
9
Running
4.56k
LMArena Leaderboard
🏆
Display LMArena Leaderboard
Running
189
Chatbot Arena
💬
Browse text leaderboard data
Running
3
Arena Hard Viewer
⚡
Browse and evaluate model judgments from benchmarks
Running
28
Llama-4-Maverick-03-26-Experimental Battles
🔥
Browse and compare model conversation outcomes
Running
Prompt Freshness
😻
Select similarity and language to filter prompts
models
20

lmarena-ai/p2l-7b-grk-01112025
7B
•
Updated
•
4
•
4

lmarena-ai/p2l-7b-grk-02222025
7B
•
Updated
•
16
•
6

lmarena-ai/p2l-0.5b-bt-01132025
0.5B
•
Updated
•
4

lmarena-ai/p2l-1.5b-bt-01132025
2B
•
Updated
•
4

lmarena-ai/p2l-3b-bt-01132025
3B
•
Updated
•
4

lmarena-ai/p2l-7b-bt-01132025
7B
•
Updated
•
34
•
2

lmarena-ai/p2l-135m-bt-01132025
0.1B
•
Updated
•
811

lmarena-ai/p2l-360m-bt-01132025
0.4B
•
Updated
•
4

lmarena-ai/p2l-135m-rk-01132025
0.1B
•
Updated
•
5

lmarena-ai/p2l-360m-rk-01132025
0.4B
•
Updated
•
3
datasets
22
lmarena-ai/arena-human-preference-140k
Viewer
•
Updated
•
136k
•
1.03k
•
5
lmarena-ai/search-arena-24k
Viewer
•
Updated
•
24.1k
•
327
•
15
lmarena-ai/arena-hard-auto
Updated
•
241
•
3
lmarena-ai/categories-benchmark-eval
Preview
•
Updated
•
43
•
4
lmarena-ai/search-arena-v1-7k
Viewer
•
Updated
•
7k
•
266
•
22
lmarena-ai/webdev-arena-preference-10k
Viewer
•
Updated
•
10.5k
•
254
•
11
lmarena-ai/repochat-arena-preference-4k
Viewer
•
Updated
•
3.84k
•
42
•
4
lmarena-ai/arena-human-preference-100k
Viewer
•
Updated
•
106k
•
678
•
48
lmarena-ai/VisionArena-Chat
Viewer
•
Updated
•
199k
•
2.76k
•
5
lmarena-ai/VisionArena-Battle
Viewer
•
Updated
•
29.8k
•
107
•
7