arxiv:2507.12284
Zadorozhny
pavul
AI & ML interests
PLP, AI, Agents
Recent Activity
authored a paper 16 days ago
SWE-MERA: A Dynamic Benchmark for Agenticly Evaluating Large Language Models on Software Engineering Tasks
authored a paper 16 days ago
MERA Code: A Unified Framework for Evaluating Code Generation Across Tasks
upvoted a paper 16 days ago
MERA Code: A Unified Framework for Evaluating Code Generation Across Tasks