Beyond Memorization: A Multi-Modal Ordinal Regression Benchmark to Expose Popularity Bias in Vision-Language Models Paper • 2512.21337 • Published 6 days ago • 25
SCOPE: Prompt Evolution for Enhancing Agent Effectiveness Paper • 2512.15374 • Published 13 days ago • 5