SEM: Sparse Embedding Modulation for Post-Hoc Debiasing of Vision-Language Models Paper • 2603.19028 • Published 10 days ago • 16
ProactiveBench: Benchmarking Proactiveness in Multimodal Large Language Models Paper • 2603.19466 • Published 9 days ago • 40
TerraScope: Pixel-Grounded Visual Reasoning for Earth Observation Paper • 2603.19039 • Published 10 days ago • 49
Large Multimodal Models as General In-Context Classifiers Paper • 2602.23229 • Published about 1 month ago • 26
How to Take a Memorable Picture? Empowering Users with Actionable Feedback Paper • 2602.21877 • Published Feb 25 • 16