Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs Paper • 2504.15280 • Published Apr 21 • 25
SOS: Synthetic Object Segments Improves Detection, Segmentat Collection Dataset collections for SOS: Synthetic Object Segments Improve Detection, Segmentation, and Grounding | GitHub link: https://github.com/weikaih04/SOS • 10 items • Updated May 23 • 1
CoTA Datasets Collection This collection contains all versions of the CoTA (Chain-of-Thought-and-Action) datasets. • 5 items • Updated 13 days ago • 7
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper • 2408.08872 • Published Aug 16, 2024 • 101
TaskMeAnything Collection A collection of TaskMeAnything resources [https://github.com/JieyuZ2/TaskMeAnything] • 12 items • Updated Aug 4, 2024 • 3