Sudeep Pillai PRO

spillai

https://people.csail.mit.edu/spillai/

AI & ML interests

Self-supervised learning, Few-shot learning, Computer Vision, Robotics

Recent Activity

liked a dataset 17 days ago

nvidia/ToolScale

commented on a paper about 1 month ago

Orion: A Unified Visual Agent for Multimodal Perception, Advanced Visual Reasoning and Execution

upvoted a paper about 1 month ago

Orion: A Unified Visual Agent for Multimodal Perception, Advanced Visual Reasoning and Execution

liked a dataset 17 days ago

nvidia/ToolScale

Viewer • Updated 8 days ago • 4.06k • 3.37k • 163

liked 2 datasets 3 months ago

VisuLogic/VisuLogic

Viewer • Updated Jul 9 • 1k • 1.04k • 11

omkarthawakar/VRC-Bench

Viewer • Updated Jan 13 • 1k • 162 • 23

liked a model 3 months ago

RedHatAI/Qwen3-VL-235B-A22B-Instruct-FP8-dynamic

Text Generation • 236B • Updated Oct 3 • 371 • 4

liked 5 models 7 months ago

liked a dataset 10 months ago

allenai/olmOCR-mix-0225

Viewer • Updated Feb 25 • 259k • 906 • 169

liked a Space 10 months ago

Video-Bench Leaderboard

🏆

Submit and view model evaluation results

liked a Space 12 months ago

Open VLM Leaderboard

🌎

952

VLMEvalKit Evaluation Results Collection

liked 3 models about 1 year ago

microsoft/Phi-3.5-vision-instruct

Image-Text-to-Text • 4B • Updated 14 days ago • 508k • 722

vidore/colsmolvlm-v0.1

Visual Document Retrieval • Updated Mar 14 • 69 • 53

stepfun-ai/GOT-OCR2_0

Image-Text-to-Text • 0.7B • Updated Feb 4 • 15.2k • 1.53k

liked 2 datasets about 1 year ago

lmms-lab/ChartQA

Viewer • Updated Mar 8, 2024 • 2.5k • 15.5k • 19

Salesforce/xlam-function-calling-60k

Viewer • Updated Jan 24 • 60k • 3.62k • 556

liked 3 models over 1 year ago

MrLight/dse-phi35-vidore-ft

Updated Sep 7, 2024 • 20 • 10

Groq/Llama-3-Groq-70B-Tool-Use

Text Generation • 71B • Updated Aug 28, 2024 • 94 • 159

vidore/colpali

Visual Document Retrieval • Updated about 1 month ago • 6.46k • 466