Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
2
152
Sudeep Pillai
PRO
spillai
Follow
vikrantkakade4071's profile picture
multimodalart's profile picture
Mi6paulino's profile picture
7 followers
·
105 following
https://people.csail.mit.edu/spillai/
sudeeppillai
spillai
AI & ML interests
Self-supervised learning, Few-shot learning, Computer Vision, Robotics
Recent Activity
liked
a dataset
17 days ago
nvidia/ToolScale
commented
on
a paper
about 1 month ago
Orion: A Unified Visual Agent for Multimodal Perception, Advanced Visual Reasoning and Execution
upvoted
a
paper
about 1 month ago
Orion: A Unified Visual Agent for Multimodal Perception, Advanced Visual Reasoning and Execution
View all activity
Organizations
spillai
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a dataset
17 days ago
nvidia/ToolScale
Viewer
•
Updated
8 days ago
•
4.06k
•
3.37k
•
163
liked
2 datasets
3 months ago
VisuLogic/VisuLogic
Viewer
•
Updated
Jul 9
•
1k
•
1.04k
•
11
omkarthawakar/VRC-Bench
Viewer
•
Updated
Jan 13
•
1k
•
162
•
23
liked
a model
3 months ago
RedHatAI/Qwen3-VL-235B-A22B-Instruct-FP8-dynamic
Text Generation
•
236B
•
Updated
Oct 3
•
371
•
4
liked
5 models
7 months ago
manycore-research/SpatialLM-Llama-1B
Text Generation
•
1B
•
Updated
Mar 21
•
118
•
992
prithivMLmods/FastThink-0.5B-Tiny
Text Generation
•
0.5B
•
Updated
Jan 31
•
47
•
7
showlab/ShowUI-2B
Updated
Mar 11
•
2.68k
•
269
RedHatAI/Qwen2.5-VL-3B-Instruct-FP8-dynamic
Image-to-Text
•
4B
•
Updated
Apr 22
•
11.1k
•
3
Qwen/Qwen2.5-32B-Instruct-GPTQ-Int4
Text Generation
•
33B
•
Updated
Oct 9, 2024
•
161k
•
40
liked
a dataset
10 months ago
allenai/olmOCR-mix-0225
Viewer
•
Updated
Feb 25
•
259k
•
906
•
169
liked
a Space
10 months ago
Sleeping
16
Video-Bench Leaderboard
🏆
16
Submit and view model evaluation results
liked
a Space
12 months ago
Running
on
CPU Upgrade
952
Open VLM Leaderboard
🌎
952
VLMEvalKit Evaluation Results Collection
liked
3 models
about 1 year ago
microsoft/Phi-3.5-vision-instruct
Image-Text-to-Text
•
4B
•
Updated
14 days ago
•
508k
•
722
vidore/colsmolvlm-v0.1
Visual Document Retrieval
•
Updated
Mar 14
•
69
•
53
stepfun-ai/GOT-OCR2_0
Image-Text-to-Text
•
0.7B
•
Updated
Feb 4
•
15.2k
•
1.53k
liked
2 datasets
about 1 year ago
lmms-lab/ChartQA
Viewer
•
Updated
Mar 8, 2024
•
2.5k
•
15.5k
•
19
Salesforce/xlam-function-calling-60k
Viewer
•
Updated
Jan 24
•
60k
•
3.62k
•
556
liked
3 models
over 1 year ago
MrLight/dse-phi35-vidore-ft
Updated
Sep 7, 2024
•
20
•
10
Groq/Llama-3-Groq-70B-Tool-Use
Text Generation
•
71B
•
Updated
Aug 28, 2024
•
94
•
159
vidore/colpali
Visual Document Retrieval
•
Updated
about 1 month ago
•
6.46k
•
466
Load more