arxiv:2511.09611
Xiangtai Li
LXT
AI & ML interests
Computer Vision, Multi-Modal Understanding, Generative AI
Recent Activity
upvoted
a
paper
about 9 hours ago
Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding
upvoted
a
paper
about 9 hours ago
STEP3-VL-10B Technical Report
upvoted
a
paper
3 days ago
BabyVision: Visual Reasoning Beyond Language