arxiv:2601.16208
Jihan Yang PRO
jihanyang
AI & ML interests
Computer Vision, Multimodality, Embodied AI
Recent Activity
liked
a dataset
1 day ago
allenai/Molmo2-VideoCapQA
liked
a dataset
3 days ago
jasonzhango/SPAR-7M
authored
a paper
20 days ago
Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders