- DreamLLM: Synergistic Multimodal Comprehension and Creation
  Paper • 2309.11499 • Published • 59
- An Introduction to Vision-Language Modeling
  Paper • 2405.17247 • Published • 90
- Chameleon: Mixed-Modal Early-Fusion Foundation Models
  Paper • 2405.09818 • Published • 131
- No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding
  Paper • 2405.08344 • Published • 16
Yiming Wu
weleen
AI & ML interests
Computer Vision
Recent Activity
- Updated a dataset about 22 hours ago: weleen/take_the_banana_and_insert_into_the_bottle
- Published a dataset about 22 hours ago: weleen/take_the_banana_and_insert_into_the_bottle
- Updated a dataset about 23 hours ago: weleen/pick_up_banana