-
DreamLLM: Synergistic Multimodal Comprehension and Creation
Paper • 2309.11499 • Published • 59 -
An Introduction to Vision-Language Modeling
Paper • 2405.17247 • Published • 90 -
Chameleon: Mixed-Modal Early-Fusion Foundation Models
Paper • 2405.09818 • Published • 131 -
No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding
Paper • 2405.08344 • Published • 15
Yiming Wu
weleen
AI & ML interests
Computer Vision
Recent Activity
upvoted
a
collection
about 1 month ago
Inference Optimized Checkpoints (with Model Optimizer)
updated
a dataset
about 1 month ago
weleen/take_the_banana_and_insert_into_the_bottle
updated
a model
about 2 months ago
weleen/grab_bread_and_put