Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Jianwei Yang's picture
23 6 18

Jianwei Yang

jw2yang
shuyuej's profile picture Mileena's profile picture klasicjosh's profile picture
·
https://jwyang.github.io/
  • jw2yang4ai
  • jwyang

AI & ML interests

Computer Vision, Vision and Language, Machine Learning

Recent Activity

updated a model about 22 hours ago
microsoft/Magma-8B
updated a dataset 15 days ago
MagmaAI/Magma-AITW-SoM
published a dataset 15 days ago
MagmaAI/Magma-AITW-SoM
View all activity

Organizations

Microsoft's profile picture CVPR Demo Track's profile picture X-Decoder's profile picture Pix2Gif's profile picture Multimodal AI Agents's profile picture

jw2yang's activity

upvoted a paper 3 months ago

Magma: A Foundation Model for Multimodal AI Agents

Paper • 2502.13130 • Published Feb 18 • 58
upvoted a paper 7 months ago

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Paper • 2410.10818 • Published Oct 14, 2024 • 17
upvoted 3 papers about 1 year ago

How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites

Paper • 2404.16821 • Published Apr 25, 2024 • 58

List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs

Paper • 2404.16375 • Published Apr 25, 2024 • 18

Pix2Gif: Motion-Guided Diffusion for GIF Generation

Paper • 2403.04634 • Published Mar 7, 2024 • 18
upvoted a paper over 1 year ago

Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V

Paper • 2310.11441 • Published Oct 17, 2023 • 28
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs