Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
wjzheng's picture
7 42

wjzheng

DevinZh
·
https://wjzhengnlp.github.io/

AI & ML interests

None yet

Organizations

Text Mining Group, Nanjing University of Science and Technology's profile picture

upvoted a paper 4 months ago

MegaMath: Pushing the Limits of Open Math Corpora

Paper • 2504.02807 • Published Apr 3 • 34
upvoted 5 papers almost 2 years ago

Kosmos-2.5: A Multimodal Literate Model

Paper • 2309.11419 • Published Sep 20, 2023 • 50

DualToken-ViT: Position-aware Efficient Vision Transformer with Dual Token Fusion

Paper • 2309.12424 • Published Sep 21, 2023 • 11

VidChapters-7M: Video Chapters at Scale

Paper • 2309.13952 • Published Sep 25, 2023 • 11

VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning

Paper • 2309.15091 • Published Sep 26, 2023 • 33

End-to-End Speech Recognition Contextualization with Large Language Models

Paper • 2309.10917 • Published Sep 19, 2023 • 10
upvoted a paper about 2 years ago

InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation

Paper • 2307.06942 • Published Jul 13, 2023 • 23
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs