15 89 7

Ming Li

limingcv

https://liming-ai.github.io

liming-ai

AI & ML interests

Computer Vision, AIGC, VLM/LLM

Recent Activity

upvoted a paper 17 days ago

SemanticGen: Video Generation in Semantic Space

upvoted a paper 22 days ago

Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model

upvoted a paper 22 days ago

Kling-Omni Technical Report

View all activity

Organizations

upvoted a paper 17 days ago

SemanticGen: Video Generation in Semantic Space

Paper • 2512.20619 • Published 18 days ago • 90

upvoted 2 papers 22 days ago

Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model

Paper • 2512.13507 • Published 26 days ago • 38

Kling-Omni Technical Report

Paper • 2512.16776 • Published 23 days ago • 164

upvoted a paper about 1 month ago

Does Hearing Help Seeing? Investigating Audio-Video Joint Denoising for Video Generation

Paper • 2512.02457 • Published Dec 2, 2025 • 13

upvoted 3 papers 2 months ago

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published Oct 30, 2025 • 108

LightBagel: A Light-weighted, Double Fusion Framework for Unified Multimodal Understanding and Generation

Paper • 2510.22946 • Published Oct 27, 2025 • 17

LongCat-Video Technical Report

Paper • 2510.22200 • Published Oct 25, 2025 • 29

upvoted a paper 3 months ago

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

Paper • 2510.02283 • Published Oct 2, 2025 • 96

upvoted 2 papers 4 months ago

Seedream 4.0: Toward Next-generation Multimodal Image Generation

Paper • 2509.20427 • Published Sep 24, 2025 • 82

RewardDance: Reward Scaling in Visual Generation

Paper • 2509.08826 • Published Sep 10, 2025 • 73

upvoted 3 papers 5 months ago

upvoted a paper 6 months ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26, 2025 • 75

upvoted a paper 7 months ago

Seedance 1.0: Exploring the Boundaries of Video Generation Models

Paper • 2506.09113 • Published Jun 10, 2025 • 105

upvoted 5 papers 8 months ago

Emerging Properties in Unified Multimodal Pretraining

Paper • 2505.14683 • Published May 20, 2025 • 133

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 321

PointArena: Probing Multimodal Grounding Through Language-Guided Pointing

Paper • 2505.09990 • Published May 15, 2025 • 12

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11, 2025 • 154

DanceGRPO: Unleashing GRPO on Visual Generation

Paper • 2505.07818 • Published May 12, 2025 • 32

Ming Li

AI & ML interests

Recent Activity

Organizations

limingcv's activity