17 28 2

kaipeng

kpzhang996

AI & ML interests

None yet

Recent Activity

liked a model 17 days ago

stdstu123/Yume-I2V-540P

upvoted a paper 17 days ago

Yume: An Interactive World Generation Model

commented on a paper 17 days ago

Yume: An Interactive World Generation Model

View all activity

Organizations

liked a model 17 days ago

stdstu123/Yume-I2V-540P

Image-to-Video • Updated 17 days ago • 25

upvoted a paper 17 days ago

Yume: An Interactive World Generation Model

Paper • 2507.17744 • Published 18 days ago • 77

commented a paper 17 days ago

Yume: An Interactive World Generation Model

Paper • 2507.17744 • Published 18 days ago • 77 •

upvoted a paper 23 days ago

π^3: Scalable Permutation-Equivariant Visual Geometry Learning

Paper • 2507.13347 • Published 24 days ago • 63

upvoted a paper 27 days ago

Neural-Driven Image Editing

Paper • 2507.05397 • Published Jul 7 • 26

commented a paper 27 days ago

Neural-Driven Image Editing

Paper • 2507.05397 • Published Jul 7 • 26 •

liked a dataset about 2 months ago

Lixsp11/Sekai-Project

Viewer • Updated Jun 27 • 344k • 441 • 26

upvoted a paper about 2 months ago

Sekai: A Video Dataset towards World Exploration

Paper • 2506.15675 • Published Jun 18 • 64

commented a paper about 2 months ago

Sekai: A Video Dataset towards World Exploration

Paper • 2506.15675 • Published Jun 18 • 64 •

upvoted a paper about 2 months ago

A High-Quality Dataset and Reliable Evaluation for Interleaved Image-Text Generation

Paper • 2506.09427 • Published Jun 11 • 9

commented a paper about 2 months ago

A High-Quality Dataset and Reliable Evaluation for Interleaved Image-Text Generation

Paper • 2506.09427 • Published Jun 11 • 9 •

authored 9 papers 2 months ago

Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT

Paper • 2406.18583 • Published Jun 5, 2024

EfficientQAT: Efficient Quantization-Aware Training for Large Language Models

Paper • 2407.11062 • Published Jul 10, 2024 • 10

Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping

Paper • 2410.08695 • Published Oct 11, 2024

ZipAR: Accelerating Autoregressive Image Generation through Spatial Locality

Paper • 2412.04062 • Published Dec 5, 2024 • 9

LiT: Delving into a Simplified Linear Diffusion Transformer for Image Generation

Paper • 2501.12976 • Published Jan 22

MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models

Paper • 2504.05782 • Published Apr 8 • 4

kaipeng

AI & ML interests

Recent Activity

Organizations

kpzhang996's activity