kaipeng

kpzhang996

https://kpzhang93.github.io/

AI & ML interests

None yet

Recent Activity

authored a paper about 20 hours ago

Generative World Renderer

commented on a paper 5 days ago

Generative World Renderer

upvoted a paper 5 days ago

Generative World Renderer

Organizations

None yet

authored a paper about 20 hours ago

Generative World Renderer

Paper • 2604.02329 • Published 6 days ago • 94

authored a paper 9 days ago

PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference

Paper • 2603.25730 • Published 12 days ago • 51

submitted a paper to Daily Papers 9 days ago

PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference

Paper • 2603.25730 • Published 12 days ago • 51

authored 17 papers 14 days ago

MDK12-Bench: A Comprehensive Evaluation of Multimodal Large Language Models on Multidisciplinary Exams

Paper • 2508.06851 • Published Aug 9, 2025

InMind: Evaluating LLMs in Capturing and Applying Individual Human Reasoning Styles

Paper • 2508.16072 • Published Aug 22, 2025 • 4

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25, 2025 • 217

Symbolic Graphics Programming with Large Language Models

Paper • 2509.05208 • Published Sep 5, 2025 • 47

OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling

Paper • 2509.12201 • Published Sep 15, 2025 • 107

TIR-Bench: A Comprehensive Benchmark for Agentic Thinking-with-Images Reasoning

Paper • 2511.01833 • Published Nov 3, 2025 • 16

InternSpatial: A Comprehensive Dataset for Spatial Reasoning in Vision-Language Models

Paper • 2506.18385 • Published Jun 23, 2025

Dialogue as Discovery: Navigating Human Intent Through Principled Inquiry

Paper • 2510.27410 • Published Oct 31, 2025

SVBench: Evaluation of Video Generation Models on Social Reasoning

Paper • 2512.21507 • Published Dec 25, 2025 • 8

Yume-1.5: A Text-Controlled Interactive World Generation Model

Paper • 2512.22096 • Published Dec 26, 2025 • 61

ProSoftArena: Benchmarking Hierarchical Capabilities of Multimodal Agents in Professional Software Environments

Paper • 2601.02399 • Published Dec 30, 2025

Focal Guidance: Unlocking Controllability from Semantic-Weak Layers in Video Diffusion Models

Paper • 2601.07287 • Published Jan 12 • 5

MeepleLM: A Virtual Playtester Simulating Diverse Subjective Experiences

Paper • 2601.07251 • Published Jan 12 • 11

World Craft: Agentic Framework to Create Visualizable Worlds via Text

Paper • 2601.09150 • Published Jan 14 • 19
