Goodman PRO
Motolov
Recent Activity
new activity
about 1 month ago
a686d380/h-corpus-2023: Experts, may I ask whether this data can be used to train Qwen2.5?
updated
a collection
about 1 year ago
3D
Organizations
Consistency
Text to Image
- Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion Distillation
  Paper • 2403.12015 • Published • 69
- Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation
  Paper • 2403.16990 • Published • 26
- SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions
  Paper • 2403.16627 • Published • 22
- TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models
  Paper • 2403.17005 • Published • 13
AI editing
Animate
3D
- GaussianFlow: Splatting Gaussian Dynamics for 4D Content Creation
  Paper • 2403.12365 • Published • 11
- VP3D: Unleashing 2D Visual Prompt for Text-to-3D Generation
  Paper • 2403.17001 • Published • 6
- TC4D: Trajectory-Conditioned Text-to-4D Generation
  Paper • 2403.17920 • Published • 18
- DreamPolisher: Towards High-Quality Text-to-3D Generation via Geometric Diffusion
  Paper • 2403.17237 • Published • 11
MMLM
Video Gen
- PEEKABOO: Interactive Video Generation via Masked-Diffusion
  Paper • 2312.07509 • Published • 12
- StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
  Paper • 2403.14773 • Published • 11
- InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding
  Paper • 2403.15377 • Published • 27
- Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition
  Paper • 2403.14148 • Published • 22