MUMU: Bootstrapping Multimodal Image Generation from Text-to-Image Data • Paper • 2406.18790 • Published Jun 26, 2024 • 35
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model • Paper • 2404.09967 • Published Apr 15, 2024 • 22
Controllable Music Production with Diffusion Models and Guidance Gradients • Paper • 2311.00613 • Published Nov 1, 2023 • 26
Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation • Paper • 2306.07954 • Published Jun 13, 2023 • 111