VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning
[Paper] [Project Page] [Github]
[🤗 Online Demo] [🤗 Dataset Card]
If you find VisualCloze helpful, please consider starring ⭐ the GitHub repo. Thanks!
📰 News
- [2025-6-26] VisualCloze has been accepted by ICCV 2025.
- [2025-5-15] 🤗🤗🤗 VisualCloze has been merged into the official pipelines of diffusers. For usage guidance, please refer to the Full Model Card 384 and Full Model Card 512.
- [2025-5-18] 🥳🥳🥳 We have released the LoRA weights supporting diffusers at LoRA Model Card 384 and LoRA Model Card 512.
Key Features
A universal image generation framework based on in-context learning.
- Supports various in-domain tasks.
- Generalizes to unseen tasks through in-context learning.
- Unifies multiple tasks into one step and generates both the target image and intermediate results.
- Supports reverse-engineering a set of conditions from a target image.
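One way to picture the in-context setup behind these features is as a grid of images: each row is one example of the task, and the final row is the query with its target cell left blank for the model to fill in. The sketch below only illustrates that data layout under this assumption. The names `build_cloze_grid` and `blank_cells` are hypothetical, strings stand in for images, and none of this is the VisualCloze or diffusers API.

```python
from typing import List, Optional, Tuple

# Hypothetical placeholder type: a string stands in for an image (e.g. a file path).
Image = str
Grid = List[List[Optional[Image]]]


def build_cloze_grid(examples: List[List[Image]], query: List[Image]) -> Grid:
    """Stack in-context example rows on top of a query row whose last cell is blank.

    Each example row is [condition_1, ..., condition_k, target].
    The query row supplies only the k conditions; its target is left as None.
    """
    if not examples:
        raise ValueError("at least one in-context example is required")
    width = len(examples[0])
    if any(len(row) != width for row in examples):
        raise ValueError("all example rows must have the same length")
    if len(query) != width - 1:
        raise ValueError("query must supply every cell except the target")
    grid: Grid = [list(row) for row in examples]
    grid.append(list(query) + [None])  # None marks the cell to be generated
    return grid


def blank_cells(grid: Grid) -> List[Tuple[int, int]]:
    """Return the (row, column) positions that a model would be asked to fill in."""
    return [
        (r, c)
        for r, row in enumerate(grid)
        for c, cell in enumerate(row)
        if cell is None
    ]


# Two depth-to-photo examples followed by a query that has only a depth map.
grid = build_cloze_grid(
    examples=[["depth_a.png", "photo_a.png"], ["depth_b.png", "photo_b.png"]],
    query=["depth_query.png"],
)
print(blank_cells(grid))  # [(2, 1)]
```

Under this layout, swapping the example rows changes the task without changing the code, which is the intuition behind generalizing to unseen tasks from in-context examples. For actual usage, the model cards referenced in the News section are the authoritative source.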
🔥 Examples are shown on the project page.