sungnyun commited on
Commit
c772c9e
·
1 Parent(s): 387cd44

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +34 -0
README.md CHANGED
@@ -1,3 +1,37 @@
1
  ---
2
  license: apache-2.0
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
+ language:
4
+ - en
5
+ library_name: transformers
6
+ pipeline_tag: text-to-image
7
  ---
8
+
9
+ <br>
10
+
11
+ # DiffBlender Model Card
12
+
13
+ This repo contains the models from our paper [**DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models**](https://arxiv.org/abs/2305.15194).
14
+
15
+
16
+ ## Model details
17
+
18
+ **Model type:**
19
+ DiffBlender successfully synthesizes complex combinations of input modalities.
20
+ It enables flexible manipulation of conditions, providing the customized generation aligned with user preferences.
21
+ We designed its structure to intuitively extend to additional modalities while achieving a low training cost through a partial update of hypernetworks.
22
+ We provide its model checkpoint, trained with six modalities: sketch, depth map, grounding box, keypoints, color palette, and style embedding.
23
+
24
+ **License:**
25
+ Apache 2.0 License
26
+
27
+ **Where to send questions or comments about the model:**
28
+ https://github.com/sungnyun/diffblender/issues
29
+
30
+
31
+ ## Training dataset
32
+ [Microsoft COCO 2017 dataset](https://cocodataset.org/#home)
33
+
34
+
35
+ <br>
36
+
37
+ More detials are in our project page, https://sungnyun.github.io/diffblender/.