Spaces:

ByteDance
/

XVerse

Running on Zero

App Files Files Community

helloworld-S commited on 4 days ago

Commit

3e9dad5

verified ·

1 Parent(s): 1a14f73

Update app.py

Browse files

Files changed (1) hide show

app.py +40 -1

app.py CHANGED Viewed

@@ -315,7 +315,39 @@ if __name__ == "__main__":
     with gr.Blocks() as demo:
-        gr.Markdown("### XVerse Demo")
         with gr.Row():
             with gr.Column():
                 prompt = gr.Textbox(label="Prompt", value="")
@@ -333,6 +365,13 @@ if __name__ == "__main__":
                 # 将其他设置参数压缩到 Advanced Accordion 内
                 with gr.Accordion("Advanced", open=False):
                     # 使用 Row 和 Column 来布局四个图像和描述
                     with gr.Row():
                         target_height = gr.Slider(512, 1024, step=128, value=768, label="Generated Height", info="")

     with gr.Blocks() as demo:
+        gr.Markdown("""
+### Official demo for "XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation"
+<p align="center">
+    <a href="https://arxiv.org/abs/2506.21416">
+            <img alt="Build" src="https://img.shields.io/badge/arXiv%20paper-2506.21416-b31b1b.svg">
+    </a>
+    <a href="https://bytedance.github.io/XVerse/">
+        <img alt="Project Page" src="https://img.shields.io/badge/Project-Page-blue">
+    </a>
+    <a href="https://huggingface.co/ByteDance/XVerse">
+        <img alt="Build" src="https://img.shields.io/badge/🤗-HF%20Model-yellow">
+    </a>
+    <a href="https://github.com/ByteDance/XVerse">
+        <img alt="Build" src="https://img.shields.io/badge/Github-Repo-blue">
+    </a>
+</p>
+#### Input Images and Prompts
+* **Prompt**: The textual description guiding the image generation.
+* **Upload Image**: Click "Image X" to upload your desired reference image.
+* **Image Description**: Enter a description in the "Caption X" input box. You can also click "Auto Caption" to generate a description automatically.
+* **Detection & Segmentation**: Click "Det & Seg" to perform detection and segmentation on the uploaded image.
+* **Crop Face**: Use "Crop Face" to automatically crop the face from the image.
+* **ID Checkbox**: Check or uncheck "ID or not" to determine whether to use ID-related weights for that specific input image.
+> **⚠️ Important Usage Notes:**
+>
+> * **Prompt Construction**: The main text prompt **MUST** include the exact text you entered in the `Image Description` field for each active image. **Generation will fail if this description is missing from the prompt.**
+>     * *Example*: If you upload two images and set their descriptions as "a man with red hair" (for Image 1) and "a woman with blue eyes" (for Image 2), your main prompt might be: "A `a man with red hair` walking beside `a woman with blue eyes` in a park."
+>     * You can then write your main prompt simply as: "`ENT1` walking beside `ENT2` in a park." The code will **automatically replace** these placeholders with the full description text before generation.
+""")
         with gr.Row():
             with gr.Column():
                 prompt = gr.Textbox(label="Prompt", value="")
                 # 将其他设置参数压缩到 Advanced Accordion 内
                 with gr.Accordion("Advanced", open=False):
+                    gr.Markdown("""#### Advanced Settings Explained
+The Gradio demo provides several parameters to control your image generation process:
+* **Generated Height/Width**: Use the sliders to set the shape of the output image.
+* **Weight_id/ip**: Adjust these weight parameters. Higher values generally lead to better subject consistency but might slightly impact the naturalness of the generated image.
+* **latent_lora_scale and vae_lora_scale**: Control the LoRA scale. Similar to Weight_id/ip, larger LoRA values can improve subject consistency but may reduce image naturalness.
+* **vae_skip_iter_before and vae_skip_iter_after**: Configure VAE skip iterations. Skipping more steps can result in better naturalness but might compromise subject consistency.
+""")
                     # 使用 Row 和 Column 来布局四个图像和描述
                     with gr.Row():
                         target_height = gr.Slider(512, 1024, step=128, value=768, label="Generated Height", info="")