Spaces: alexnasa/PartCrafter


alexnasa committed on 26 days ago

Commit c1fa643 (verified) · 1 Parent(s): d37f4d2

Update app.py


Files changed (1)

app.py +32 -2


@@ -214,7 +214,37 @@ def run_triposg(image_path: str,
                 progress=gr.Progress(track_tqdm=True),):
     """
-    Generate 3D part meshes from an input image.
+    Generate structured 3D meshes from a 2D image using the PartCrafter pipeline.
+    This function takes a single 2D image as input and produces a set of part-based 3D meshes,
+    using compositional latent diffusion with attention to structure and part separation.
+    Optionally removes the background using a pretrained background removal model (RMBG),
+    and outputs a merged object mesh, a split preview (exploded view), and a downloadable ZIP of all parts.
+    Args:
+        image_path (str): Path to the input image file on disk.
+        num_parts (int, optional): Number of distinct parts to decompose the object into. Defaults to 1.
+        seed (int, optional): Random seed for reproducibility. Defaults to 0.
+        num_tokens (int, optional): Number of tokens used during latent encoding. Higher values yield finer detail. Defaults to 1024.
+        num_inference_steps (int, optional): Number of diffusion inference steps. More steps improve quality but increase runtime. Defaults to 50.
+        guidance_scale (float, optional): Classifier-free guidance scale. Higher values emphasize adherence to conditioning. Defaults to 7.0.
+        use_flash_decoder (bool, optional): Whether to use FlashAttention in the decoder for performance. Defaults to False.
+        rmbg (bool, optional): Whether to apply background removal before processing. Defaults to True.
+        session_id (str, optional): Optional session ID to manage export paths. If not provided, a random UUID is generated.
+        progress (gr.Progress, optional): Gradio progress object for visual feedback. Automatically handled by Gradio.
+    Returns:
+        Tuple[str, str, str, str]:
+            - `merged_path` (str): File path to the merged full object mesh (`object.glb`).
+            - `split_preview_path` (str): File path to the exploded-view mesh (`split.glb`).
+            - `export_dir` (str): Directory where all generated meshes were saved.
+            - `zip_path` (str): Path to the ZIP file containing all individual part meshes.
+    Notes:
+        - This function utilizes HuggingFace pretrained weights for both part generation and background removal.
+        - The final output includes exploded and merged views to visualize object structure.
+        - Parts are exported in `.glb` format, and zipped for bulk download.
+        - Generation time depends on the number of parts and inference parameters.
     """
     max_num_expanded_coords = 1e9
@@ -384,4 +414,4 @@ if __name__ == "__main__":
     demo = build_demo()
     demo.unload(cleanup)
     demo.queue()
-    demo.launch()
+    demo.launch(mcp_server=True)
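
Note on why the two changes likely belong together: Gradio's MCP integration is documented as building each tool's description from the function's docstring and type hints, so expanding the docstring in the same commit that passes `mcp_server=True` gives MCP clients a usable description of `run_triposg`. The sketch below is not Gradio's implementation. It is a stdlib-only illustration, under that assumption, of how a summary and per-parameter descriptions could be pulled out of a docstring shaped like the one in this commit. `describe_tool` and `run_triposg_stub` are hypothetical names introduced here for illustration.

```python
import inspect
import re


def describe_tool(fn):
    """Build a small tool description from a function's Args-style docstring.

    Hypothetical helper for illustration only; not part of Gradio or app.py.
    """
    doc = inspect.getdoc(fn) or ""
    lines = doc.splitlines()

    # Everything before the first section header is treated as the summary.
    summary_parts = []
    for line in lines:
        if re.fullmatch(r"(Args|Returns|Notes):", line.strip()):
            break
        if line.strip():
            summary_parts.append(line.strip())
    summary = " ".join(summary_parts)

    # Collect "name (type): description" entries from the Args section.
    params = {}
    in_args = False
    for line in lines:
        stripped = line.strip()
        if stripped == "Args:":
            in_args = True
            continue
        if in_args and re.fullmatch(r"[A-Z][A-Za-z]*:", stripped):
            break  # reached the next section, e.g. "Returns:"
        if in_args:
            match = re.match(r"(\w+)\s*\(([^)]*)\):\s*(.*)", stripped)
            if match:
                name, type_text, description = match.groups()
                params[name] = {"type": type_text, "description": description}

    return {"name": fn.__name__, "description": summary, "parameters": params}


def run_triposg_stub(image_path: str, num_parts: int = 1, seed: int = 0):
    """
    Generate structured 3D meshes from a 2D image using the PartCrafter pipeline.
    Args:
        image_path (str): Path to the input image file on disk.
        num_parts (int, optional): Number of distinct parts to decompose the object into. Defaults to 1.
        seed (int, optional): Random seed for reproducibility. Defaults to 0.
    Returns:
        str: Placeholder return value for this stub.
    """
    return image_path


tool = describe_tool(run_triposg_stub)
for name, info in tool["parameters"].items():
    print(f"{name} [{info['type']}]: {info['description']}")
```

The stub copies only three of the documented arguments and does no mesh generation. It exists so the parsing can be run without the Space's models or a GPU.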