Fix README.md for Hugging Face Spaces
README.md CHANGED
@@ -1,53 +1,42 @@
-
+ ---
+ title: HunyuanWorld-1.0 Demo
+ emoji: 🌍
+ colorFrom: blue
+ colorTo: green
+ sdk: gradio
+ sdk_version: "4.42.0"
+ app_file: app.py
+ pinned: false
+ license: other
+ models:
+ - black-forest-labs/FLUX.1-dev
+ - tencent/HunyuanWorld-1
+ hardware: nvidia-t4-small
+ ---

-
- <img src="assets/teaser.png">
- </p>
-
- <div align="center">
- <a href=https://3d.hunyuan.tencent.com/sceneTo3D target="_blank"><img src=https://img.shields.io/badge/Official%20Site-333399.svg?logo=homepage height=22px></a>
- <a href=https://huggingface.co/tencent/HunyuanWorld-1 target="_blank"><img src=https://img.shields.io/badge/%F0%9F%A4%97%20Models-d96902.svg height=22px></a>
- <a href=https://3d-models.hunyuan.tencent.com/world/ target="_blank"><img src= https://img.shields.io/badge/Page-bb8a2e.svg?logo=github height=22px></a>
- <a href=https://discord.gg/dNBrdrGGMa target="_blank"><img src= https://img.shields.io/badge/Discord-white.svg?logo=discord height=22px></a>
- <a href=https://x.com/TencentHunyuan target="_blank"><img src=https://img.shields.io/badge/Hunyuan-black.svg?logo=x height=22px></a>
- <a href="#community-resources" target="_blank"><img src=https://img.shields.io/badge/Community-lavender.svg?logo=homeassistantcommunitystore height=22px></a>
- </div>
-
- [//]: # ( <a href=# target="_blank"><img src=https://img.shields.io/badge/Report-b5212f.svg?logo=arxiv height=22px></a>)
-
- [//]: # ( <a href=# target="_blank"><img src= https://img.shields.io/badge/Colab-8f2628.svg?logo=googlecolab height=22px></a>)
-
- [//]: # ( <a href="#"><img alt="PyPI - Downloads" src="https://img.shields.io/pypi/v/mulankit?logo=pypi" height=22px></a>)
+ # HunyuanWorld-1.0 Demo Space

-
+ This is a Gradio demo for [Tencent-Hunyuan/HunyuanWorld-1.0](https://github.com/Tencent-Hunyuan/HunyuanWorld-1.0), a one-stop solution for text-driven 3D scene generation.

-
- "To see a World in a Grain of Sand, and a Heaven in a Wild Flower"
- </p>
-
- https://github.com/user-attachments/assets/513c9529-2b34-4872-b38f-4f291f3ae1c7
+ ## How to Use

-
- -
- -
+ 1. **Panorama Generation**:
+ - **Text-to-Panorama**: Enter a text prompt and generate a 360° panorama image.
+ - **Image-to-Panorama**: Upload an image and provide a prompt to extend it into a panorama.
+ 2. **Scene Generation**:
+ - After generating a panorama, click "Send to Scene Generation".
+ - Provide labels for foreground objects to be separated into layers.
+ - Click "Generate 3D Scene" to create a 3D mesh from the panorama.

-
+ ## Technical Details

-
- |--------------------------------------------------|-------------------------------------------------------|---------------------------------------------|---------------------------------------------------|
- | <img src="assets/qrcode/wechat.png" height=140> | <img src="assets/qrcode/xiaohongshu.png" height=140> | <img src="assets/qrcode/x.png" height=140> | <img src="assets/qrcode/discord.png" height=140> |
+ This space combines two core functionalities of the HunyuanWorld-1.0 model:

-
+ - **Panorama Generation**: Creates immersive 360° images from text or existing images.
+ - **3D Scene Reconstruction**: Decomposes a panorama into layers and reconstructs a 3D mesh.

-
- Creating immersive and playable 3D worlds from texts or images remains a fundamental challenge in computer vision and graphics. Existing world generation approaches typically fall into two categories: video-based methods that offer rich diversity but lack 3D consistency and rendering efficiency, and 3D-based methods that provide geometric consistency but struggle with limited training data and memory-inefficient representations. To address these limitations, we present HunyuanWorld 1.0, a novel framework that combines the best of both sides for generating immersive, explorable, and interactive 3D worlds from text and image conditions. Our approach features three key advantages: 1) 360° immersive experiences via panoramic world proxies; 2) mesh export capabilities for seamless compatibility with existing computer graphics pipelines; 3) disentangled object representations for augmented interactivity. The core of our framework is a semantically layered 3D mesh representation that leverages panoramic images as 360° world proxies for semantic-aware world decomposition and reconstruction, enabling the generation of diverse 3D worlds. Extensive experiments demonstrate that our method achieves state-of-the-art performance in generating coherent, explorable, and interactive 3D worlds while enabling versatile applications in virtual reality, physical simulation, game development, and interactive content creation.
-
- <p align="center">
- <img src="assets/application.png">
- </p>
+ This demo is running on an NVIDIA T4 GPU. Due to the size of the models, the initial startup may take a few minutes.

- ### Architecture
- Tencent HunyuanWorld-1.0's generation architecture integrates panoramic proxy generation, semantic layering, and hierarchical 3D reconstruction to achieve high-quality scene-scale 360° 3D world generation, supporting both text and image inputs.

  <p align="left">
  <img src="assets/arch.jpg">
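
Review note: the front-matter block this commit adds at the top of README.md can be sanity-checked locally before pushing. The sketch below is only an illustration, not part of the commit. `parse_front_matter`, `check_front_matter`, and `CHECKED_KEYS` are hypothetical names; the parser handles only the flat `key: value` and `- item` shapes that appear in this diff and is not a general YAML parser. The set of keys it checks is an arbitrary choice for this example, not an official list of required fields.

```python
# Front matter copied from the added lines of this diff.
README_TEXT = """---
title: HunyuanWorld-1.0 Demo
emoji: 🌍
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: "4.42.0"
app_file: app.py
pinned: false
license: other
models:
- black-forest-labs/FLUX.1-dev
- tencent/HunyuanWorld-1
hardware: nvidia-t4-small
---

# HunyuanWorld-1.0 Demo Space
"""

# Thumbnail gradient colors accepted for colorFrom / colorTo.
ALLOWED_COLORS = {"red", "yellow", "green", "blue",
                  "indigo", "purple", "pink", "gray"}

# Assumption: keys this sketch chooses to check (illustrative only).
CHECKED_KEYS = ("title", "sdk", "app_file")


def parse_front_matter(text):
    """Parse the block between the first two '---' lines into a dict.

    Scalars are kept as strings; a key with an empty value starts a list
    that collects the '- item' lines that follow it.
    """
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        raise ValueError("README does not start with a '---' line")
    meta, current_list = {}, None
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":          # closing delimiter: done
            return meta
        if not stripped:
            continue
        if stripped.startswith("- "):  # list item for the preceding key
            if current_list is None:
                raise ValueError(f"list item without a key: {line!r}")
            current_list.append(stripped[2:].strip())
            continue
        key, sep, value = stripped.partition(":")
        if not sep:
            raise ValueError(f"not a 'key: value' line: {line!r}")
        value = value.strip().strip("\"'")
        if value:
            meta[key.strip()] = value
            current_list = None
        else:
            current_list = meta[key.strip()] = []
    raise ValueError("front matter is never closed by a second '---' line")


def check_front_matter(meta):
    """Return a list of human-readable problems (empty if none found)."""
    problems = [f"missing key: {k}" for k in CHECKED_KEYS if k not in meta]
    for key in ("colorFrom", "colorTo"):
        if key in meta and meta[key] not in ALLOWED_COLORS:
            problems.append(f"unsupported {key}: {meta[key]}")
    return problems


if __name__ == "__main__":
    parsed = parse_front_matter(README_TEXT)
    print(parsed["sdk"], parsed["models"])
    print(check_front_matter(parsed) or "no problems found")
```

A check like this only catches structural slips such as a missing closing `---` or a misspelled color. It says nothing about whether the Space itself builds.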