mooki0 commited on
Commit
0581c45
·
verified ·
1 Parent(s): 57276d4

Fix README.md for Hugging Face Spaces

Browse files
Files changed (1) hide show
  1. README.md +30 -41
README.md CHANGED
@@ -1,53 +1,42 @@
1
- [中文阅读](README_zh_cn.md)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2
 
3
- <p align="center">
4
- <img src="assets/teaser.png">
5
- </p>
6
-
7
- <div align="center">
8
- <a href=https://3d.hunyuan.tencent.com/sceneTo3D target="_blank"><img src=https://img.shields.io/badge/Official%20Site-333399.svg?logo=homepage height=22px></a>
9
- <a href=https://huggingface.co/tencent/HunyuanWorld-1 target="_blank"><img src=https://img.shields.io/badge/%F0%9F%A4%97%20Models-d96902.svg height=22px></a>
10
- <a href=https://3d-models.hunyuan.tencent.com/world/ target="_blank"><img src= https://img.shields.io/badge/Page-bb8a2e.svg?logo=github height=22px></a>
11
- <a href=https://discord.gg/dNBrdrGGMa target="_blank"><img src= https://img.shields.io/badge/Discord-white.svg?logo=discord height=22px></a>
12
- <a href=https://x.com/TencentHunyuan target="_blank"><img src=https://img.shields.io/badge/Hunyuan-black.svg?logo=x height=22px></a>
13
- <a href="#community-resources" target="_blank"><img src=https://img.shields.io/badge/Community-lavender.svg?logo=homeassistantcommunitystore height=22px></a>
14
- </div>
15
-
16
- [//]: # ( <a href=# target="_blank"><img src=https://img.shields.io/badge/Report-b5212f.svg?logo=arxiv height=22px></a>)
17
-
18
- [//]: # ( <a href=# target="_blank"><img src= https://img.shields.io/badge/Colab-8f2628.svg?logo=googlecolab height=22px></a>)
19
-
20
- [//]: # ( <a href="#"><img alt="PyPI - Downloads" src="https://img.shields.io/pypi/v/mulankit?logo=pypi" height=22px></a>)
21
 
22
- <br>
23
 
24
- <p align="center">
25
- "To see a World in a Grain of Sand, and a Heaven in a Wild Flower"
26
- </p>
27
-
28
- https://github.com/user-attachments/assets/513c9529-2b34-4872-b38f-4f291f3ae1c7
29
 
30
- ## 🔥 News
31
- - July 26, 2025: 👋 We present the technical report of HunyuanWorld-1.0, please check out the details and spark some discussion!
32
- - July 26, 2025: 🤗 We release the first open-source, simulation-capable, immersive 3D world generation model, HunyuanWorld-1.0!
 
 
 
 
33
 
34
- > Join our **[Wechat](#)** and **[Discord](https://discord.gg/dNBrdrGGMa)** group to discuss and find help from us.
35
 
36
- | Wechat Group | Xiaohongshu | X | Discord |
37
- |--------------------------------------------------|-------------------------------------------------------|---------------------------------------------|---------------------------------------------------|
38
- | <img src="assets/qrcode/wechat.png" height=140> | <img src="assets/qrcode/xiaohongshu.png" height=140> | <img src="assets/qrcode/x.png" height=140> | <img src="assets/qrcode/discord.png" height=140> |
39
 
40
- ## ☯️ **HunyuanWorld 1.0**
 
41
 
42
- ### Abstract
43
- Creating immersive and playable 3D worlds from texts or images remains a fundamental challenge in computer vision and graphics. Existing world generation approaches typically fall into two categories: video-based methods that offer rich diversity but lack 3D consistency and rendering efficiency, and 3D-based methods that provide geometric consistency but struggle with limited training data and memory-inefficient representations. To address these limitations, we present HunyuanWorld 1.0, a novel framework that combines the best of both sides for generating immersive, explorable, and interactive 3D worlds from text and image conditions. Our approach features three key advantages: 1) 360° immersive experiences via panoramic world proxies; 2) mesh export capabilities for seamless compatibility with existing computer graphics pipelines; 3) disentangled object representations for augmented interactivity. The core of our framework is a semantically layered 3D mesh representation that leverages panoramic images as 360° world proxies for semantic-aware world decomposition and reconstruction, enabling the generation of diverse 3D worlds. Extensive experiments demonstrate that our method achieves state-of-the-art performance in generating coherent, explorable, and interactive 3D worlds while enabling versatile applications in virtual reality, physical simulation, game development, and interactive content creation.
44
-
45
- <p align="center">
46
- <img src="assets/application.png">
47
- </p>
48
 
49
- ### Architecture
50
- Tencent HunyuanWorld-1.0's generation architecture integrates panoramic proxy generation, semantic layering, and hierarchical 3D reconstruction to achieve high-quality scene-scale 360° 3D world generation, supporting both text and image inputs.
51
 
52
  <p align="left">
53
  <img src="assets/arch.jpg">
 
1
+ ---
2
+ title: HunyuanWorld-1.0 Demo
3
+ emoji: 🌍
4
+ colorFrom: blue
5
+ colorTo: green
6
+ sdk: gradio
7
+ sdk_version: "4.42.0"
8
+ app_file: app.py
9
+ pinned: false
10
+ license: other
11
+ models:
12
+ - black-forest-labs/FLUX.1-dev
13
+ - tencent/HunyuanWorld-1
14
+ hardware: nvidia-t4-small
15
+ ---
16
 
17
+ # HunyuanWorld-1.0 Demo Space
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
18
 
19
+ This is a Gradio demo for [Tencent-Hunyuan/HunyuanWorld-1.0](https://github.com/Tencent-Hunyuan/HunyuanWorld-1.0), a one-stop solution for text-driven 3D scene generation.
20
 
21
+ ## How to Use
 
 
 
 
22
 
23
+ 1. **Panorama Generation**:
24
+ - **Text-to-Panorama**: Enter a text prompt and generate a 360° panorama image.
25
+ - **Image-to-Panorama**: Upload an image and provide a prompt to extend it into a panorama.
26
+ 2. **Scene Generation**:
27
+ - After generating a panorama, click "Send to Scene Generation".
28
+ - Provide labels for foreground objects to be separated into layers.
29
+ - Click "Generate 3D Scene" to create a 3D mesh from the panorama.
30
 
31
+ ## Technical Details
32
 
33
+ This space combines two core functionalities of the HunyuanWorld-1.0 model:
 
 
34
 
35
+ - **Panorama Generation**: Creates immersive 360° images from text or existing images.
36
+ - **3D Scene Reconstruction**: Decomposes a panorama into layers and reconstructs a 3D mesh.
37
 
38
+ This demo is running on an NVIDIA T4 GPU. Due to the size of the models, the initial startup may take a few minutes.
 
 
 
 
 
39
 
 
 
40
 
41
  <p align="left">
42
  <img src="assets/arch.jpg">