Improve model card: Add detailed usage, paper abstract, and update metadata
Browse filesThis PR significantly enhances the model card for Hunyuan3D 2.1 by:
* **Updating metadata**: Correctly sets the `library_name` to `diffusers`, improving the model's discoverability and categorization on the Hugging Face Hub, as indicated by the model's internal structure (`_diffusers_version`) and provided code examples.
* **Adding paper details**: Includes the full paper title, link to the Hugging Face Paper page, and the abstract at the top of the card for quick access to core information.
* **Integrating comprehensive usage instructions**: Adds the "Get Started" section from the GitHub repository, covering installation requirements, code usage examples, and how to run the Gradio demo. This directly addresses the need for "sample usage" and makes the model much more accessible for users.
* **Enriching model documentation**: Incorporates "News", "Architecture", "Performance", and "Models Zoo" sections from the GitHub README, providing a deeper understanding of the model's capabilities, design, and evaluation.
* **Updating external links and acknowledgements**: Refines existing badges with more specific URLs and adds new ones (e.g., for X and Community) from the GitHub repository, ensuring users have access to the most current resources. Expands acknowledgements to include all relevant projects.
These changes transform the model card into a more complete and user-friendly resource, making it easier for researchers and developers to understand, use, and contribute to Hunyuan3D 2.1.
@@ -1,30 +1,150 @@
|
|
1 |
---
|
2 |
-
|
|
|
|
|
|
|
3 |
license: other
|
4 |
license_name: tencent-hunyuan-community
|
5 |
license_link: https://github.com/Tencent-Hunyuan/Hunyuan3D-2.1/blob/main/LICENSE
|
6 |
-
language:
|
7 |
-
- en
|
8 |
-
- zh
|
9 |
-
tags:
|
10 |
-
- image-to-3d
|
11 |
-
- text-to-3d
|
12 |
pipeline_tag: image-to-3d
|
|
|
|
|
|
|
13 |
---
|
14 |
|
|
|
|
|
|
|
|
|
|
|
|
|
15 |
<p align="center">
|
16 |
<img src="https://raw.githubusercontent.com/Tencent-Hunyuan/Hunyuan3D-2.1/refs/heads/main/assets/images/teaser.jpg">
|
17 |
</p>
|
18 |
|
19 |
<div align="center">
|
20 |
-
<a href=https://3d.hunyuan.tencent.com target="_blank"><img src=https://img.shields.io/badge/
|
21 |
<a href=https://huggingface.co/spaces/tencent/Hunyuan3D-2.1 target="_blank"><img src=https://img.shields.io/badge/%F0%9F%A4%97%20Demo-276cb4.svg height=22px></a>
|
22 |
<a href=https://huggingface.co/tencent/Hunyuan3D-2.1 target="_blank"><img src=https://img.shields.io/badge/%F0%9F%A4%97%20Models-d96902.svg height=22px></a>
|
23 |
-
<a href=https://
|
24 |
-
<a href=https://discord.gg/
|
25 |
-
|
|
|
|
|
26 |
</div>
|
27 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
28 |
## π BibTeX
|
29 |
|
30 |
If you found this repository helpful, please cite our report:
|
@@ -58,13 +178,12 @@ If you found this repository helpful, please cite our report:
|
|
58 |
}
|
59 |
```
|
60 |
|
61 |
-
|
62 |
-
|
63 |
## Acknowledgements
|
64 |
|
65 |
We would like to thank the contributors to
|
66 |
-
the [TripoSG](https://github.com/VAST-AI-Research/TripoSG), [DINOv2](https://github.com/facebookresearch/dinov2), [Stable Diffusion](https://github.com/Stability-AI/stablediffusion), [FLUX](https://github.com/black-forest-labs/flux), [diffusers](https://github.com/huggingface/diffusers)
|
67 |
-
and [
|
|
|
68 |
|
69 |
## Star History
|
70 |
|
|
|
1 |
---
|
2 |
+
language:
|
3 |
+
- en
|
4 |
+
- zh
|
5 |
+
library_name: diffusers
|
6 |
license: other
|
7 |
license_name: tencent-hunyuan-community
|
8 |
license_link: https://github.com/Tencent-Hunyuan/Hunyuan3D-2.1/blob/main/LICENSE
|
|
|
|
|
|
|
|
|
|
|
|
|
9 |
pipeline_tag: image-to-3d
|
10 |
+
tags:
|
11 |
+
- image-to-3d
|
12 |
+
- text-to-3d
|
13 |
---
|
14 |
|
15 |
+
This model was presented in the paper [Hunyuan3D 2.1: From Images to High-Fidelity 3D Assets with Production-Ready PBR Material](https://huggingface.co/papers/2506.15442).
|
16 |
+
|
17 |
+
## Abstract
|
18 |
+
|
19 |
+
3D AI-generated content (AIGC) is a passionate field that has significantly accelerated the creation of 3D models in gaming, film, and design. Despite the development of several groundbreaking models that have revolutionized 3D generation, the field remains largely accessible only to researchers, developers, and designers due to the complexities involved in collecting, processing, and training 3D models. To address these challenges, we introduce Hunyuan3D 2.1 as a case study in this tutorial. This tutorial offers a comprehensive, step-by-step guide on processing 3D data, training a 3D generative model, and evaluating its performance using Hunyuan3D 2.1, an advanced system for producing high-resolution, textured 3D assets. The system comprises two core components: the Hunyuan3D-DiT for shape generation and the Hunyuan3D-Paint for texture synthesis. We will explore the entire workflow, including data preparation, model architecture, training strategies, evaluation metrics, and deployment. By the conclusion of this tutorial, you will have the knowledge to finetune or develop a robust 3D generative model suitable for applications in gaming, virtual reality, and industrial design.
|
20 |
+
|
21 |
<p align="center">
|
22 |
<img src="https://raw.githubusercontent.com/Tencent-Hunyuan/Hunyuan3D-2.1/refs/heads/main/assets/images/teaser.jpg">
|
23 |
</p>
|
24 |
|
25 |
<div align="center">
|
26 |
+
<a href=https://3d.hunyuan.tencent.com target="_blank"><img src=https://img.shields.io/badge/Official%20Site-333399.svg?logo=homepage height=22px></a>
|
27 |
<a href=https://huggingface.co/spaces/tencent/Hunyuan3D-2.1 target="_blank"><img src=https://img.shields.io/badge/%F0%9F%A4%97%20Demo-276cb4.svg height=22px></a>
|
28 |
<a href=https://huggingface.co/tencent/Hunyuan3D-2.1 target="_blank"><img src=https://img.shields.io/badge/%F0%9F%A4%97%20Models-d96902.svg height=22px></a>
|
29 |
+
<a href=https://3d-models.hunyuan.tencent.com/ target="_blank"><img src= https://img.shields.io/badge/Page-bb8a2e.svg?logo=github height=22px></a>
|
30 |
+
<a href=https://discord.gg/dNBrdrGGMa target="_blank"><img src= https://img.shields.io/badge/Discord-white.svg?logo=discord height=22px></a>
|
31 |
+
<a href=https://arxiv.org/pdf/2506.15442 target="_blank"><img src=https://img.shields.io/badge/Report-b5212f.svg?logo=arxiv height=22px></a>
|
32 |
+
<a href=https://x.com/TencentHunyuan target="_blank"><img src=https://img.shields.io/badge/Hunyuan-black.svg?logo=x height=22px></a>
|
33 |
+
<a href="#community-resources" target="_blank"><img src=https://img.shields.io/badge/Community-lavender.svg?logo=homeassistantcommunitystore height=22px></a>
|
34 |
</div>
|
35 |
|
36 |
+
## π₯ News
|
37 |
+
|
38 |
+
- Jun 19, 2025: π We present the [technical report](https://arxiv.org/pdf/2506.15442) of Hunyuan3D-2.1, please check out the details and spark some discussion!
|
39 |
+
- Jun 13, 2025: π€ We release the first production-ready 3D asset generation model, Hunyuan3D-2.1!
|
40 |
+
|
41 |
+
> Join our **[Wechat](#)** and **[Discord](https://discord.gg/dNBrdrGGMa)** group to discuss and find help from us.
|
42 |
+
|
43 |
+
| Wechat Group | Xiaohongshu | X | Discord |
|
44 |
+
|--------------------------------------------------|-------------------------------------------------------|---------------------------------------------|---------------------------------------------------|
|
45 |
+
| <img src="assets/qrcode/wechat.png" height=140> | <img src="assets/qrcode/xiaohongshu.png" height=140> | <img src="assets/qrcode/x.png" height=140> | <img src="assets/qrcode/discord.png" height=140> |
|
46 |
+
|
47 |
+
## β―οΈ **Hunyuan3D 2.1**
|
48 |
+
|
49 |
+
### Architecture
|
50 |
+
|
51 |
+
Tencent Hunyuan3D-2.1 is a scalable 3D asset creation system that advances state-of-the-art 3D generation through two pivotal innovations: Fully Open-Source Framework and Physically-Based Rendering (PBR) Texture Synthesis. For the first time, the system releases full model weights and training code, enabling community developers to directly fine-tune and extend the model for diverse downstream applications. This transparency accelerates academic research and industrial deployment. Moreover, replacing the prior RGB-based texture model, the upgraded PBR pipeline leverages physics-grounded material simulation to generate textures with photorealistic light interaction (e.g., metallic reflections, subsurface scattering).
|
52 |
+
|
53 |
+
<p align="left">
|
54 |
+
<img src="assets/images/pipeline.png">
|
55 |
+
</p>
|
56 |
+
|
57 |
+
### Performance
|
58 |
+
|
59 |
+
We have evaluated Hunyuan3D 2.1 with other open-source as well as close-source 3d-generation methods.
|
60 |
+
The numerical results indicate that Hunyuan3D 2.1 surpasses all baselines in the quality of generated textured 3D assets
|
61 |
+
and the condition following ability.
|
62 |
+
|
63 |
+
| Model | ULIP-T(β¬) | ULIP-I(β¬) | Uni3D-T(β¬) | Uni3D-I(β¬) |
|
64 |
+
|-------------------------|-----------|-------------|-------------|---------------|
|
65 |
+
| Michelangelo | 0.0752 | 0.1152 | 0.2133 | 0.2611 |
|
66 |
+
| Craftsman | 0.0745 | 0.1296 | 0.2375 | 0.2987 |
|
67 |
+
| TripoSG | 0.0767 | 0.1225 | 0.2506 | 0.3129 |
|
68 |
+
| Step1X-3D | 0.0735 | 0.1183 | 0.2554 | 0.3195 |
|
69 |
+
| Trellis | 0.0769 | 0.1267 | 0.2496 | 0.3116 |
|
70 |
+
| Direct3D-S2 | 0.0706 | 0.1134 | 0.2346 | 0.2930 |
|
71 |
+
| Hunyuan3D-Shape-2.1 | **0.0774** | **0.1395** | **0.2556** | **0.3213** |
|
72 |
+
|
73 |
+
|
74 |
+
| Model | CLIP-FiD(β¬) | CMMD(β¬) | CLIP-I(β¬) | LPIPS(β¬) |
|
75 |
+
|-------------------------|-----------|-------------|-------------|---------------|
|
76 |
+
| SyncMVD-IPA | 28.39 | 2.397 | 0.8823 | 0.1423 |
|
77 |
+
| TexGen | 28.24 | 2.448 | 0.8818 | 0.1331 |
|
78 |
+
| Hunyuan3D-2.0 | 26.44 | 2.318 | 0.8893 | 0.1261 |
|
79 |
+
| Hunyuan3D-Paint-2.1 | **24.78** | **2.191** | **0.9207** | **0.1211** |
|
80 |
+
|
81 |
+
|
82 |
+
## π Models Zoo
|
83 |
+
|
84 |
+
It takes 10 GB VRAM for shape generation, 21GB for texture generation and 29GB for shape and texture generation in total.
|
85 |
+
|
86 |
+
|
87 |
+
| Model | Description | Date | Size | Huggingface |
|
88 |
+
|----------------------------|-----------------------------|------------|------|-------------------------------------------------------------------------------------------|
|
89 |
+
| Hunyuan3D-Shape-v2-1 | Image to Shape Model | 2025-06-14 | 3.3B | [Download](https://huggingface.co/tencent/Hunyuan3D-2.1/tree/main/hunyuan3d-dit-v2-1) |
|
90 |
+
| Hunyuan3D-Paint-v2-1 | Texture Generation Model | 2025-06-14 | 2B | [Download](https://huggingface.co/tencent/Hunyuan3D-2.1/tree/main/hunyuan3d-paint-v2-1) |
|
91 |
+
|
92 |
+
|
93 |
+
## π€ Get Started with Hunyuan3D 2.1
|
94 |
+
|
95 |
+
Hunyuan3D 2.1 supports Macos, Windows, Linux. You may follow the next steps to use Hunyuan3D 2.1 via:
|
96 |
+
|
97 |
+
### Install Requirements
|
98 |
+
We test our model on an A100 GPU with Python 3.10 and PyTorch 2.5.1+cu124.
|
99 |
+
```bash
|
100 |
+
pip install torch==2.5.1 torchvision==0.20.1 torchaudio==2.5.1 --index-url https://download.pytorch.org/whl/cu124
|
101 |
+
pip install -r requirements.txt
|
102 |
+
|
103 |
+
cd hy3dpaint/custom_rasterizer
|
104 |
+
pip install -e .
|
105 |
+
cd ../..
|
106 |
+
cd hy3dpaint/DifferentiableRenderer
|
107 |
+
bash compile_mesh_painter.sh
|
108 |
+
cd ../..
|
109 |
+
|
110 |
+
wget https://github.com/xinntao/Real-ESRGAN/releases/download/v0.1.0/RealESRGAN_x4plus.pth -P hy3dpaint/ckpt
|
111 |
+
```
|
112 |
+
|
113 |
+
### Code Usage
|
114 |
+
|
115 |
+
We designed a diffusers-like API to use our shape generation model - Hunyuan3D-Shape and texture synthesis model -
|
116 |
+
Hunyuan3D-Paint.
|
117 |
+
|
118 |
+
```python
|
119 |
+
import sys
|
120 |
+
sys.path.insert(0, './hy3dshape')
|
121 |
+
sys.path.insert(0, './hy3dpaint')
|
122 |
+
from textureGenPipeline import Hunyuan3DPaintPipeline
|
123 |
+
from textureGenPipeline import Hunyuan3DPaintPipeline, Hunyuan3DPaintConfig
|
124 |
+
from hy3dshape.pipelines import Hunyuan3DDiTFlowMatchingPipeline
|
125 |
+
|
126 |
+
# let's generate a mesh first
|
127 |
+
shape_pipeline = Hunyuan3DDiTFlowMatchingPipeline.from_pretrained('tencent/Hunyuan3D-2.1')
|
128 |
+
mesh_untextured = shape_pipeline(image='assets/demo.png')[0]
|
129 |
+
|
130 |
+
paint_pipeline = Hunyuan3DPaintPipeline(Hunyuan3DPaintConfig(max_num_view=6, resolution=512))
|
131 |
+
mesh_textured = paint_pipeline(mesh_path, image_path='assets/demo.png')
|
132 |
+
```
|
133 |
+
|
134 |
+
|
135 |
+
### Gradio App
|
136 |
+
|
137 |
+
You could also host a [Gradio](https://www.gradio.app/) App in your own computer via:
|
138 |
+
|
139 |
+
|
140 |
+
```bash
|
141 |
+
python3 gradio_app.py \
|
142 |
+
--model_path tencent/Hunyuan3D-2.1 \
|
143 |
+
--subfolder hunyuan3d-dit-v2-1 \
|
144 |
+
--texgen_model_path tencent/Hunyuan3D-2.1 \
|
145 |
+
--low_vram_mode
|
146 |
+
```
|
147 |
+
|
148 |
## π BibTeX
|
149 |
|
150 |
If you found this repository helpful, please cite our report:
|
|
|
178 |
}
|
179 |
```
|
180 |
|
|
|
|
|
181 |
## Acknowledgements
|
182 |
|
183 |
We would like to thank the contributors to
|
184 |
+
the [TripoSG](https://github.com/VAST-AI-Research/TripoSG), [Trellis](https://github.com/microsoft/TRELLIS), [DINOv2](https://github.com/facebookresearch/dinov2), [Stable Diffusion](https://github.com/Stability-AI/stablediffusion), [FLUX](https://github.com/black-forest-labs/flux), [diffusers](https://github.com/huggingface/diffusers), [HuggingFace](https://huggingface.co), [CraftsMan3D](https://github.com/wyysf-98/CraftsMan3D),
|
185 |
+
and [Michelangelo](https://github.com/NeuralCarver/Michelangelo/tree/main) repositories, for their open research and
|
186 |
+
exploration.
|
187 |
|
188 |
## Star History
|
189 |
|