metadata

license: apache-2.0
language:
  - en
  - zh

GarmentGPT Models

This repository contains the model components required by the GarmentGPT project.

Models Included

This repository hosts three key components:

Vision-Language Model (LLM): A fine-tuned multimodal model that generates discrete garment tokens from an input image.
Edge Codec: A VQ-VAE-based model that decodes edge indices into high-fidelity geometric curves. Its configuration is in codec_config.yaml and its weights are in codec_model.pth.
RT Codec: A VQ-VAE-based model that decodes location indices into 3D panel rotation and translation. Its configuration is in rt_config.yaml and its weights are in rt_model.pth.
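
To illustrate what "decoding indices" means for a VQ-VAE-style codec, here is a minimal sketch of the codebook lookup step. The function name, codebook size, and embedding values are hypothetical and are not taken from the GarmentGPT code. A real codec would pass the looked-up vectors through a learned decoder network to produce curves or rotation and translation values.

```python
from typing import List, Sequence


def lookup_codes(
    indices: Sequence[int], codebook: Sequence[Sequence[float]]
) -> List[List[float]]:
    """Map discrete token indices to their codebook embedding vectors.

    Hypothetical helper for illustration only; not part of GarmentGPT.
    """
    vectors = []
    for i in indices:
        if not 0 <= i < len(codebook):
            raise IndexError(f"index {i} is outside the codebook (size {len(codebook)})")
        vectors.append(list(codebook[i]))
    return vectors


# Toy codebook: 4 entries of dimension 2 (made-up values).
toy_codebook = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

# Three discrete tokens, as a generator model might emit them.
latents = lookup_codes([2, 0, 3], toy_codebook)
print(latents)  # [[0.0, 1.0], [0.0, 0.0], [1.0, 1.0]]
```

In this sketch the lookup is a plain table read, which is why the upstream model only needs to predict integer indices rather than continuous geometry.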

Usage

These models are intended for use with the main application code at https://github.com/ChimerAI-MMLab/Garment-GPT. The inference script in that repository downloads these files automatically.
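
If you want to fetch the codec files yourself rather than rely on the inference script, the following is a minimal sketch using `hf_hub_download` from the `huggingface_hub` package. The repository id below is a placeholder, and the sketch assumes the four files sit at the repository root; adjust both to match the actual repository layout.

```python
from pathlib import Path
from typing import Dict, Sequence

# Placeholder repository id (assumption): replace with the real "owner/name".
REPO_ID = "your-namespace/GarmentGPT"

# File names as listed in this README; assumed to be at the repository root.
CODEC_FILES = [
    "codec_config.yaml",
    "codec_model.pth",
    "rt_config.yaml",
    "rt_model.pth",
]


def fetch_codec_files(
    repo_id: str = REPO_ID, files: Sequence[str] = CODEC_FILES
) -> Dict[str, Path]:
    """Download each file into the local Hugging Face cache and return its path."""
    # Imported lazily so this module loads even if huggingface_hub is missing.
    from huggingface_hub import hf_hub_download

    return {
        name: Path(hf_hub_download(repo_id=repo_id, filename=name))
        for name in files
    }


# Example call (requires network access and a valid repository id):
# paths = fetch_codec_files("owner/name")
# print(paths["codec_config.yaml"])
```

Files are stored in the local Hugging Face cache, so repeated calls do not download them again.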