-
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding
Paper • 2312.04461 • Published • 62 -
InstantID: Zero-shot Identity-Preserving Generation in Seconds
Paper • 2401.07519 • Published • 58 -
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Paper • 2306.07691 • Published • 9
Manik Hossain
manik-hossain
·
AI & ML interests
None yet
Organizations
startup
-
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding
Paper • 2312.04461 • Published • 62 -
InstantID: Zero-shot Identity-Preserving Generation in Seconds
Paper • 2401.07519 • Published • 58 -
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Paper • 2306.07691 • Published • 9
Audio
datasets
0
None public yet