InstantX

community

https://huggingface.co/InstantX

instantX-research

AI & ML interests

We open source generative models

Recent Activity

jamesliu1217 authored a paper 15 days ago

CollectionLoRA: Collecting 50 Effects in 1 LoRA via Multi-Teacher On-Policy Distillation

jamesliu1217 submitted a paper 15 days ago

CollectionLoRA: Collecting 50 Effects in 1 LoRA via Multi-Teacher On-Policy Distillation

zhen-nan authored a paper 19 days ago

L2P: Unlocking Latent Potential for Pixel Generation

View all activity

authored a paper 15 days ago

CollectionLoRA: Collecting 50 Effects in 1 LoRA via Multi-Teacher On-Policy Distillation

Paper • 2605.25378 • Published 19 days ago • 61

submitted a paper to Daily Papers 15 days ago

CollectionLoRA: Collecting 50 Effects in 1 LoRA via Multi-Teacher On-Policy Distillation

Paper • 2605.25378 • Published 19 days ago • 61

authored 2 papers 19 days ago

L2P: Unlocking Latent Potential for Pixel Generation

Paper • 2605.12013 • Published May 12 • 36

PixVerve: Advancing Native UHR Image Generation to 100MP with a Large-Scale High-Quality Dataset

Paper • 2605.20147 • Published 25 days ago • 11

submitted a paper to Daily Papers about 1 month ago

L2P: Unlocking Latent Potential for Pixel Generation

Paper • 2605.12013 • Published May 12 • 36

authored a paper about 2 months ago

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time

Paper • 2604.11626 • Published Apr 13 • 102

submitted a paper to Daily Papers 4 months ago

JUST-DUB-IT: Video Dubbing via Joint Audio-Visual Diffusion

Paper • 2601.22143 • Published Jan 29 • 12

authored 2 papers 6 months ago

DiverseAR: Boosting Diversity in Bitwise Autoregressive Image Generation

Paper • 2512.02931 • Published Dec 2, 2025

OmniPSD: Layered PSD Generation with Diffusion Transformer

Paper • 2512.09247 • Published Dec 10, 2025 • 51

authored 3 papers 6 months ago

Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model

Paper • 2403.07764 • Published Mar 12, 2024 • 1

Stable-Hair v2: Real-World Hair Transfer via Multiple-View Diffusion Model

Paper • 2507.07591 • Published Jul 10, 2025

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published Dec 4, 2025 • 178

authored a paper 6 months ago

PosterCopilot: Toward Layout Reasoning and Controllable Editing for Professional Graphic Design

Paper • 2512.04082 • Published Dec 3, 2025 • 14

authored a paper 6 months ago

DiP: Taming Diffusion Models in Pixel Space

Paper • 2511.18822 • Published Nov 24, 2025 • 30

authored 6 papers 8 months ago

InstantIR: Blind Image Restoration with Instant Generative Reference

Paper • 2410.06551 • Published Oct 9, 2024 • 6

3CAD: A Large-Scale Real-World 3C Product Dataset for Unsupervised Anomaly

Paper • 2502.05761 • Published Feb 9, 2025 • 7

Dynamic Pyramid Network for Efficient Multimodal Large Language Model

Paper • 2503.20322 • Published Mar 26, 2025 • 1

OneIG-Bench: Omni-dimensional Nuanced Evaluation for Image Generation

Paper • 2506.07977 • Published Jun 9, 2025 • 40

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

Paper • 2508.10711 • Published Aug 14, 2025 • 146

WithAnyone: Towards Controllable and ID Consistent Image Generation

Paper • 2510.14975 • Published Oct 16, 2025 • 86