10 36 59

Jiaming Han

csuhan

https://csuhan.com

csuhan

AI & ML interests

Computer Vision

Recent Activity

upvoted a paper 8 days ago

Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens

upvoted a collection 27 days ago

BitDance

liked a Space about 1 month ago

shallowdream204/BitDance-14B-64x

View all activity

Organizations

None yet

upvoted a paper 8 days ago

Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens

Paper • 2603.19232 • Published 9 days ago • 33

upvoted a collection 27 days ago

BitDance

Collection

BitDance: Open-source autoregressive model with binary visual tokens. A research project for building powerful multimodal autoregressive model. • 10 items • Updated 26 days ago • 11

liked a Space about 1 month ago

BitDance-14B-64x

🚀

Open-source autoregressive model with binary visual tokens.

authored 2 papers about 1 month ago

UniWeTok: An Unified Binary Tokenizer with Codebook Size $\mathit{2^{128}}$ for Unified Multimodal Large Language Model

Paper • 2602.14178 • Published Feb 15 • 14

BitDance: Scaling Autoregressive Generative Models with Binary Tokens

Paper • 2602.14041 • Published Feb 15 • 53

upvoted 2 papers about 1 month ago

UniWeTok: An Unified Binary Tokenizer with Codebook Size 2^{128} for Unified Multimodal Large Language Model

Paper • 2602.14178 • Published Feb 15 • 14

BitDance: Scaling Autoregressive Generative Models with Binary Tokens

Paper • 2602.14041 • Published Feb 15 • 53

updated a dataset about 1 month ago

csuhan/bitdance_demo

Viewer • Updated Feb 15 • 141 • 603

published a dataset about 1 month ago

csuhan/bitdance_demo

Viewer • Updated Feb 15 • 141 • 603

liked 2 datasets about 2 months ago

#2 opened 2 months ago by

Mejistus

upvoted 2 papers 3 months ago

Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing

Paper • 2512.17909 • Published Dec 19, 2025 • 37

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

Paper • 2512.13281 • Published Dec 15, 2025 • 65

upvoted 3 papers 4 months ago

OneThinker: All-in-one Reasoning Model for Image and Video

Paper • 2512.03043 • Published Dec 2, 2025 • 33

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published Nov 27, 2025 • 244

The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation

Paper • 2511.20256 • Published Nov 25, 2025 • 28

liked a dataset 5 months ago

jasonzhango/SPAR-7M-RGBD

Updated Jun 15, 2025 • 733 • 7

upvoted a paper 6 months ago

VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning

Paper • 2510.08555 • Published Oct 9, 2025 • 65

liked a dataset 6 months ago

WINDop/OpenGPT-4o-Image

Updated Nov 2, 2025 • 1.42k • 22

Jiaming Han

AI & ML interests

Recent Activity

Organizations

csuhan's activity

BitDance-14B-64x

Can some details about the image generation process be added?