Hyungyu seo
hgseo
Β·
AI & ML interests
None yet
Recent Activity
reacted
to
BestWishYsh's
post with π about 8 hours ago
π Introducing Helios: a 14B real-time long-video generation model!
Itβs completely wildβfaster than 1.3B models and achieves this without using self-forcing. Welcome to the new era of video generation! ππ
π» Code: https://github.com/PKU-YuanGroup/Helios
π Page: https://pku-yuangroup.github.io/Helios-Page
π Paper: https://huggingface.co/papers/2603.04379
πΉ True Single-GPU Extreme Speed β‘οΈ
No need to rely on traditional workarounds like KV-cache, quantization, sparse/linear attention, or TinyVAE. Helios hits an end-to-end 19.5 FPS on a single H100!
Training is also highly accessible: an 80GB VRAM can fit four 14B models.
πΉ Solving Long-Video "Drift" from the Core π₯
Tired of visual drift and repetitive loops? We ditched traditional hacks (like error banks, self-forcing, or keyframe sampling).
Instead, our innovative training strategy simulates & eliminates drift directly, keeping minute-long videos incredibly coherent with stunning quality. β¨
πΉ 3 Model Variants for Full Coverage π οΈ
With a unified architecture natively supporting T2V, I2V, and V2V, we are open-sourcing 3 flavors:
1οΈβ£ Base: Single-stage denoising for extreme high-fidelity.
2οΈβ£ Mid: Pyramid denoising + CFG-Zero for the perfect balance of quality & throughput.
3οΈβ£ Distilled: Adversarial Distillation (DMD) for ultra-fast, few-step generation.
πΉ Day-0 Ecosystem Ready π
We wanted deployment to be a breeze from the second we launched. Helios drops with comprehensive Day-0 hardware and framework support:
β
Huawei Ascend-NPU
β
HuggingFace Diffusers
β
vLLM-Omni
β
SGLang-Diffusion
Try it out and let us know what you think!
liked
a model 3 days ago
Qwen/Qwen3.5-9B-Base liked
a model 3 days ago
Qwen/Qwen3.5-9B