Scaling RL to Long Videos
Efficient-Large-Model
SANA-1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer
- SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer
  Paper • 2501.18427 • Published • 20
- Efficient-Large-Model/SANA1.5_4.8B_1024px
  Text-to-Image • Updated • 54 • 22
- Efficient-Large-Model/SANA1.5_4.8B_1024px_diffusers
  Text-to-Image • Updated • 12
- Efficient-Large-Model/SANA1.5_1.6B_1024px
  Text-to-Image • Updated • 400 • 1
A series of VILA models that specialize in **long-context** abilities
- Efficient-Large-Model/NVILA-15B
  Text Generation • Updated • 46.9k • 20
- Efficient-Large-Model/NVILA-Lite-15B
  Text Generation • Updated • 247 • 4
- Efficient-Large-Model/NVILA-Lite-8B
  Text Generation • Updated • 4.43k • 2
- Efficient-Large-Model/NVILA-Lite-8B-stage2
  Text Generation • Updated • 6 • 1
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
- SanaSprint
  Ultra-fast, high-quality image generation • 412
- SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
  Paper • 2503.09641 • Published • 40
- Efficient-Large-Model/Sana_Sprint_1.6B_1024px
  Text-to-Image • Updated • 62 • 15
- Efficient-Large-Model/Sana_Sprint_0.6B_1024px
  Text-to-Image • Updated • 24 • 4
⚡ Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
- Efficient-Large-Model/Sana_1600M_1024px
  Text-to-Image • Updated • 540 • 211
- Efficient-Large-Model/Sana_1600M_1024px_BF16
  Text-to-Image • Updated • 252 • 13
- Efficient-Large-Model/Sana_1600M_1024px_BF16_ControlNet_HED
  Text-to-Image • Updated • 28
- Efficient-Large-Model/Sana_600M_1024px_ControlNet_HED
  Text-to-Image • Updated • 23
- Efficient-Large-Model/Llama-3-VILA1.5-8B
  Text Generation • Updated • 1.46k • 35
- Efficient-Large-Model/VILA1.5-40b
  Text Generation • Updated • 884 • 17
- Efficient-Large-Model/VILA1.5-3b
  Text Generation • Updated • 7.08k • 29
- Efficient-Large-Model/VILA1.5-3b-AWQ
  Text Generation • Updated • 17 • 5