HuggingFace Transformers can load us.
Efficient-Large-Model
community
AI & ML interests
None defined yet.
Recent Activity
View all activity
-
Efficient-Large-Model/NVILA-15B
Text Generation β’ Updated β’ 13k β’ 22 -
Efficient-Large-Model/NVILA-Lite-15B
Text Generation β’ Updated β’ 7.09k β’ 4 -
Efficient-Large-Model/NVILA-Lite-8B
Text Generation β’ Updated β’ 14.2k β’ 4 -
Efficient-Large-Model/NVILA-Lite-8B-stage2
Text Generation β’ Updated β’ 6 β’ 1
πSANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
-
413
SanaSprint
πUltra fast high quality image generation
-
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
Paper β’ 2503.09641 β’ Published β’ 39 -
Efficient-Large-Model/Sana_Sprint_1.6B_1024px
Text-to-Image β’ Updated β’ 54 β’ 15 -
Efficient-Large-Model/Sana_Sprint_0.6B_1024px
Text-to-Image β’ Updated β’ 44 β’ 5
A series of VILA models that specialize for **long-context** abilities
Scaling RL to Long Videos
SANA-1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer
-
SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer
Paper β’ 2501.18427 β’ Published β’ 21 -
Efficient-Large-Model/SANA1.5_4.8B_1024px
Text-to-Image β’ Updated β’ 81 β’ β’ 23 -
Efficient-Large-Model/SANA1.5_4.8B_1024px_diffusers
Text-to-Image β’ Updated β’ β’ 14 -
Efficient-Large-Model/SANA1.5_1.6B_1024px
Text-to-Image β’ Updated β’ 835 β’ β’ 1
β‘οΈSana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
-
Efficient-Large-Model/Sana_1600M_1024px
Text-to-Image β’ Updated β’ 457 β’ β’ 213 -
Efficient-Large-Model/Sana_1600M_1024px_BF16
Text-to-Image β’ Updated β’ 70 β’ 13 -
Efficient-Large-Model/Sana_1600M_1024px_BF16_ControlNet_HED
Text-to-Image β’ Updated β’ 33 -
Efficient-Large-Model/Sana_600M_1024px_ControlNet_HED
Text-to-Image β’ Updated β’ 56
-
Efficient-Large-Model/Llama-3-VILA1.5-8B
Text Generation β’ Updated β’ 748 β’ 37 -
Efficient-Large-Model/VILA1.5-40b
Text Generation β’ Updated β’ 173 β’ 17 -
Efficient-Large-Model/VILA1.5-3b
Text Generation β’ Updated β’ 7.9k β’ 30 -
Efficient-Large-Model/VILA1.5-3b-AWQ
Text Generation β’ Updated β’ 57 β’ 5
HuggingFace Transformers can load us.
Scaling RL to Long Videos
-
Efficient-Large-Model/NVILA-15B
Text Generation β’ Updated β’ 13k β’ 22 -
Efficient-Large-Model/NVILA-Lite-15B
Text Generation β’ Updated β’ 7.09k β’ 4 -
Efficient-Large-Model/NVILA-Lite-8B
Text Generation β’ Updated β’ 14.2k β’ 4 -
Efficient-Large-Model/NVILA-Lite-8B-stage2
Text Generation β’ Updated β’ 6 β’ 1
SANA-1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer
-
SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer
Paper β’ 2501.18427 β’ Published β’ 21 -
Efficient-Large-Model/SANA1.5_4.8B_1024px
Text-to-Image β’ Updated β’ 81 β’ β’ 23 -
Efficient-Large-Model/SANA1.5_4.8B_1024px_diffusers
Text-to-Image β’ Updated β’ β’ 14 -
Efficient-Large-Model/SANA1.5_1.6B_1024px
Text-to-Image β’ Updated β’ 835 β’ β’ 1
πSANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
-
413
SanaSprint
πUltra fast high quality image generation
-
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
Paper β’ 2503.09641 β’ Published β’ 39 -
Efficient-Large-Model/Sana_Sprint_1.6B_1024px
Text-to-Image β’ Updated β’ 54 β’ 15 -
Efficient-Large-Model/Sana_Sprint_0.6B_1024px
Text-to-Image β’ Updated β’ 44 β’ 5
β‘οΈSana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
-
Efficient-Large-Model/Sana_1600M_1024px
Text-to-Image β’ Updated β’ 457 β’ β’ 213 -
Efficient-Large-Model/Sana_1600M_1024px_BF16
Text-to-Image β’ Updated β’ 70 β’ 13 -
Efficient-Large-Model/Sana_1600M_1024px_BF16_ControlNet_HED
Text-to-Image β’ Updated β’ 33 -
Efficient-Large-Model/Sana_600M_1024px_ControlNet_HED
Text-to-Image β’ Updated β’ 56
A series of VILA models that specialize for **long-context** abilities
-
Efficient-Large-Model/Llama-3-VILA1.5-8B
Text Generation β’ Updated β’ 748 β’ 37 -
Efficient-Large-Model/VILA1.5-40b
Text Generation β’ Updated β’ 173 β’ 17 -
Efficient-Large-Model/VILA1.5-3b
Text Generation β’ Updated β’ 7.9k β’ 30 -
Efficient-Large-Model/VILA1.5-3b-AWQ
Text Generation β’ Updated β’ 57 β’ 5