Ligeng Zhu

Ligeng-Zhu

AI & ML interests

None yet

Recent Activity

published a model about 2 hours ago

Efficient-Large-Model/nvila-internal-33b-video-v1

liked a dataset 2 days ago

HuggingFaceM4/FineVision

updated a model 4 days ago

Efficient-Large-Model/NVILA-Lite-15B-hf-0904

authored 6 papers 8 months ago

LongVILA: Scaling Long-Context Visual Language Models for Long Videos

Paper • 2408.10188 • Published Aug 19, 2024 • 53

VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

Paper • 2409.04429 • Published Sep 6, 2024

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers

Paper • 2410.10629 • Published Oct 14, 2024 • 12

COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training

Paper • 2410.19313 • Published Oct 25, 2024 • 19

TinyTL: Reduce Activations, Not Trainable Parameters for Efficient On-Device Learning

Paper • 2007.11622 • Published Jul 22, 2020

NVILA: Efficient Frontier Visual Language Models

Paper • 2412.04468 • Published Dec 5, 2024 • 60

authored 2 papers about 1 year ago

Wolf: Captioning Everything with a World Summarization Framework

Paper • 2407.18908 • Published Jul 26, 2024 • 33

$VILA^2$: VILA Augmented VILA

Paper • 2407.17453 • Published Jul 24, 2024 • 42

authored a paper over 1 year ago

HAT: Hardware-Aware Transformers for Efficient Natural Language Processing

Paper • 2005.14187 • Published May 28, 2020 • 2

authored 5 papers almost 2 years ago

On-Device Training Under 256KB Memory

Paper • 2206.15472 • Published Jun 30, 2022

Deep Leakage from Gradients

Paper • 1906.08935 • Published Jun 21, 2019

ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware

Paper • 1812.00332 • Published Dec 2, 2018

Sparsely Aggregated Convolutional Networks

Paper • 1801.05895 • Published Jan 18, 2018

PockEngine: Sparse and Efficient Fine-tuning in a Pocket

Paper • 2310.17752 • Published Oct 26, 2023 • 14