ligeng-dev

community

AI & ML interests

None defined yet.

Recent Activity

Ligeng-Zhu updated a model about 10 hours ago

ligeng-dev/tw-data-train_final_v2_nb2_mt8192_replaced_fix-8node-resume

Ligeng-Zhu published a model about 10 hours ago

ligeng-dev/tw-data-train_classified-8node-resume

Ligeng-Zhu published a model about 10 hours ago

ligeng-dev/tw-data-train_final_replaced_from_classified-fix-format-8node-resume

View all activity

updated a model about 10 hours ago

ligeng-dev/tw-data-train_final_v2_nb2_mt8192_replaced_fix-8node-resume

Text Generation • 8B • Updated about 10 hours ago

published 3 models about 10 hours ago

ligeng-dev/tw-data-train_classified-8node-resume

Text Generation • 8B • Updated about 10 hours ago

ligeng-dev/tw-data-train_final_replaced_from_classified-fix-format-8node-resume

Text Generation • 8B • Updated about 11 hours ago

ligeng-dev/tw-data-train_final_v2_nb2_mt8192_replaced_fix-8node-resume

Text Generation • 8B • Updated about 10 hours ago

updated a model about 10 hours ago

ligeng-dev/tw-data-train_classified-8node-resume

Text Generation • 8B • Updated about 10 hours ago

updated a model about 11 hours ago

ligeng-dev/tw-data-train_final_replaced_from_classified-fix-format-8node-resume

Text Generation • 8B • Updated about 11 hours ago

updated a model 2 days ago

ligeng-dev/q3-8b-train_final_v2_nb2_mt8192_replaced_fix

Text Generation • 8B • Updated 2 days ago • 159

published a model 2 days ago

ligeng-dev/q3-8b-train_final_v2_nb2_mt8192_replaced_fix

Text Generation • 8B • Updated 2 days ago • 159

published a model 13 days ago

ligeng-dev/Q3-8B-131072-sft-8x-complete

8B • Updated 13 days ago • 414

updated 2 models 13 days ago

ligeng-dev/Q3-8B-131072-sft-8x-complete

8B • Updated 13 days ago • 414

ligeng-dev/Q3-8B-131072-sft-1x-20260331_091938

Text Generation • 8B • Updated 13 days ago • 895

published a model 13 days ago

ligeng-dev/Q3-8B-131072-sft-1x-20260331_091938

Text Generation • 8B • Updated 13 days ago • 895

updated a dataset 23 days ago

ligeng-dev/devholder

Updated 23 days ago • 30

published a dataset 2 months ago

ligeng-dev/devholder

Updated 23 days ago • 30

updated a Space about 1 year ago

Travel Journey

Browse AI-generated images in a gallery

published a Space about 1 year ago

Travel Journey

Browse AI-generated images in a gallery

updated a dataset about 1 year ago

ligeng-dev/Cambrian

Updated Feb 19, 2025 • 61

authored 3 papers over 1 year ago

LongVILA: Scaling Long-Context Visual Language Models for Long Videos

Paper • 2408.10188 • Published Aug 19, 2024 • 52

VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

Paper • 2409.04429 • Published Sep 6, 2024

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers

Paper • 2410.10629 • Published Oct 14, 2024 • 13