Song
songhan
AI & ML interests
efficient AI computing
Recent Activity
upvoted
a
paper
about 2 months ago
Scaling RL to Long Videos
authored
a paper
7 months ago
LServe: Efficient Long-sequence LLM Serving with Unified Sparse
Attention
upvoted
a
paper
9 months ago
NVILA: Efficient Frontier Visual Language Models