Umberto Cappellazzo's picture

5

Umberto Cappellazzo

hisoka94

·

https://umbertocappellazzo.github.io/

AI & ML interests

Continual learning for audio and speech processing. Parameter-efficient transfer learning of pre-trained models. Multimodal Large Language Models.

Organizations

None yet

hisoka94's activity

upvoted 3 papers 9 months ago

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6 • 182

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22 • 126

FiT: Flexible Vision Transformer for Diffusion Model

Paper • 2402.12376 • Published Feb 19 • 48

upvoted a collection 10 months ago

MoEs papers reading list

60 items • Updated 13 days ago • 135

upvoted a paper 11 months ago

SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention

Paper • 2312.07987 • Published Dec 13, 2023 • 40