Mostafa Elhoushi
melhoushi
AI & ML interests
Make ML faster, smaller, smarter.
Recent Activity
authored
a paper
about 22 hours ago
CHAI: Clustered Head Attention for Efficient LLM Inference
authored
a paper
about 22 hours ago
Characterizing and Efficiently Accelerating Multimodal Generation Model
Inference
authored
a paper
about 22 hours ago
Guiding Giants: Lightweight Controllers for Weighted Activation Steering
in LLMs