Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
33
4
Mostafa Elhoushi
melhoushi
Follow
77qi's profile picture
Ram07's profile picture
maharshpatelx's profile picture
34 followers
·
6 following
m_elhoushi
mostafaelhoushi
mostafaelhoushi
AI & ML interests
Make ML faster, smaller, smarter.
Recent Activity
authored
a paper
about 7 hours ago
CHAI: Clustered Head Attention for Efficient LLM Inference
authored
a paper
about 7 hours ago
Characterizing and Efficiently Accelerating Multimodal Generation Model Inference
authored
a paper
about 7 hours ago
Guiding Giants: Lightweight Controllers for Weighted Activation Steering in LLMs
View all activity
Organizations
Articles
1
Article
59
Faster Text Generation with Self-Speculative Decoding
Papers
11
arxiv:
2507.04610
arxiv:
2506.00204
arxiv:
2505.20309
arxiv:
2410.00215
Expand 11 papers
models
23
Sort: Recently updated
melhoushi/layerskip-Llama-2-7B-top_v2
Updated
Mar 17
melhoushi/layerskip-Llama-3.2-1B-top_v2
1B
•
Updated
Mar 17
•
9
melhoushi/layerskip-Llama-2-7b-hf-top_v2
Updated
Mar 16
melhoushi/layerskip-SmolLM2-135M-top_v2
0.1B
•
Updated
Mar 16
•
8
melhoushi/layerskip-huggingface-smollm2-135m-topv1
0.1B
•
Updated
Mar 16
•
9
melhoushi/layerskip-huggingface-smollm-135m-topv1
0.1B
•
Updated
Mar 16
•
5
melhoushi/layerskip-llama3.2-1b-topv1
1B
•
Updated
Mar 16
•
8
melhoushi/layerskip-llama2-7b-topv1
Updated
Mar 16
melhoushi/exitskip-llama2-7b-topv1-v1
7B
•
Updated
Jan 24
•
7
melhoushi/layerskip-llama2-7b-topv1-v4
7B
•
Updated
Jan 23
•
13
View 23 models
datasets
0
None public yet