ofirzaf
AI & ML interests
Sparsity, Quantization, Model Compression
Recent Activity
Organizations
view article
Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models
view article
Breaking Language Barriers in Mathematical AI: Introducing Hebrew Math Tutor
view article
Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques
upvoted a paper 8 months ago
view article
A Chatbot on your Laptop: Phi-2 on Intel Meteor Lake
upvoted a paper 11 months ago
upvoted a paper about 1 year ago