-
BitNet: Scaling 1-bit Transformers for Large Language Models
Paper • 2310.11453 • Published • 103 -
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Paper • 2404.14219 • Published • 257 -
LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding
Paper • 2404.16710 • Published • 80
Mayor
Eric111
AI & ML interests
None yet
Recent Activity
liked
a model
3 days ago
mistralai/Mistral-Small-3.2-24B-Instruct-2506
liked
a model
3 days ago
nanonets/Nanonets-OCR-s
liked
a model
3 days ago
nvidia/AceReason-Nemotron-1.1-7B
Organizations
None yet