-
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
Paper • 2312.11514 • Published • 260 -
Magicoder: Source Code Is All You Need
Paper • 2312.02120 • Published • 82 -
Mixtral of Experts
Paper • 2401.04088 • Published • 159 -
Chain-of-Thought Reasoning Without Prompting
Paper • 2402.10200 • Published • 109
xiepengli
ginobiLi
AI & ML interests
LLM
Recent Activity
liked
a model
2 days ago
mistralai/Mistral-Small-3.2-24B-Instruct-2506
liked
a model
6 days ago
microsoft/VibeVoice-1.5B
liked
a model
7 days ago
Qwen/Qwen3-Next-80B-A3B-Instruct