- DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
  Paper • 2501.12948 • Published • 405
- LightThinker: Thinking Step-by-Step Compression
  Paper • 2502.15589 • Published • 29
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
  Paper • 2405.04434 • Published • 21
- Model Compression and Efficient Inference for Large Language Models: A Survey
  Paper • 2402.09748 • Published • 1
Nvar Char
zombieofCrypto
AI & ML interests
machine learning to become more zombie-like
Organizations
audio recognition
llm_improvement_research
llm_prompts
audio recognition
timeseriesforecasting