Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2504.02495

Deepseek Papers

Deepseek papers collection

DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior

Paper • 2310.16818 • Published Oct 25, 2023 • 32
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Paper • 2401.02954 • Published Jan 5, 2024 • 46
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Paper • 2401.06066 • Published Jan 11, 2024 • 53
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence

Paper • 2401.14196 • Published Jan 25, 2024 • 61

📝 Cool LLM papers

Starting from 2024-11-15

Inference-Time Scaling for Generalist Reward Modeling

Paper • 2504.02495 • Published 3 days ago • 23
Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14 • 109
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 218
WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training

Paper • 2501.18511 • Published Jan 30 • 19

about 14 hours ago

Inference-Time Scaling for Generalist Reward Modeling

Paper • 2504.02495 • Published 3 days ago • 23

Inference-Time Scaling for Generalist Reward Modeling

Paper • 2504.02495 • Published 3 days ago • 23

about 22 hours ago

Competitive Programming with Large Reasoning Models

Paper • 2502.06807 • Published Feb 3 • 69
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 148
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published Feb 10 • 60
LIMO: Less is More for Reasoning

Paper • 2502.03387 • Published Feb 5 • 60

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs