Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2504.21776

A deep research agent that autonomously searches the web, navigates web pages, and drafts research reports.

lixiaoxi45/WebThinker-QwQ-32B

Updated 2 days ago • 9 • 4
lixiaoxi45/WebThinker-R1-7B

Updated 2 days ago • 7 • 3
lixiaoxi45/WebThinker-R1-14B

Updated 2 days ago • 5 • 2
lixiaoxi45/WebThinker-R1-32B

Updated 2 days ago • 9 • 3

about 13 hours ago

CoRAG: Collaborative Retrieval-Augmented Generation

Paper • 2504.01883 • Published about 1 month ago • 10
VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning

Paper • 2504.08837 • Published 23 days ago • 42
Mavors: Multi-granularity Video Representation for Multimodal Large Language Model

Paper • 2504.10068 • Published 19 days ago • 30
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Paper • 2504.10481 • Published 19 days ago • 84

Papers + RL/Reasoning

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 123
VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks

Paper • 2504.05118 • Published 26 days ago • 25
SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning

Paper • 2504.08600 • Published 22 days ago • 27
A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

Paper • 2504.11343 • Published 18 days ago • 14

To Read collection

interesting papers to read

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model

Paper • 2503.24290 • Published Mar 31 • 63
I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published Mar 24 • 118
START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 111
DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 123

about 5 hours ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 111
ToolRL: Reward is All Tool Learning Needs

Paper • 2504.13958 • Published 17 days ago • 42
OTC: Optimal Tool Calls via Reinforcement Learning

Paper • 2504.14870 • Published 12 days ago • 33
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published 18 days ago • 60

about 14 hours ago

deepseek-ai/Janus-Pro-7B

Any-to-Any • Updated Feb 1 • 98.1k • 3.36k
deepseek-ai/DeepSeek-V3-0324

Text Generation • Updated Mar 27 • 388k • • 2.84k
Running

5.8k

5.8k

DeepSite

🐳

Generate any application with DeepSeek
ydeng9/OpenVLThinker-7B

Image-Text-to-Text • Updated Mar 25 • 192 • 17

FLAME: Factuality-Aware Alignment for Large Language Models

Paper • 2405.01525 • Published May 2, 2024 • 29
DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

Paper • 2405.14333 • Published May 23, 2024 • 41
Transformers Can Do Arithmetic with the Right Embeddings

Paper • 2405.17399 • Published May 27, 2024 • 54
EasyAnimate: A High-Performance Long Video Generation Method based on Transformer Architecture

Paper • 2405.18991 • Published May 29, 2024 • 12

Large Language Model (LLM) and NLP related papers.

LoRA+: Efficient Low Rank Adaptation of Large Models

Paper • 2402.12354 • Published Feb 19, 2024 • 6
The FinBen: An Holistic Financial Benchmark for Large Language Models

Paper • 2402.12659 • Published Feb 20, 2024 • 22
TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization

Paper • 2402.13249 • Published Feb 20, 2024 • 13
TrustLLM: Trustworthiness in Large Language Models

Paper • 2401.05561 • Published Jan 10, 2024 • 70

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs