Add model card and metadata
#2
by
nielsr
HF Staff
- opened
README.md
ADDED
@@ -0,0 +1,12 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
pipeline_tag: text-generation
|
3 |
+
library_name: transformers
|
4 |
+
---
|
5 |
+
|
6 |
+
# DeepResearcher: Scaling Deep Research via Reinforcement Learning in Real-world Environments
|
7 |
+
|
8 |
+
This model, DeepResearcher-7b, is presented in the paper [DeepResearcher: Scaling Deep Research via Reinforcement Learning in Real-world Environments](https://huggingface.co/papers/2504.03160).
|
9 |
+
|
10 |
+
DeepResearcher is the first comprehensive framework for end-to-end training of LLM-based deep research agents through scaling reinforcement learning (RL) in real-world environments with authentic web search interactions. Qualitative analysis reveals emergent cognitive behaviors from end-to-end RL training, including the ability to formulate plans, cross-validate information, engage in self-reflection, and maintain honesty when unable to find definitive answers. Experiments on open-domain research tasks demonstrate substantial improvements over prompt engineering and RAG-based RL agents.
|
11 |
+
|
12 |
+
Code: https://github.com/GAIR-NLP/DeepResearcher
|