Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
2
1
2
Song Huatong
XXsongLALA
Follow
John6666's profile picture
megakey's profile picture
21world's profile picture
3 followers
·
2 following
AI & ML interests
None yet
Recent Activity
authored
a paper
3 days ago
R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
updated
a dataset
12 days ago
XXsongLALA/RAG-RL-Hotpotqa-with-2wiki
updated
a model
12 days ago
XXsongLALA/Qwen-2.5-7B-base-RAG-RL
View all activity
Organizations
Papers
2
arxiv:
2503.05592
arxiv:
2412.17743
models
3
Sort: Recently updated
XXsongLALA/Qwen-2.5-7B-base-RAG-RL
Text Generation
•
Updated
12 days ago
•
89
•
5
XXsongLALA/Llama-3.1-8B-instruct-RAG-RL
Text Generation
•
Updated
12 days ago
•
13
XXsongLALA/llama-7b-hh-sft
Updated
May 5, 2024
datasets
1
XXsongLALA/RAG-RL-Hotpotqa-with-2wiki
Preview
•
Updated
12 days ago
•
36
•
2