The collection for the Paper "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"
HKUST NLP Group
university
The collection for the Paper "Pitfalls of Rule- and Model-based Verifiers: A Case Study on Mathematical Reasoning."
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Verifier-1.5B
Reinforcement Learning • 8B • Updated • 9 • 1
hkust-nlp/R1-Distill-Verifier-1.5B
2B • Updated • 9 • 1
hkust-nlp/Qwen-2.5-7B-Verifier-HF
Reinforcement Learning • 8B • Updated • 6
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Qwen-1.5B
Reinforcement Learning • 8B • Updated • 7
Models (62)
hkust-nlp/WebExplorer-8B
Image-Text-to-Text • 8B • Updated • 20 • 5
hkust-nlp/Qwen-2.5-7B-Verifier-general-verifier
Reinforcement Learning • 8B • Updated • 6
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Qwen-1.5B
Reinforcement Learning • 8B • Updated • 7
hkust-nlp/Qwen-2.5-7B-Verifier-HF
Reinforcement Learning • 8B • Updated • 6
hkust-nlp/R1-Distill-Verifier-1.5B
2B • Updated • 9 • 1
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Verifier-1.5B
Reinforcement Learning • 8B • Updated • 9 • 1
hkust-nlp/Laser-DE-L4096-1.5B
2B • Updated • 1.28k
hkust-nlp/Laser-DE-L2048-1.5B
2B • Updated • 6
hkust-nlp/Laser-DE-L1024-1.5B
2B • Updated • 7
hkust-nlp/Laser-D-L4096-1.5B
2B • Updated • 21
Datasets (28)
hkust-nlp/WebExplorer-QA
Viewer • Updated • 100 • 76 • 4
hkust-nlp/CodeIO-PyEdu-Reasoning-Raw
Updated • 47 • 2
hkust-nlp/CodeIO-PyEdu-Reasoning
Preview • Updated • 132 • 53
hkust-nlp/rl-verifier-pitfalls_hacking_data
Viewer • Updated • 6.12k • 19 • 1
hkust-nlp/deepscaler_simplelr
Viewer • Updated • 40.3k • 35
hkust-nlp/Laser-Deepscaler-Dataset
Viewer • Updated • 40.8k • 115
hkust-nlp/LeetCode-O
Preview • Updated • 54
hkust-nlp/GUIMid
Viewer • Updated • 1.85M • 121 • 5
hkust-nlp/SimpleRL-Zoo-Data
Viewer • Updated • 53.1k • 586 • 6
hkust-nlp/PreSelect-100B
Viewer • Updated • 54.5M • 3.36k • 11