HKUST NLP Group

university

Recent Activity

lockon updated a dataset 1 minute ago

hkust-nlp/Toolathlon-Trajectories

SivilTaram authored a paper 6 days ago

Diffusion Language Models are Super Data Learners

AndrewZeng authored a paper 13 days ago

MoSLD: An Extremely Parameter-Efficient Mixture-of-Shared LoRAs for Multi-Task Learning

hkust-nlp's collections (11)

Toolathlon

hkust-nlp/Toolathlon-Trajectories

Viewer • Updated 2 minutes ago • 5.82k • 1k • 13
The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published 15 days ago • 44

RL-Verifier-Pitfalls

The collection for the Paper "Pitfalls of Rule- and Model-based Verifiers: A Case Study on Mathematical Reasoning."

hkust-nlp/Qwen-2.5-7B-Verifier-R1-Verifier-1.5B

Reinforcement Learning • 8B • Updated May 28 • 3 • 1
hkust-nlp/R1-Distill-Verifier-1.5B

2B • Updated May 28 • 3 • 1
hkust-nlp/Qwen-2.5-7B-Verifier-HF

Reinforcement Learning • 8B • Updated May 28
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Qwen-1.5B

Reinforcement Learning • 8B • Updated May 28 • 3

SimpleRL-Zoo

The collection for the Paper "SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild"

hkust-nlp/SimpleRL-Zoo-Data

Viewer • Updated Mar 25 • 53.1k • 870 • 8
hkust-nlp/Mistral-Small-24B-SimpleRL-Zoo

24B • Updated Mar 24 • 2
hkust-nlp/Qwen-2.5-Math-7B-SimpleRL-Zoo

8B • Updated Mar 24 • 114
hkust-nlp/Qwen-2.5-14B-SimpleRL-Zoo

15B • Updated Mar 24 • 52

PreSelect

hkust-nlp/preselect-fasttext-classifier

Text Classification • Updated Mar 6 • 98 • 8
hkust-nlp/PreSelect-100B

Viewer • Updated Mar 4 • 54.5M • 679 • 11
Predictive Data Selection: The Data That Predicts Is the Data That Teaches

Paper • 2503.00808 • Published Mar 2 • 56

CodeI/O

Collection for CodeI/O @ https://codei-o.github.io/

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Paper • 2502.07316 • Published Feb 11 • 50
hkust-nlp/CodeIO-PyEdu-Reasoning

Preview • Updated Jun 18 • 97 • 56
hkust-nlp/CodeIO-PyEdu-Reasoning-Raw

Updated Jun 18 • 65 • 2
hkust-nlp/LeetCode-O

Preview • Updated May 6 • 54

🎯DART-Math

Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving [NeurIPS 2024] @ https://github.com/hkust-nlp/dart-math

DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving

Paper • 2407.13690 • Published Jun 18, 2024 • 2
hkust-nlp/dart-math-hard

Viewer • Updated Aug 2, 2024 • 585k • 145 • 14
hkust-nlp/dart-math-dsmath-7b-prop2diff

Text Generation • 7B • Updated Jul 21, 2024 • 8 • 3
hkust-nlp/dart-math-llama3-8b-prop2diff

Text Generation • 8B • Updated Jul 19, 2024 • 64 • 1

WebExplorer

The collection for the Paper "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"

hkust-nlp/WebExplorer-8B

Image-Text-to-Text • 8B • Updated Sep 11 • 697 • 12
hkust-nlp/WebExplorer-QA

Viewer • Updated Sep 9 • 100 • 261 • 5

Laser

The collection for the Paper "Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping"

hkust-nlp/Laser-Deepscaler-Dataset

Viewer • Updated May 21 • 40.8k • 155
hkust-nlp/Laser-L2048-1.5B

2B • Updated May 20 • 5
hkust-nlp/Laser-L4096-1.5B

2B • Updated May 20
hkust-nlp/Laser-L8192-1.5B

2B • Updated May 20 • 1

SimpleRL

The collection for the Project "Simple Reinforcement Learning for Reasoning"

hkust-nlp/Qwen-2.5-Math-7B-SimpleRL-Zero

8B • Updated Feb 23 • 40 • 3
hkust-nlp/Qwen-2.5-Math-7B-SimpleRL

8B • Updated Feb 23 • 6 • 4

M-STAR

Resources of M-STAR (Multimodal Self-Evolving Training for Reasoning) https://mstar-lmm.github.io/

hkust-nlp/mstar-8b-v1.0

9B • Updated Dec 25, 2024 • 1 • 2
hkust-nlp/mstar-prm-8b-v1.0

9B • Updated Dec 25, 2024 • 1 • 2

Deita

hkust-nlp/deita-llama1-13b-v1.0-sft

Text Generation • Updated Dec 29, 2023 • 1
hkust-nlp/deita-complexity-scorer

Text Generation • Updated Jan 1, 2024 • 100 • 14
hkust-nlp/deita-quality-scorer

Text Generation • Updated Dec 29, 2023 • 168 • 18
hkust-nlp/deita-10k-v0

Viewer • Updated Dec 31, 2023 • 10k • 96 • 30

Team members 15