Longxu Dou's picture

Longxu Dou

dreamerdeo

·

https://longxudou.github.io/

AI & ML interests

Natural Language Processing

Recent Activity

upvoted a paper about 2 months ago

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

updated a Space about 2 months ago

sailor2/README

upvoted a paper 3 months ago

Reinforcing General Reasoning without Verifiers

View all activity

Organizations

upvoted a paper about 2 months ago

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

Paper • 2506.20512 • Published Jun 25 • 46

updated a Space about 2 months ago

README

upvoted 5 papers 3 months ago

Reinforcing General Reasoning without Verifiers

Paper • 2505.21493 • Published May 27 • 26

Fostering Video Reasoning via Next-Event Prediction

Paper • 2505.22457 • Published May 28 • 29

Lifelong Safety Alignment for Language Models

Paper • 2505.20259 • Published May 26 • 24

Optimizing Anytime Reasoning via Budget Relative Policy Optimization

Paper • 2505.13438 • Published May 19 • 36

Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis

Paper • 2505.13227 • Published May 19 • 46

upvoted an article 4 months ago

Article

Accelerating LLM Inference: Fast Sampling with Gumbel-Max Trick

By

•

Oct 24, 2024

• 12

New activity in sail/Sailor2-20B 4 months ago

Do you have a plan to follow the Qwen 3.0 updates?

#5 opened 4 months ago by

martin-from-beijing

New activity in sail/Sailor2-1B-Pre 4 months ago

Improve language tag

#2 opened 4 months ago by

New activity in sail/Sailor2-1B 4 months ago

Improve language tag

#4 opened 4 months ago by

New activity in sail/Sailor2-8B-Pre 4 months ago

Improve language tag

#2 opened 4 months ago by

New activity in sail/Sailor2-8B 4 months ago

Improve language tag

#3 opened 4 months ago by

New activity in sail/Sailor2-20B 4 months ago

Improve language tag

#4 opened 4 months ago by

updated 2 models 4 months ago

sail/Sailor2-20B

Text Generation • 19B • Updated Apr 29 • 100 • 10

sail/Sailor2-20B-Pre

Text Generation • 19B • Updated Apr 29 • 9

New activity in sail/Sailor2-20B-Pre 4 months ago

Improve language tag

#3 opened 4 months ago by

commented a paper 4 months ago

Kuwain 1.5B: An Arabic SLM via Language Injection

Paper • 2504.15120 • Published Apr 21 • 121 •

upvoted a paper 4 months ago

Could Thinking Multilingually Empower LLM Reasoning?

Paper • 2504.11833 • Published Apr 16 • 29

authored a paper 4 months ago

FlowReasoner: Reinforcing Query-Level Meta-Agents

Paper • 2504.15257 • Published Apr 21 • 47