The collection for the Paper "Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning"
Mingyang Song
Nickyang
·
AI & ML interests
LRMs, Long-Context LLMs, LLM Judges, Many-Shot ICL
Recent Activity
updated
a model
5 days ago
Nickyang/ConciseR-Zero-7B
updated
a model
5 days ago
Nickyang/ConciseR-Zero-7B-Preview
Organizations
None yet
models
5

Nickyang/ConciseR-Zero-7B
Text Generation
•
Updated
•
63
•
1

Nickyang/ConciseR-Zero-7B-Preview
Text Generation
•
Updated
•
35
•
1

Nickyang/FastCuRL-1.5B-V3
Text Generation
•
Updated
•
16
•
2

Nickyang/FastCuRL-1.5B-V2
Text Generation
•
Updated
•
15
•
1

Nickyang/FastCuRL-1.5B-Preview
Text Generation
•
Updated
•
1.05k
•
7