SRFT - a Yuqian-Fu Collection

Yuqian-Fu 's Collections

SRFT

SRFT

updated 1 day ago

SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning

Paper • 2506.19767 • Published 2 days ago • 12
Yuqian-Fu/SRFT

Text Generation • Updated about 10 hours ago • 2
Elliott/Openr1-Math-46k-8192

Viewer • Updated Apr 23 • 45.8k • 406 • 1