5 7

Jin

dsjinx

AI & ML interests

None yet

Recent Activity

upvoted an article about 1 month ago

Illustrating Reinforcement Learning from Human Feedback (RLHF)

liked a Space 3 months ago

hesamation/primer-llm-embedding

liked a dataset 4 months ago

qihoo360/Light-R1-SFTData

Organizations

None yet

upvoted an article about 1 month ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

By

and 3 others •

Dec 9, 2022

• 293

upvoted 2 collections 4 months ago

TinyR1

2 items • Updated Apr 21 • 3

Light-R1

Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond • 7 items • Updated Mar 13 • 12

upvoted 2 articles 4 months ago

Article

Open R1: Update #2

By

and 6 others •

Feb 10

• 215

Article

Open-R1: a fully open reproduction of DeepSeek-R1

By

and 2 others •

Jan 28

• 870