Jin

dsjinx

AI & ML interests

None yet

Recent Activity

upvoted an article about 1 month ago

Illustrating Reinforcement Learning from Human Feedback (RLHF)

liked a Space 3 months ago

hesamation/primer-llm-embedding

liked a dataset 4 months ago

qihoo360/Light-R1-SFTData


Organizations

None yet

upvoted an article about 1 month ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

and 3 others • Dec 9, 2022 • 293

liked a Space 3 months ago

LLM Embeddings Explained: A Visual and Intuitive Guide

🚀

How Language Models Turn Text into Meaning, From Traditional

liked a dataset 4 months ago

qihoo360/Light-R1-SFTData

Viewer • Updated Mar 17 • 79.4k • 536 • 51

liked a model 4 months ago

qihoo360/TinyR1-32B-Preview

Text Generation • 33B • Updated Apr 16 • 4k • 328

upvoted 2 collections 4 months ago

TinyR1

Collection

2 items • Updated Apr 21 • 3

Light-R1

Collection

Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond • 7 items • Updated Mar 13 • 12

upvoted an article 4 months ago

Article

Open R1: Update #2

and 6 others • Feb 10 • 215

liked a Space 4 months ago

README

📈

upvoted an article 4 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

and 2 others • Jan 28 • 870

liked a Space 4 months ago

5.97k

MTEB Leaderboard

🥇

Embedding Leaderboard

liked 2 models 5 months ago

bespokelabs/Bespoke-Stratos-32B

Text Generation • 33B • Updated Jan 24 • 1.89k • 43

whyhow-ai/PatientSeek

Question Answering • 8B • Updated Jan 27 • 33 • 71
