Zhang Chen's picture

6 76

Zhang Chen

zhangchen1991

·

https://e0397123.github.io/

e0397123

AI & ML interests

dialogue systems

Recent Activity

liked a dataset about 21 hours ago

miromind-ai/MiroRL-GenQA

liked a dataset 1 day ago

Tevatron/browsecomp-plus

liked a dataset 3 days ago

jinzhuoran/RAG-RewardBench

View all activity

Organizations

upvoted a collection 2 months ago

xLAM-2

A family of Large Action Model for multi-turn conversation and tool-use • 10 items • Updated 16 days ago • 20

upvoted an article 6 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

By

and 2 others •

Jan 28

• 877

upvoted a collection 6 months ago

Deepseek Papers

Deepseek papers collection • 24 items • Updated 10 days ago • 268

upvoted 2 papers 10 months ago

TS-Align: A Teacher-Student Collaborative Framework for Scalable Iterative Finetuning of Large Language Models

Paper • 2405.20215 • Published May 30, 2024 • 1

Aligning Language Models Using Follow-up Likelihood as Reward Signal

Paper • 2409.13948 • Published Sep 20, 2024 • 1

upvoted a collection over 1 year ago

Handbook v0.1 models and datasets

Models and datasets for v0.1 of the alignment handbook • 6 items • Updated Nov 10, 2023 • 24