Kideng

luo-li-ba-suo

AI & ML interests

Reinforcement Learning, Language Model

Organizations

None yet

Recent Activity

liked a dataset about 1 month ago

hfl/ruozhiba_gpt4

Viewer • Updated May 18, 2024 • 4.9k • 203 • 89

liked a model 9 months ago

ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-F16

Reinforcement Learning • 8B • Updated Mar 25, 2025 • 626 • 90

liked a model 11 months ago

unsloth/DeepSeek-R1-BF16

Text Generation • 684B • Updated Apr 19, 2025 • 87 • 24

liked a model 12 months ago

MiniMaxAI/MiniMax-Text-01

Text Generation • 456B • Updated Jul 3, 2025 • 1.55k • 652

upvoted a collection over 1 year ago

Skywork-Reward-Data-Collection

Collection

Open-source preference datasets used to train the Skywork reward model series • 17 items • Updated Oct 12, 2024 • 21

liked a dataset over 1 year ago

NCSOFT/offsetbias

Viewer • Updated Jul 12, 2024 • 8.5k • 48 • 23

liked a Space over 1 year ago

Open LLM Leaderboard

🏆

13.8k

Track, rank and evaluate open LLMs and chatbots

liked a model over 1 year ago

OpenRLHF/Llama-3-8b-rlhf-100k

Text Generation • 8B • Updated Jun 24, 2024 • 11 • 4

liked a dataset over 1 year ago

lmarena-ai/arena-human-preference-55k

Viewer • Updated May 17, 2024 • 57.5k • 611 • 156

New activity in allenai/preference-test-sets over 1 year ago

Doubts about “LMSYS Human MT Bench Pairs” split

#2 opened over 1 year ago by Kideng

liked a dataset almost 2 years ago

argilla/dpo-mix-7k

Viewer • Updated Jul 16, 2024 • 7.5k • 215 • 170

liked a model almost 2 years ago

jondurbin/bagel-34b-v0.4

Text Generation • 34B • Updated Feb 21, 2024 • 9 • 10

liked a dataset almost 2 years ago

berkeley-nest/Nectar

Viewer • Updated Mar 20, 2024 • 183k • 416 • 294

upvoted a paper almost 2 years ago

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 151

liked a Space about 2 years ago

LMArena Leaderboard

🏆

4.7k

Display LMArena Leaderboard

liked a model about 2 years ago

LingxinAI/CharacterGLM-6b

Updated Feb 2, 2024 • 55

liked a dataset about 2 years ago

openbmb/UltraFeedback

Viewer • Updated Dec 29, 2023 • 64k • 1.58k • 395

liked a model about 2 years ago

Qwen/Qwen-72B

Text Generation • 72B • Updated Oct 9, 2024 • 2.3k • 360

liked a Space about 2 years ago

Yi-34B-Chat

🔥

344

liked a dataset about 2 years ago

BAAI/JudgeLM-100K

Preview • Updated Oct 27, 2023 • 84 • 51
