2 2 21

Zhicheng Wang

Dicer

https://blog.dicer.fun

Dicer-Zz

AI & ML interests

NLP

Recent Activity

liked a model 6 days ago

Qwen/Qwen3-Embedding-0.6B

liked a model about 2 months ago

thenlper/gte-large-zh

updated a model 6 months ago

Dicer/ppo-Huggy

View all activity

Organizations

liked a model 6 days ago

Qwen/Qwen3-Embedding-0.6B

Feature Extraction • 0.6B • Updated Jun 20 • 3.57M • • 496

liked a model about 2 months ago

thenlper/gte-large-zh

updated a model 6 months ago

Dicer/ppo-Huggy

Reinforcement Learning • Updated Feb 25 • 13

published a model 6 months ago

Dicer/ppo-Huggy

Reinforcement Learning • Updated Feb 25 • 13

updated a model 6 months ago

Dicer/ppo-LunarLander-v2

Reinforcement Learning • Updated Feb 25 • 1

published a model 6 months ago

Dicer/ppo-LunarLander-v2

Reinforcement Learning • Updated Feb 25 • 1

upvoted 2 articles 6 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 209

Article

Vision Language Models Explained

and 1 other •

Apr 11, 2024

• 437

liked 5 datasets 10 months ago

liked a model 11 months ago

XLabs-AI/flux-controlnet-collections

Text-to-Image • Updated Aug 30, 2024 • 13.1k • 516

liked a Space over 1 year ago

6.25k

MTEB Leaderboard

🥇

Embedding Leaderboard

liked a model over 1 year ago

openbmb/MiniCPM-2B-sft-fp32

Text Generation • Updated Sep 7, 2024 • 1.31k • 295

liked a dataset over 1 year ago

bigscience/P3

Viewer • Updated Mar 4, 2024 • 122M • 6.4k • 229

liked a model over 1 year ago

mistralai/Mistral-7B-Instruct-v0.2

Text Generation • 7B • Updated 28 days ago • 681k • • 2.92k

liked a dataset almost 2 years ago

Muennighoff/natural-instructions

Viewer • Updated Dec 23, 2022 • 7.15M • 2.41k • 68

liked a model over 2 years ago

databricks/dolly-v2-12b

Text Generation • Updated Jun 30, 2023 • 6.54k • 1.96k

Zhicheng Wang

AI & ML interests

Recent Activity

Organizations

Dicer's activity

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Vision Language Models Explained

MTEB Leaderboard