wang

zhaokai

gklab

AI & ML interests

None yet

Recent Activity

upvoted a collection about 1 month ago

DeepSeek-V3.1

Collection

4 items • Updated 3 days ago • 234

liked a model 3 months ago

Menlo/Jan-nano

Text Generation • 4B • Updated Jul 4 • 2.97k • 486

liked a model 5 months ago

Qwen/Qwen3-30B-A3B

Text Generation • 31B • Updated Jul 26 • 265k • 789

liked a dataset 7 months ago

Congliu/Chinese-DeepSeek-R1-Distill-data-110k

Viewer • Updated Feb 21 • 110k • 992 • 695

liked a Space 7 months ago

3.25k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked 2 models 8 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • 33B • Updated Feb 24 • 2.99M • 1.45k

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 507k • 12.7k

liked a model 9 months ago

deepseek-ai/DeepSeek-V3-Base

685B • Updated Mar 27 • 9.83k • 1.67k

upvoted a collection about 1 year ago

Qwen2-VL

Collection

Vision-language model series based on Qwen2 • 16 items • Updated Jul 21 • 225

liked 3 models about 1 year ago

liked a model over 1 year ago

openbmb/MiniCPM-Llama3-V-2_5

Image-Text-to-Text • 9B • Updated Jan 15 • 79.6k • 1.4k

liked a Space over 1 year ago

1.08k

FineWeb: decanting the web for the finest text data at scale

🍷

Generate high-quality text data for LLMs using FineWeb

liked a dataset over 1 year ago

Skywork/SkyPile-150B

Viewer • Updated Dec 7, 2023 • 1.76M • 6.22k • 388

liked a model almost 2 years ago

SkunkworksAI/phi-2

Text Generation • 3B • Updated Dec 13, 2023 • 184 • 132

liked 2 models about 2 years ago

huggyllama/llama-30b

Text Generation • 33B • Updated Apr 7, 2023 • 2.27k • 48

huggyllama/llama-65b

Text Generation • 65B • Updated Apr 7, 2023 • 6.05k • 77