ldwang

ldwang

AI & ML interests

LLM, VL, Infra

Recent Activity

updated a collection 1 day ago
MiscBlogs
updated a model 1 day ago
BAAI/Aquila-135M
updated a collection 2 days ago
MiscModels
View all activity

Organizations

Beijing Academy of Artificial Intelligence's profile picture PetiteTech's profile picture

ldwang's activity

New activity in O1-OPEN/OpenO1-SFT-Ultra 24 days ago

Quality tagging

#3 opened 24 days ago by
ldwang
New activity in yys/OpenOrca-Chinese 26 days ago

Other Chinese datasets

#2 opened 26 days ago by
ldwang
New activity in Tristan/dclm-fasttext-oh-eli5-410m about 1 month ago

About training details

#1 opened about 1 month ago by
ldwang
New activity in Weyaxi/leaderboard-results-to-modelcard 2 months ago

evaluation

2
#18 opened 2 months ago by
ldwang
New activity in Qwen/Qwen2.5-7B 2 months ago

evaluation

#4 opened 2 months ago by
ldwang
New activity in Qwen/Qwen2.5-0.5B-Instruct 2 months ago

Evaluation

#4 opened 2 months ago by
ldwang
New activity in tencent/Tencent-Hunyuan-Large 2 months ago

How to evaluate models

#10 opened 2 months ago by
ldwang
New activity in HuggingFaceTB/SmolLM2-1.7B-Instruct 2 months ago
New activity in HuggingFaceTB/SmolLM2-1.7B-Instruct 3 months ago

About evaluation

1
#7 opened 3 months ago by
ldwang
New activity in HuggingFaceTB/SmolLM2-135M 3 months ago
New activity in openbmb/MiniCPM3-4B 3 months ago

About cooldown

#31 opened 3 months ago by
ldwang
New activity in Zyphra/Zyda-2 3 months ago

About annealing process

1
#117 opened 3 months ago by
ldwang
New activity in TRI-ML/DCLM-1B-v0 3 months ago
New activity in BAAI/CCI3-HQ 3 months ago
New activity in BAAI/CCI3-Data 3 months ago
New activity in BAAI/CCI3-HQ-Classifier 3 months ago
New activity in opencsg/chinese-fineweb-edu-v2 3 months ago

About classifier

#3 opened 3 months ago by
ldwang