Hugging Face
Peidong Wang
WDong
1 follower · 1 following
AI & ML interests
None yet
Recent Activity
upvoted a collection (1 day ago): AceReason
liked a dataset (about 2 months ago): open-r1/OpenR1-Math-220k
upvoted a paper (3 months ago): TeleAntiFraud-28k: A Audio-Text Slow-Thinking Dataset for Telecom Fraud Detection
WDong's models (23)
WDong/dpo_0625_iter2_after_dpo_0.6 • Updated Jun 28, 2024 • 69
WDong/sft_06221544_policy2 • Updated Jun 28, 2024 • 5
WDong/sft_0626_after_2_dpo_9 • Updated Jun 28, 2024 • 22
WDong/sft_0622_policy2 • Updated Jun 28, 2024 • 64
WDong/dpo_06230018_policy2_0.6 • Updated Jun 28, 2024 • 4
WDong/dpo_06230018_policy2_0.01 • Updated Jun 28, 2024 • 3
WDong/dpo_06221544_policy2 • Updated Jun 28, 2024 • 3
WDong/dpo_0622_policy2 • Updated Jun 28, 2024 • 3
WDong/dpo_0621 • Updated Jun 28, 2024 • 3
WDong/Qwen2-7B-Instruct-dpo-06230018-policy2-0.6 • Text Generation • Updated Jun 24, 2024 • 12
WDong/lora_06072000 • Updated Jun 8, 2024 • 4
WDong/7B_lora_06051615 • Updated Jun 8, 2024 • 7
WDong/Qwen1.5-7B-sft-0506_9_8 • Text Generation • Updated May 7, 2024 • 18
WDong/Qwen1.5-7B-sft-0506_7_7 • Text Generation • Updated May 6, 2024 • 9
WDong/Qwen1.5-7B-sft-0502 • Text Generation • Updated May 2, 2024 • 10
WDong/7B-0428 • Text Generation • Updated Apr 28, 2024 • 9
WDong/Qwen1.5-7B-SFT-0425 • Updated Apr 25, 2024
WDong/qwen1.5-1.8B-seed-sft • Text Generation • Updated Apr 22, 2024 • 16
WDong/CartPole • Reinforcement Learning • Updated Mar 18, 2024
WDong/dqn-SpaceInvadersNoFrameskip-v4 • Reinforcement Learning • Updated Mar 13, 2024 • 4
WDong/Taxi-v3 • Reinforcement Learning • Updated Mar 13, 2024
WDong/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated Mar 13, 2024
WDong/ppo-LunarLander-v2 • Reinforcement Learning • Updated Mar 10, 2024 • 2