Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
14
1
Tongyao
PRO
tyzhu
Follow
0 followers
·
1 following
tongyao-zhu
AI & ML interests
Natural Language Processing
Recent Activity
updated
a model
less than a minute ago
tyzhu/opcd-0.5b
published
a model
1 minute ago
tyzhu/opcd-0.5b
updated
a model
about 23 hours ago
tyzhu/sft_my_sep1_ds
View all activity
Organizations
None yet
tyzhu
's models
671
Sort: Recently updated
tyzhu/opcd-0.5b
Text Generation
•
0.6B
•
Updated
less than a minute ago
tyzhu/sft_my_sep1_ds
Updated
about 23 hours ago
tyzhu/tinyllama_out_sg_sep26
Updated
3 days ago
tyzhu/alfworld_ppo_qwen_2.5_1.5b
Updated
4 days ago
tyzhu/litgpt_out_sep1_my_ds
Updated
6 days ago
tyzhu/opencoder484
Updated
6 days ago
•
48
tyzhu/ragen_sft_my_sep23
Updated
7 days ago
tyzhu/ppo_qwen2.5_1.5b-sftnew2-step86
Updated
7 days ago
tyzhu/qwen25-0.5b-ckpt-cc-8k
Updated
8 days ago
tyzhu/ppo_qwen2.5_1.5b-sftnew-step210
Updated
8 days ago
tyzhu/mathmedium2-mutualposclip0.5-llama3.2-3b-it-oldreward-4k-corrupt_delete-0.8
Updated
9 days ago
tyzhu/mathmedium2-mutualposclip0.5-llama3.2-3b-it-oldreward-4k-corrupt_delete-0.2
Updated
9 days ago
tyzhu/mathmedium2-mutualposclip0.5-llama3.2-3b-it-oldreward-4k-corrupt-0.8
Updated
9 days ago
tyzhu/mathmedium2-mutualposclip0.5-llama3.2-3b-it-oldreward-4k-corrupt-0.2
Updated
9 days ago
tyzhu/mathmedium2-mutualposclip0.2-llama3.2-3b-it-oldreward-4k-corrupt_delete-0.8
Updated
9 days ago
tyzhu/mathmedium2-mutualposclip0.2-llama3.2-3b-it-oldreward-4k-corrupt_delete-0.2
Updated
9 days ago
tyzhu/mathmedium2-mutualposclip0.0-llama3.2-3b-it-oldreward-4k-corrupt-0.8
Updated
9 days ago
tyzhu/mathmedium2-mutualposclip0.0-llama3.2-3b-it-oldreward-4k-corrupt-0.2
Updated
9 days ago
tyzhu/mathmedium2-mutualposclip-0.0-llama3.2-3b-it-oldreward-4k-corrupt-0.8
Updated
9 days ago
tyzhu/mathmedium2-mutualposclip-0.0-llama3.2-3b-it-oldreward-4k-corrupt-0.2
Updated
10 days ago
tyzhu/mathhard2-mutualposclip0.5-llama3.2-3b-it-oldreward-4k-corrupt-0.8
Updated
10 days ago
tyzhu/mathhard2-mutualposclip-0.0-llama3.2-3b-it-oldreward-4k-corrupt-0.8
Updated
10 days ago
tyzhu/mathhard2-mutualposclip-0.0-llama3.2-3b-it-oldreward-4k-corrupt-0.2
Updated
10 days ago
tyzhu/tinyllama_wandb_
Updated
10 days ago
tyzhu/helmet_output
Updated
10 days ago
tyzhu/tinyllama_wandb
Updated
11 days ago
tyzhu/ruler_old_native_sep26_sg
Updated
11 days ago
tyzhu/fewshot_outputs_sg_sep25_
Updated
11 days ago
tyzhu/sft_verl_agent
Updated
11 days ago
tyzhu/verl-agent-checkpoints-2-sg
Updated
12 days ago
Previous
1
2
3
...
23
Next