Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
11
12
24
Shengyi Costa Huang
vwxyzjn
Follow
ezzaldeen's profile picture
joe32140's profile picture
bharatxoni's profile picture
74 followers
·
20 following
http://costa.sh
vwxyzjn
vwxyzjn
AI & ML interests
None yet
Recent Activity
liked
a model
about 1 month ago
deepseek-ai/DeepSeek-R1-0528
updated
a dataset
2 months ago
vwxyzjn/the-algorithm-python
updated
a dataset
2 months ago
vwxyzjn/rlvr_acecoder
View all activity
Organizations
vwxyzjn
's models
393
Sort: Recently updated
vwxyzjn/ppo_async
Updated
Feb 5
•
14
vwxyzjn/ppo_sync
Updated
Feb 5
•
22
vwxyzjn/online_dpo_sync
Updated
Feb 5
•
14
vwxyzjn/online_dpo_async
Updated
Feb 5
•
19
vwxyzjn/rm_zephyr_new
Text Classification
•
7B
•
Updated
Sep 26, 2024
•
12
vwxyzjn/online_dpo_vllm_thread_beta_0.03__allenai_open_instruct_dev
Updated
Sep 11, 2024
vwxyzjn/reward_modeling__EleutherAI_pythia-14m
Updated
Aug 24, 2024
•
7
vwxyzjn/online_dpo_vllm__vwxyzjn_btulu
Updated
Aug 23, 2024
•
9
vwxyzjn/online_dpo_vllm__allenai_llama-3-tulu-2-8b
Updated
Aug 19, 2024
•
4
vwxyzjn/btulu
Text Generation
•
8B
•
Updated
Aug 19, 2024
•
8
vwxyzjn/online_dpo_tulu_2
Text Generation
•
Updated
Aug 19, 2024
•
21
vwxyzjn/gkd-model
Updated
Aug 15, 2024
vwxyzjn/reward_modeling__allenai_llama-3-tulu-2-8b
Updated
Aug 11, 2024
•
25
vwxyzjn/online_dpo__cleanrl_EleutherAI_pythia-1b-deduped__sft__tldr
Updated
Aug 9, 2024
vwxyzjn/online_dpo__EleutherAI_pythia-14m
Updated
Aug 8, 2024
vwxyzjn/online_dpo__EleutherAI_pythia-1b-deduped
Updated
Aug 8, 2024
vwxyzjn/tulu3_7b_llama3
Updated
Aug 7, 2024
•
3
vwxyzjn/tulu3_7b_llama3-10000-max-samples
Updated
Aug 6, 2024
•
71
vwxyzjn/reward_modeling__EleutherAI_pythia-1b-deduped
Updated
Aug 5, 2024
vwxyzjn/EleutherAI_pythia-14m__reward_modeling__tldr
Updated
Aug 5, 2024
vwxyzjn/rejection_sampling_23251
Updated
Aug 4, 2024
vwxyzjn/online_sft_test1
Updated
Jul 25, 2024
vwxyzjn/online_sft_test
Updated
Jul 25, 2024
vwxyzjn/online_dpo_test
Updated
Jul 24, 2024
vwxyzjn/summarize_from_feedback_details
Updated
Jul 19, 2024
vwxyzjn/online_dpo_llmjudge_tldr_6.9b
Text Generation
•
7B
•
Updated
Jul 19, 2024
•
6
vwxyzjn/online_dpo_llmjudge
Text Generation
•
1B
•
Updated
Jul 17, 2024
•
6
vwxyzjn/online_dpo_llmjudge_tldr
Updated
Jul 16, 2024
vwxyzjn/online_dpo_tldr_6.9b
Text Generation
•
7B
•
Updated
Jul 16, 2024
•
6
vwxyzjn/online_dpo_tldr
Text Generation
•
1B
•
Updated
Jul 15, 2024
•
6
Previous
1
2
3
...
14
Next