Datasets and Models for Advprompter
Simon Yu PRO
simonycl
·
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 hours ago
Optimizing Anytime Reasoning via Budget Relative Policy Optimization
updated
a model
about 4 hours ago
the-acorn-ai/Qwen3-4B-Leon-0521-sft-lora-merged
published
a model
about 4 hours ago
the-acorn-ai/Qwen3-4B-Leon-0521-sft-lora-merged
Organizations
Collections
2
spaces
1
models
283

simonycl/gemma_3_27b_cmv_hard_persuasion_judge_new
Image-Text-to-Text
•
Updated
•
18

simonycl/gemma_3_27b_cmv_hard_persuasion_judge_new_overwrites
Image-Text-to-Text
•
Updated
•
9

simonycl/gemma_3_27b_cmv_hard_persuasion_judge
Updated
•
3

simonycl/cmv_hard_gemma3-12b-it_full_sft
Image-Text-to-Text
•
Updated
•
8

simonycl/temp_file_1
Updated

simonycl/llama3-8b-shp-rm
Updated
•
1

simonycl/qwen-2.5-7b-distill-tic-tac-toe-iter2
Updated

simonycl/qwen-2.5-7b-distill-tic-tac-toe-iter1
Updated

simonycl/qwen-2.5-7b-distill-sft-32b-tic-tac-toe
Updated

simonycl/tic-tac-toe-qwen-distill-7b-iter2
Updated
•
2
datasets
108
simonycl/Anthropic-persuasion-pairs-delta-1-negate
Viewer
•
Updated
•
59
•
27
simonycl/Anthropic-persuasion-pairs-delta-1
Viewer
•
Updated
•
59
•
25
simonycl/SHP_cmv_train
Viewer
•
Updated
•
38.2k
•
12
simonycl/cmv_hard_skywork
Viewer
•
Updated
•
102k
•
11
simonycl/llama-3.3-70b-ultrainteract-filtered
Viewer
•
Updated
•
81k
•
17
simonycl/qwen_2.5_70b_ultrainteract
Viewer
•
Updated
•
81k
•
17
simonycl/llama-3.3-70b-ultrainteract
Viewer
•
Updated
•
162k
•
9
simonycl/Meta-Llama-3-8B-Instruct_ultrafeedback-annotate-judge-mtbench_cot_truth
Viewer
•
Updated
•
6
•
37
simonycl/ultrafeedback_binarized_raw-annotate-judge-mtbench_cot_reason
Viewer
•
Updated
•
61.1k
•
26
simonycl/ultrafeedback_binarized_raw-annotate-judge-mtbench_cot_safe
Viewer
•
Updated
•
61.1k
•
63