Scott Geng
scottgeng00
AI & ML interests
None yet
Recent Activity
Upvoted a paper 11 days ago: Reinforcement Learning for Reasoning in Large Language Models with One Training Example
Upvoted a paper 11 days ago: ReasonIR: Training Retrievers for Reasoning Tasks
Updated a dataset 25 days ago: scottgeng00/nabirds_zs_qwen3b_synth
Organizations
Collections: 1
Papers: 1
Models: 0 (none public yet)
Datasets: 12
scottgeng00/nabirds_zs_qwen3b_synth • 16
scottgeng00/nabirds_custom_split • 54.1k • 77
scottgeng00/nabirds_no-label-merge_cot_qwen-2.5-vl-3b • 22.9k • 29
scottgeng00/nabirds_no-label-merge_cot-expert_qwen-2.5-vl-3b • 22.2k • 35
scottgeng00/dpo_model_ladder • 265k • 139
scottgeng00/olmo2_dpo_model_ladder_v2 • 367k • 68
scottgeng00/olmo2_dpo_model_ladder • 367k • 90
scottgeng00/olmo-2-1124-7b-preference-mix-model-ladder • 4
scottgeng00/nabirds_no-label-merge_cot-expert_llava-1.6-7b-m_hface • 18.5k • 46
scottgeng00/nabirds_no-label-merge_llava-1.6-7b-m_hface • 18k • 30