Scott Geng

scottgeng00

AI & ML interests

None yet

Recent Activity

updated a dataset 2 days ago

scottgeng00/amc_22-24

published a dataset 2 days ago

scottgeng00/amc_22-24

upvoted a paper 4 months ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Organizations

Collections 1

Papers 1

arxiv:2406.05184

models 0

None public yet

datasets 13

Collections 1

scottgeng00/nabirds_no-label-merge_llava-1.6-7b-m_hface

scottgeng00/nabirds_no-label-merge_cot-expert_llava-1.6-7b-m_hface

datasets 13

scottgeng00/amc_22-24

scottgeng00/nabirds_zs_qwen3b_synth

scottgeng00/nabirds_custom_split

scottgeng00/nabirds_no-label-merge_cot_qwen-2.5-vl-3b

scottgeng00/nabirds_no-label-merge_cot-expert_qwen-2.5-vl-3b

scottgeng00/dpo_model_ladder

scottgeng00/olmo2_dpo_model_ladder_v2

scottgeng00/olmo2_dpo_model_ladder

scottgeng00/olmo-2-1124-7b-preference-mix-model-ladder

scottgeng00/nabirds_no-label-merge_cot-expert_llava-1.6-7b-m_hface
