Hamish Ivison's picture

Hamish Ivison

hamishivi

·

https://ivison.id.au

AI & ML interests

NLP :)

Recent Activity

updated a model about 12 hours ago

hamishivi/swerl_qwen35_9b_base_tmax_10k_grpo_mask_no_submit_10pct_step160

published a model about 12 hours ago

hamishivi/swerl_qwen35_9b_base_tmax_10k_grpo_mask_no_submit_10pct_step160

updated a model 3 days ago

hamishivi/swerl_qwen35_9b_base_tmax_10k_grpo_mask_no_submit__42__1777143486_step_200

View all activity

Organizations

Collections 8

View 8 collections

Papers 14

arxiv:2512.13961

arxiv:2511.19399

arxiv:2511.07317

arxiv:2503.01807

models 272

hamishivi/swerl_qwen35_9b_base_tmax_10k_grpo_mask_no_submit_10pct_step160

9B • Updated about 12 hours ago

hamishivi/swerl_qwen35_9b_base_tmax_10k_grpo_mask_no_submit421777143486_step_200

9B • Updated 3 days ago • 155

hamishivi/swerl_qwen35_9b_base_tmax_10k_grpo_mask_overlong421777163763_step_200

9B • Updated 3 days ago • 155

hamishivi/vip_grpo_base_p32_2403_qwen3_4b_math11774385112_step500

196k • Updated 5 days ago • 14

hamishivi/vip_grpo_base_p32_2403_qwen3_4b_math11774385112_step1000

196k • Updated 5 days ago • 297

hamishivi/qwen3.5_sft

9B • Updated 5 days ago • 165

hamishivi/qwen3.5_sft_w_incompletes

9B • Updated 5 days ago • 297

hamishivi/qwen3.5_tmax_breakdown_test_step100

9B • Updated 6 days ago • 266

hamishivi/swerl_qwen35_9b_base_tmax_10k_grpo421776749915_step400

9B • Updated 8 days ago • 156

hamishivi/swerl_qwen35_9b_base_tmax_10k_grpo421776699606_step100

9B • Updated 8 days ago • 410

View 272 models

datasets 219

hamishivi/dapo-grpo-1000-steps-math-pairs-sae

Viewer • Updated 4 days ago • 200 • 24

hamishivi/dapo-math-pairs-value-sae

Viewer • Updated 6 days ago • 200 • 35

hamishivi/dapo-math-pairs-value

Viewer • Updated 6 days ago • 200 • 40

hamishivi/agent-task-termigen

Viewer • Updated 6 days ago • 3.56k • 36

hamishivi/swerl-tmax-10k-verified

Viewer • Updated 7 days ago • 6.17k • 90

hamishivi/swerl-tmax-10k

Viewer • Updated 7 days ago • 9.46k • 237

hamishivi/agent-task-terminal-traj

Viewer • Updated 7 days ago • 5.65k • 50

hamishivi/agent-task-r2e-gym

Viewer • Updated 7 days ago • 8.1k • 29

hamishivi/agent-task-endless-terminals

Viewer • Updated 7 days ago • 2.49k • 39

hamishivi/agent-task-swe-gym

Viewer • Updated 7 days ago • 407 • 28

View 219 datasets