Models (45)
PRM-CoT/Llama-3.2-3B-Instruct-numina-grpo-prm_advprm-n5-eta200-stepLen256-stepSplit-length-step300
4B
PRM-CoT/Llama-3.2-3B-Instruct-numina-grpo-prm_advprm-n5-eta200-stepLen256-stepSplit-length-step250
4B
PRM-CoT/Llama-3.2-3B-Instruct-numina-grpo-prm_advprm-n5-eta200-stepLen256-stepSplit-length-step400
4B
PRM-CoT/Llama-3.2-3B-Instruct-numina-grpo-prm_advorm-n5-eta200-stepLen256-stepSplit-length-step250
4B
PRM-CoT/Llama-3.2-1B-Instruct-numina-grpo-prm_advorm-n5-eta200-stepLen256-stepSplit-length-step250
1B
PRM-CoT/Llama-3.2-3B-Instruct-numina-grpo-prm_advorm-n5-eta100-stepLen256-stepSplit-nn-step200
4B
PRM-CoT/Llama-3.2-3B-Instruct-numina-grpo-prm_advorm-n5-eta0-stepLen0-stepSplit-nn-step200
4B
PRM-CoT/Llama-3.2-1B-Instruct-numina-grpo-prm_advorm-n5-eta0-stepLen0-stepSplit-nn-step500
1B
PRM-CoT/Llama-3.2-1B-Instruct-numina-grpo-prm_advorm-n5-eta100-stepLen256-stepSplit-nn-step500
1B
PRM-CoT/Llama-3.2-1B-Instruct-numina-grpo-prm_advorm-n5-eta100-stepLen256-stepSplit-nn-step400
1B