2xQwen2.5-Coder-3B-Minister-Aux

This model is a fine-tuned version of Qwen/Qwen2.5-Coder-3B, trained with Multi-Agent Group Relative Policy Optimization (MAGRPO) on the HumanEval dataset.
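The card gives no training details, so the following is only a minimal sketch of the group-relative advantage step used in standard Group Relative Policy Optimization: each sampled completion's reward is normalized against the other completions drawn for the same prompt. The function name, the `eps` term, the use of the population standard deviation, and the reward values are all assumptions for illustration. How this model's training extends the step to several cooperating models is not described here and is not shown.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each reward against its own sampling group.

    Illustrative only: implementations differ on the choice of
    standard deviation and on whether they divide by it at all.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; an assumption of this sketch
    # eps keeps the division defined when every reward in the group is equal
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical rewards for four completions sampled from one prompt.
example_rewards = [1.0, 0.0, 0.5, 0.5]
advantages = group_relative_advantages(example_rewards)
```

Completions that score above the group mean receive a positive advantage and those below it a negative one, so no separate value network is needed to provide a baseline.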

Format: Safetensors
Model size: 3.09B params
Tensor type: F32

Model tree for LovelyBuggies/2xQwen2.5-Coder-3B-Minister-Aux

Base model: Qwen/Qwen2.5-3B