2xQwen2.5-Coder-3B-Minister-Main

This model is a fine-tuned version of Qwen/Qwen2.5-Coder-3B, trained with Multi-LLM Group Relative Policy Optimization (MAGRPO) on the HumanEval dataset.
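As a minimal sketch of how the checkpoint might be used, the snippet below loads it with the Hugging Face transformers library and greedily completes a HumanEval-style function stub. It assumes the repository ships a tokenizer and loads as a standard causal language model. The helper names `build_prompt` and `complete` are hypothetical, and the prompt layout is an illustration, not necessarily the format used during training.

```python
MODEL_ID = "LovelyBuggies/2xQwen2.5-Coder-3B-Minister-Main"


def build_prompt(signature: str, docstring: str) -> str:
    """Format a HumanEval-style completion prompt (hypothetical helper)."""
    return f'{signature}\n    """{docstring}"""\n'


def complete(prompt: str, max_new_tokens: int = 128) -> str:
    """Greedy-decode a completion; downloads the weights on first call."""
    # Imported lazily so build_prompt works without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Assumption: the repo contains both tokenizer files and full model weights.
    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)
    model.eval()

    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        output_ids = model.generate(
            **inputs, max_new_tokens=max_new_tokens, do_sample=False
        )
    # Drop the prompt tokens and decode only the newly generated ones.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `complete(build_prompt("def add(a, b):", "Return the sum of a and b."))` would then return the model's proposed function body. Since the base is a non-instruct model, a plain code prefix is likely a better fit than a chat template.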

Format: Safetensors
Model size: 3.09B params
Tensor type: F32

Model tree for LovelyBuggies/2xQwen2.5-Coder-3B-Minister-Main

Base model: Qwen/Qwen2.5-3B
Finetuned (40): this model
Quantizations: 1 model