LovelyBuggies
/

2xQwen2.5-Coder-3B-Minister-Aux

Text Generation

text-generation-inference

Model card Files Files and versions Community

2xQwen2.5-Coder-3B-Minister-Aux

This model is a fine-tuned version of Qwen/Qwen2.5-Coder-3B using Multi-LLM Group Relative Policy Optimization (MAGRPO) on HumanEval dataset.

Downloads last month: 14

Safetensors

Model size

3.09B params

Tensor type

F32

·

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for LovelyBuggies/2xQwen2.5-Coder-3B-Minister-Aux

Base model

Qwen/Qwen2.5-3B

Finetuned

Qwen/Qwen2.5-Coder-3B

Finetuned

(40)

this model

Quantizations

1 model