cloudyu
/

Yi-34Bx2-MoE-60B-DPO

Text Generation

Mixture of Experts

text-generation-inference

Model card Files Files and versions Community

This is DPO improved version of cloudyu/Yi-34Bx2-MoE-60B
DPO Trainer
metrics not test!

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	25.91
IFEval (0-Shot)	53.19
BBH (3-Shot)	31.26
MATH Lvl 5 (4-Shot)	6.19
GPQA (0-shot)	9.62
MuSR (0-shot)	14.32
MMLU-PRO (5-shot)	40.85

Downloads last month: 11

Safetensors

Model size

60.8B params

Tensor type

BF16

·

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for cloudyu/Yi-34Bx2-MoE-60B-DPO

Quantizations

Evaluation results

strict accuracy on IFEval (0-Shot)
Open LLM Leaderboard

53.190
normalized accuracy on BBH (3-Shot)
Open LLM Leaderboard

31.260
exact match on MATH Lvl 5 (4-Shot)
Open LLM Leaderboard

6.190
acc_norm on GPQA (0-shot)
Open LLM Leaderboard

9.620
acc_norm on MuSR (0-shot)
Open LLM Leaderboard

14.320
accuracy on MMLU-PRO (5-shot)
test set Open LLM Leaderboard

40.850

View on Papers With Code