AmberYifan/Qwen2.5-14B-Instruct-wildfeedback-RPO-iterDPO-iter2-4k Text Generation • 0.0B • Updated 5 days ago • 15 • 1
AmberYifan/Qwen2.5-7B-Instruct-wildfeedback-iterDPO-iter2-4k Text Generation • 0.0B • Updated 5 days ago • 13 • 1
mradermacher/Qwen2.5-7B-Instruct-wildfeedback-iterDPO-iter2-4k-GGUF 8B • Updated 3 days ago • 2.11k • 1
mradermacher/Qwen2.5-14B-Instruct-wildfeedback-RPO-iterDPO-iter2-4k-GGUF 15B • Updated 3 days ago • 1.57k • 1