moscowx21/Qwen2.5-1.5B-Instruct-Gensyn-Swarm-giant_pale_ferret • Text Generation • 2B • Updated 4 minutes ago • 1
HectorHe/Deepseek-Coder-V2-Lite-13B-Instruct-sft-s1K • Text Generation • 16B • Updated 1 day ago • 17 • 1
AmberYifan/Qwen2.5-14B-Instruct-ultrafeedback-iterdpo-iter2-RPO • Text Generation • 0.0B • Updated 8 days ago • 15 • 1
AmberYifan/Qwen2.5-14B-Instruct-wildfeedback-RPO-iterDPO-iter1-4k • Text Generation • 0.0B • Updated 7 days ago • 27 • 1
mradermacher/Qwen2.5-14B-Instruct-ultrafeedback-iterdpo-iter2-RPO-GGUF • 15B • Updated 7 days ago • 2.07k • 1
mradermacher/Qwen2.5-14B-Instruct-wildfeedback-RPO-iterDPO-iter1-4k-GGUF • 15B • Updated 6 days ago • 1.85k • 1