mradermacher/Tifa-DeepsexV2-7b-MGRPO-safetensors-GGUF • Reinforcement Learning • Updated Mar 2 • 340 • 1
mradermacher/Tifa-DeepsexV2-7b-MGRPO-safetensors-i1-GGUF • Reinforcement Learning • Updated Mar 2 • 266
NousResearch/DeepHermes-Egregore-v1-RLAIF-8b-Atropos • Reinforcement Learning • Updated Apr 29 • 31 • 2
NousResearch/DeepHermes-Egregore-v2-RLAIF-8b-Atropos • Reinforcement Learning • Updated Apr 29 • 40 • 4
NousResearch/DeepHermes-Egregore-v2-RLAIF-8b-Atropos-GGUF • Reinforcement Learning • Updated May 5 • 7 • 1