vegeta03/DeepHermes-Egregore-v2-RLAIF-8b-Atropos-Q8_0-GGUF Reinforcement Learning • 8B • Updated May 9 • 3