NousResearch/DeepHermes-ToolCalling-Specialist-Atropos Reinforcement Learning • 8B • Updated Apr 28 • 919 • 10
NousResearch/DeepHermes-Egregore-v2-RLAIF-8b-Atropos-GGUF Reinforcement Learning • 8B • Updated May 5 • 23 • 2
VaidikML0508/Shark-Tank-Offer-Evaluator-llama3.2-3B-Instruct-SFT-DPO-4bits-V1 Text Generation • 3B • Updated Apr 22 • 5
NousResearch/DeepHermes-Egregore-v1-RLAIF-8b-Atropos Reinforcement Learning • 8B • Updated Apr 29 • 6 • 2