VaidikML0508/Shark-Tank-Offer-Evaluator-llama3.2-3B-Instruct-GRPO-16bits-V1 Text Generation • Updated Apr 22 • 12
mradermacher/Shark-Tank-Offer-Evaluator-llama3.2-3B-Instruct-GRPO-16bits-V1-GGUF Updated Apr 23 • 120
alfredcs/torchrun-gemma-3-12b-grpo-firstaid-merged Image-Text-to-Text • Updated about 19 hours ago • 33