oieieio/Qwen2.5-0.5B-Instruct-GRPO-thinking-function_calling-V0
