Spiral RL

community

https://github.com/spiral-rl/spiral

spiral-rl

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

simonycl updated a model about 4 hours ago

spiral-rl/qwen3-8b-0708-games-bothink-iter207

simonycl updated a model about 4 hours ago

spiral-rl/qwen3-8b-0708-games-bothink-iter175

simonycl updated a model about 4 hours ago

spiral-rl/qwen3-8b-0708-games-bothink-iter191

View all activity

Collections 1

models 125

datasets 1

spiral-rl/Spiral-Kuhn-Poker-Qwen3-32B-SFT

Viewer • Updated Jul 5, 2025 • 25.5k • 36

Spiral RL

AI & ML interests

Recent Activity

Collections 1

spiral-rl/Spiral-Qwen3-4B

spiral-rl/Spiral-DeepSeek-R1-Distill-Qwen-7B

spiral-rl/Spiral-Kuhn-Poker-Qwen3-32B-SFT

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

spiral-rl/Spiral-Qwen3-4B

spiral-rl/Spiral-DeepSeek-R1-Distill-Qwen-7B

spiral-rl/Spiral-Kuhn-Poker-Qwen3-32B-SFT

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

models 125

spiral-rl/qwen3-8b-0708-games-bothink-iter207

spiral-rl/qwen3-8b-0708-games-bothink-iter175

spiral-rl/qwen3-8b-0708-games-bothink-iter191

spiral-rl/qwen3-8b-0708-games-bothink-iter239

spiral-rl/qwen3-8b-0708-games-bothink-iter223

spiral-rl/qwen3-8b-0708-games-bothink-iter255

spiral-rl/qwen3-8b-0708-games-bothink-iter271

spiral-rl/qwen3-8b-0708-games-bothink-iter287

spiral-rl/qwen3-8b-0708-games-bothink-iter399

spiral-rl/qwen3-8b-0708-games-bothink-iter351

datasets 1

spiral-rl/Spiral-Kuhn-Poker-Qwen3-32B-SFT

AI & ML interests

Recent Activity

Team members 4

Collections 1

models 125 Sort: Recently updated

datasets 1

models 125