Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Spiral RL

community
https://github.com/spiral-rl/spiral
spiral-rl
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

Benjamin-eecs  new activity about 2 months ago
spiral-rl/Spiral-Kuhn-Poker-Qwen3-32B-SFT:feat(enhance dataset card): add metadata, expanded intro, and sample usage
Benjamin-eecs  new activity about 2 months ago
spiral-rl/Spiral-Qwen3-4B:feat(improve model card): add pipeline tag, library name, quickstart, and expanded details
Benjamin-eecs  new activity about 2 months ago
spiral-rl/Spiral-DeepSeek-R1-Distill-Qwen-7B:feat: add pipeline tag, library name, and sample usage
View all activity

Leon Guertler's profile picture Bo Liu's profile picture Simon Yu's profile picture Zichen's profile picture

spiral-rl 's collections 1

SPIRAL
  • spiral-rl/Spiral-Qwen3-4B

    Text Generation • 4B • Updated Jul 5 • 229 • 4
  • spiral-rl/Spiral-DeepSeek-R1-Distill-Qwen-7B

    Text Generation • 8B • Updated Jul 5 • 23 • 2
  • spiral-rl/Spiral-Kuhn-Poker-Qwen3-32B-SFT

    Viewer • Updated Jul 5 • 25.5k • 50
  • SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

    Paper • 2506.24119 • Published Jun 30 • 48
SPIRAL
  • spiral-rl/Spiral-Qwen3-4B

    Text Generation • 4B • Updated Jul 5 • 229 • 4
  • spiral-rl/Spiral-DeepSeek-R1-Distill-Qwen-7B

    Text Generation • 8B • Updated Jul 5 • 23 • 2
  • spiral-rl/Spiral-Kuhn-Poker-Qwen3-32B-SFT

    Viewer • Updated Jul 5 • 25.5k • 50
  • SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

    Paper • 2506.24119 • Published Jun 30 • 48
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs