Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
kolerk 's Collections
TON

TON

updated 2 days ago

Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models.

Upvote
1

  • kolerk/TON-3B-AITZ

    Image-Text-to-Text • Updated 2 days ago • 3

  • kolerk/TON-3B-CLEVR

    Image-Text-to-Text • Updated 2 days ago • 3

  • kolerk/TON-3B-Math

    Image-Text-to-Text • Updated 2 days ago • 3

  • kolerk/TON-7B-Math

    Image-Text-to-Text • Updated 2 days ago • 3

  • kolerk/TON-AITZ-SFT

    Preview • Updated 2 days ago • 25

  • kolerk/TON-Math-SFT

    Viewer • Updated 2 days ago • 8.03k • 35

  • Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models

    Paper • 2505.16854 • Published 2 days ago • 9
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs