Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
l3lab
's Collections
L1
miniCTX
L1
updated
19 days ago
L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning
Upvote
4
l3lab/L1-Qwen-1.5B-Max
Updated
19 days ago
•
1.96k
•
9
l3lab/L1-Qwen-1.5B-Exact
Updated
19 days ago
•
2.43k
•
2
Upvote
4
Share collection
View history
Collection guide
Browse collections