Model checkpoints generated during an ongoing research effort into the acceleration potential and tuning quality of LLMs with RL fine tuning.
Scott Biggs
ScottBiggs2
·
AI & ML interests
I'm an AI researcher working on scalable generative modeling and reinforcement learning, with recent work in sparse RL acceleration and preference-based optimization. I release models and artifacts related to research, industry collaboration, and experimental exploration.
Recent Activity
liked
a model
40 minutes ago
Qwen/Qwen3-0.6B-Base
updated
a collection
about 1 hour ago
Conversation Classifiers
updated
a collection
about 1 hour ago
Conversation Classifiers