Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Jackmin108
's Collections
RL Models
SFT Models
RL Models
updated
28 days ago
RL Models
Upvote
-
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
Text Generation
•
Updated
Feb 24
•
419k
•
•
663
Jackmin108/qwen-7b-rl-step-1
Text Generation
•
Updated
27 days ago
•
33
Jackmin108/qwen-7b-rl-step-2
Text Generation
•
Updated
27 days ago
•
27
Jackmin108/qwen-7b-rl-step-3
Text Generation
•
Updated
27 days ago
•
28
Jackmin108/qwen-7b-rl-step-4
Text Generation
•
Updated
27 days ago
•
27
Jackmin108/qwen-7b-rl-step-8
Text Generation
•
Updated
27 days ago
•
27
Jackmin108/qwen-7b-rl-step-16
Text Generation
•
Updated
27 days ago
•
29
Jackmin108/qwen-7b-rl-step-31
Text Generation
•
Updated
27 days ago
•
36
Jackmin108/qwen-7b-rl-step-32
Text Generation
•
Updated
27 days ago
•
28
Upvote
-
Share collection
View history
Collection guide
Browse collections