Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
2
1
Toby Simonds
TamasSimonds
Follow
AI & ML interests
None yet
Recent Activity
updated
a model
29 days ago
TamasSimonds/llama3.1-8b-kp-1k-self-play-step-336-sys-prompt
published
a model
29 days ago
TamasSimonds/llama3.1-8b-kp-1k-self-play-step-336-sys-prompt
published
a dataset
about 1 month ago
TamasSimonds/olympiad-proof-problems
View all activity
Organizations
None yet
Papers
4
arxiv:
2504.19394
arxiv:
2503.00735
arxiv:
2412.04645
arxiv:
2410.07490
models
7
Sort: Recently updated
TamasSimonds/llama3.1-8b-kp-1k-self-play-step-336-sys-prompt
8B
•
Updated
29 days ago
•
7
TamasSimonds/spiral-qwen2-5-3b-base-KP-1k-self-play-1-1-step-192
3B
•
Updated
Jul 12
•
5
TamasSimonds/spiral-qwen3-8b-base-KP-1k-self-play-1-1-step-192
8B
•
Updated
Jul 12
•
5
TamasSimonds/spiral-llama-3B-base-KP-1k-self-play-1-1-step-192
3B
•
Updated
Jul 12
•
6
TamasSimonds/Qwen3-4B-KP-no-sys-prompt-1k-self-play-1-1-step-192
4B
•
Updated
Jul 12
•
5
TamasSimonds/spiral-qwen3-4b-base-KP-1k-self-play-1.1_0707T15-09-49
4B
•
Updated
Jul 8
•
3
TamasSimonds/O1-Llama-3.2-3B
3B
•
Updated
Nov 28, 2024
•
4
datasets
5
Sort: Recently updated
TamasSimonds/olympiad-proof-problems
Viewer
•
Updated
Aug 17
•
39.8k
•
85
TamasSimonds/poker_safety_realignment
Viewer
•
Updated
Aug 15
•
70
•
47
TamasSimonds/imo-dataset
Viewer
•
Updated
Aug 9
•
370
•
23
TamasSimonds/TextbooksToRLQuestions-100k
Viewer
•
Updated
Mar 25
•
108k
•
35
•
5
TamasSimonds/ReasonSet
Viewer
•
Updated
Nov 28, 2024
•
1.78k
•
33