Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
Roman Teucher
RTT
Follow
0 followers
·
1 following
AI & ML interests
None yet
Recent Activity
updated
a collection
24 days ago
NewPipe
published
a model
5 months ago
RTT/Teuken-7B-instruct-research-v0.4-GRPO
updated
a model
5 months ago
RTT/Qwen2.5-1.5B-Open-R1-GRPO
View all activity
Organizations
models
8
Sort: Recently updated
RTT/Teuken-7B-instruct-research-v0.4-GRPO
Updated
Feb 20
RTT/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
2B
•
Updated
Feb 19
•
5
RTT/ppo-Pyramids
Reinforcement Learning
•
Updated
Mar 18, 2024
•
12
RTT/ppo-SnowballTarget
Reinforcement Learning
•
Updated
Mar 18, 2024
•
18
RTT/taxi_v3
Reinforcement Learning
•
Updated
Oct 25, 2023
RTT/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Jul 28, 2023
RTT/ppo-Huggy
Reinforcement Learning
•
Updated
Jan 31, 2023
•
41
RTT/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Jan 26, 2023
•
3
datasets
0
None public yet