Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
CEIA Reinforcement Learning
university
Activity Feed
Follow
6
AI & ML interests
None defined yet.
Recent Activity
luanagbmartins
Â
updated
a model
about 7 hours ago
CEIA-RL/qwen3-4b-dw-lr-hf-dpo
luanagbmartins
Â
updated
a model
4 days ago
CEIA-RL/qwen3-4b-dw-lr-dpo-offline
luanagbmartins
Â
published
a model
5 days ago
CEIA-RL/qwen3-4b-dw-lr-dpo-offline
View all activity
Team members
5
spaces
1
pinned
Running
LLMasJudgeEval
🥇
models
2
Sort:Â Recently updated
CEIA-RL/qwen3-4b-dw-lr-hf-dpo
Text Generation
•
4B
•
Updated
about 7 hours ago
•
1.02k
CEIA-RL/qwen3-4b-dw-lr-dpo-offline
Text Generation
•
4B
•
Updated
4 days ago
•
547
datasets
8
Sort:Â Recently updated
CEIA-RL/Safety-Questions-Energy
Viewer
•
Updated
5 days ago
•
4.89k
•
25
CEIA-RL/synth_regulacao_eng_qa_v0
Viewer
•
Updated
11 days ago
•
2.32k
•
29
CEIA-RL/QA-Energy
Viewer
•
Updated
11 days ago
•
43
•
38
CEIA-RL/Nemotron-SFT-Safety-pt-BR-Cleaned
Viewer
•
Updated
11 days ago
•
45.1k
•
57
CEIA-RL/hh-rlhf-harmless-base-pt-BR
Viewer
•
Updated
12 days ago
•
44.8k
•
35
CEIA-RL/energy_prompts
Viewer
•
Updated
Feb 27
•
1.56M
•
127
CEIA-RL/judge_results
Viewer
•
Updated
Oct 3, 2024
•
10
•
6
CEIA-RL/judge_requests
Viewer
•
Updated
Sep 27, 2024
•
10
•
6