CEIA Reinforcement Learning

university

Recent Activity

luanagbmartins updated a model about 7 hours ago

CEIA-RL/qwen3-4b-dw-lr-hf-dpo

luanagbmartins updated a model 4 days ago

CEIA-RL/qwen3-4b-dw-lr-dpo-offline

luanagbmartins published a model 5 days ago

CEIA-RL/qwen3-4b-dw-lr-dpo-offline

Spaces (1)

LLMasJudgeEval

Models (2)

CEIA-RL/qwen3-4b-dw-lr-hf-dpo

Text Generation • 4B • Updated about 7 hours ago • 1.02k

CEIA-RL/qwen3-4b-dw-lr-dpo-offline

Text Generation • 4B • Updated 4 days ago • 547

Datasets (8)

CEIA-RL/Safety-Questions-Energy

Viewer • Updated 5 days ago • 4.89k • 25

CEIA-RL/synth_regulacao_eng_qa_v0

Viewer • Updated 11 days ago • 2.32k • 29

CEIA-RL/QA-Energy

Viewer • Updated 11 days ago • 43 • 38

CEIA-RL/Nemotron-SFT-Safety-pt-BR-Cleaned

Viewer • Updated 11 days ago • 45.1k • 57

CEIA-RL/hh-rlhf-harmless-base-pt-BR

Viewer • Updated 12 days ago • 44.8k • 35

CEIA-RL/energy_prompts

Viewer • Updated Feb 27 • 1.56M • 127

CEIA-RL/judge_results

Viewer • Updated Oct 3, 2024 • 10 • 6

CEIA-RL/judge_requests

Viewer • Updated Sep 27, 2024 • 10 • 6