RLAIF

Team

community

AI & ML interests

None defined yet.

Recent Activity

AngelRaychev updated a dataset about 3 hours ago

RLAIF/dpo_answer_2e-6_openorca_prompts_responses_1e-6_0.02_0.6B_0.6B_with_gold_labels_kl_estimation

AngelRaychev published a dataset about 3 hours ago

RLAIF/dpo_answer_2e-6_openorca_prompts_responses_1e-6_0.02_0.6B_0.6B_with_gold_labels_kl_estimation

AngelRaychev updated a dataset about 3 hours ago

RLAIF/dpo_uf_rejudged_mixed_openorca_with_gold_labels_kl_estimation

View all activity

RLAIF 's collections 2