This collection contains datasets and models related to "BLEUBERI: BLEU is a surprisingly effective reward for instruction following".
Yapei Chang (yapeichang)
AI & ML interests: NLP
Recent Activity
updated a dataset 1 day ago: yapeichang/WebOrganizer-format-topic-merged-Llama-3.1-405B-FP8
published a dataset 1 day ago: yapeichang/WebOrganizer-format-topic-merged-Llama-3.1-405B-FP8
updated a dataset 22 days ago: yapeichang/hotpotqa-filtered