Understanding the Effects of RLHF on LLM Generalisation and - a UCL-DARK Collection

UCL-DARK 's Collections

Understanding the Effects of RLHF on LLM Generalisation and

The Goldilocks of Pragmatic Understanding

Understanding the Effects of RLHF on LLM Generalisation and

updated Feb 1, 2024

Datasets and models for the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity: https://arxiv.org/abs/2310.06452

Understanding the Effects of RLHF on LLM Generalisation and Diversity

Paper • 2310.06452 • Published Oct 10, 2023 • 2
UCL-DARK/sequential-instructions

Viewer • Updated Oct 26, 2023 • 533 • 23 • 3
UCL-DARK/alpaca-farm-id-test

Viewer • Updated Oct 26, 2023 • 1.03k • 82
UCL-DARK/openai-tldr-summarisation-preferences

Viewer • Updated Oct 26, 2023 • 177k • 38 • 1
UCL-DARK/openai-tldr-filtered

Viewer • Updated Oct 26, 2023 • 130k • 25 • 1
huggyllama/llama-7b

Text Generation • Updated Jul 2, 2024 • 171k • 329
tatsu-lab/alpaca-7b-wdiff

Text Generation • Updated May 22, 2023 • 169 • 57
tatsu-lab/alpaca-farm-sft10k-wdiff

Text Generation • Updated May 31, 2023 • 44
tatsu-lab/alpaca-farm-ppo-human-wdiff

Text Generation • Updated Jul 4, 2023 • 54 • 1