Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
UCL-DARK 's Collections
Understanding the Effects of RLHF on LLM Generalisation and
The Goldilocks of Pragmatic Understanding

Understanding the Effects of RLHF on LLM Generalisation and

updated Feb 1, 2024

Datasets and models for the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity: https://arxiv.org/abs/2310.06452

Upvote
-

  • Understanding the Effects of RLHF on LLM Generalisation and Diversity

    Paper • 2310.06452 • Published Oct 10, 2023 • 2

  • UCL-DARK/sequential-instructions

    Viewer • Updated Oct 26, 2023 • 533 • 23 • 3

  • UCL-DARK/alpaca-farm-id-test

    Viewer • Updated Oct 26, 2023 • 1.03k • 82

  • UCL-DARK/openai-tldr-summarisation-preferences

    Viewer • Updated Oct 26, 2023 • 177k • 38 • 1

  • UCL-DARK/openai-tldr-filtered

    Viewer • Updated Oct 26, 2023 • 130k • 25 • 1

  • huggyllama/llama-7b

    Text Generation • Updated Jul 2, 2024 • 171k • 329

  • tatsu-lab/alpaca-7b-wdiff

    Text Generation • Updated May 22, 2023 • 169 • 57

  • tatsu-lab/alpaca-farm-sft10k-wdiff

    Text Generation • Updated May 31, 2023 • 44

  • tatsu-lab/alpaca-farm-ppo-human-wdiff

    Text Generation • Updated Jul 4, 2023 • 54 • 1
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs