Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
argilla 's Collections
Synthetic Data Generator
Datasets built with ⚗️ distilabel
Open Image Generation Models
Argilla v2.0 compatible datasets
Notus 7B v1
Notux 8x7B v1
DIBT Prompt collective SPIN
Preference Datasets for DPO
Preference Datasets for KTO
Domain Specific Data

Datasets built with ⚗️ distilabel

updated Dec 11, 2024

This collection contains some datasets generated and/or labelled using https://github.com/argilla-io/distilabel

Upvote
12

  • Runtime error
    15
    15

    Distilabel Synthetic Data Pipeline Finder

    ⚗

    Find and view synthetic data pipelines on Hugging Face


  • argilla/distilabel-capybara-dpo-7k-binarized

    Viewer • Updated Jul 16, 2024 • 7.56k • 1.62k • 180

  • argilla/distilabel-intel-orca-dpo-pairs

    Viewer • Updated Mar 19 • 12.9k • 1.89k • 173

  • alvarobartt/HelpSteer-AIF

    Viewer • Updated Feb 6, 2024 • 1k • 65 • 6

    Note A subset of 1000 samples from `nvidia/HelpSteer` for helpfulness generated using AIF from OpenAI's GPT-4, following HelpSteer's approach but using AIF over human annotators.


  • davanstrien/haiku_dpo

    Viewer • Updated Mar 13, 2024 • 17.5k • 869 • 47

  • davanstrien/haiku_prompts

    Viewer • Updated Jan 15, 2024 • 4.3k • 82 • 9

  • argilla/magpie-ultra-v0.1

    Viewer • Updated Nov 26, 2024 • 50k • 361 • 221

  • HuggingFaceTB/smoltalk

    Viewer • Updated Feb 10 • 2.2M • 7.2k • 334
Upvote
12
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs