Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Finnish-NLP 's Collections
Ahma models
Finnish Wav2vec2-xlsr speech recognition
Finnish Whisper speech recognition
Finnish pretrain datasets
Finnish SFT/DPO dataset
Finnish-Fineweb-edu
Finnish LLama models
Instruction tuned models

Finnish pretrain datasets

updated 23 days ago
Upvote
-

  • Finnish-NLP/mc4_fi_cleaned

    Viewer • Updated Oct 21, 2022 • 18.1M • 276 • 4

  • Finnish-NLP/Reddit_fi_2006_2022

    Viewer • Updated Nov 26, 2023 • 4.52M • 79 • 2

  • Finnish-NLP/wikipedia_20230501_fi_cleaned

    Viewer • Updated May 18, 2023 • 411k • 23

  • Finnish-NLP/oscar_2301_fi_cleaned

    Viewer • Updated May 19, 2023 • 5.23M • 195

  • Finnish-NLP/HPLT_1.2_fi_cleaned

    Viewer • Updated Mar 1, 2024 • 5.11M • 73

  • Finnish-NLP/CulturaX_fi_cleaned

    Viewer • Updated Dec 23, 2023 • 28.8M • 70

  • Finnish-NLP/Fineweb2_Finnish_fineweb_edu_predicted

    Viewer • Updated Jun 5 • 33.2M • 182
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs