Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Finnish-NLP 's Collections
Ahma models
Finnish Wav2vec2-xlsr speech recognition
Finnish Whisper speech recognition
Finnish pretrain datasets
Finnish SFT/DPO dataset
Finnish-Fineweb-edu
Finnish LLama models
Instruction tuned models

Finnish pretrain datasets

updated Dec 31, 2024
Upvote
-

  • Finnish-NLP/mc4_fi_cleaned

    Viewer • Updated Oct 21, 2022 • 18.1M • 227 • 3

  • Finnish-NLP/Reddit_fi_2006_2022

    Viewer • Updated Nov 26, 2023 • 4.52M • 87 • 2

  • Finnish-NLP/wikipedia_20230501_fi_cleaned

    Viewer • Updated May 18, 2023 • 411k • 33

  • Finnish-NLP/oscar_2301_fi_cleaned

    Viewer • Updated May 19, 2023 • 5.23M • 1.79k

  • Finnish-NLP/HPLT_1.2_fi_cleaned

    Viewer • Updated Mar 1, 2024 • 5.11M • 91

  • Finnish-NLP/CulturaX_fi_cleaned

    Viewer • Updated Dec 23, 2023 • 28.8M • 493
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs