Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Tamazight-NLP 's Collections
Image Classification Models
MT Models
Speech Datasets
Text Datasets
Bitext Datasets
OCR Datasets
Language Models
Encoders/Fill-Mask

Text Datasets

updated Mar 8
Upvote
1

  • allenai/MADLAD-400

    Updated Sep 9, 2024 • 39.1k • 144

  • wikimedia/wikipedia

    Viewer • Updated Jan 9, 2024 • 61.6M • 61.4k • 876

  • collectivat/amazic

    Viewer • Updated Jun 13 • 3.52k • 133 • 10

  • Tamazight-NLP/IRCAM-CORPUS

    Viewer • Updated Feb 24, 2024 • 55 • 68 • 1

  • cis-lmu/GlotCC-V1

    Viewer • Updated Nov 1, 2024 • 1.28B • 994 • 52

  • cis-lmu/Glot500

    Viewer • Updated Jun 17, 2024 • 1.23B • 2.38k • 37

  • Cohere/wikipedia-2023-11-embed-multilingual-v3

    Viewer • Updated Mar 19, 2024 • 247M • 4.46k • 234

  • Cohere/wikipedia-2023-11-embed-multilingual-v3-int8-binary

    Viewer • Updated Mar 21, 2024 • 247M • 596 • 45

  • cis-lmu/glotlid-corpus

    Viewer • Updated Jun 4 • 288M • 122 • 8

  • MaLA-LM/PolyWrite

    Viewer • Updated Sep 27, 2024 • 35.8k • 192 • 4

  • HuggingFaceFW/fineweb-2

    Viewer • Updated 21 days ago • 5.02B • 654k • 595

  • Tamazight/Randomly

    Viewer • Updated Nov 11, 2024 • 2k • 29 • 5

  • TutlaytAI/Kabyle_Text_Corpus

    Viewer • Updated Jan 29 • 280k • 80 • 2
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs