Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
1
Lamine Bagayoko
Lam77
Follow
0 followers
·
1 following
AI & ML interests
AI and ML
Recent Activity
upvoted
a
collection
about 1 month ago
Bambara Datasets
reacted
to
ehristoforu
's
post
with 🤗
about 2 months ago
✒️ Ultraset - all-in-one dataset for SFT training in Alpaca format. https://huggingface.co/datasets/fluently-sets/ultraset ❓ Ultraset is a comprehensive dataset for training Large Language Models (LLMs) using the SFT (instruction-based Fine-Tuning) method. This dataset consists of over 785 thousand entries in eight languages, including English, Russian, French, Italian, Spanish, German, Chinese, and Korean. 🤯 Ultraset solves the problem faced by users when selecting an appropriate dataset for LLM training. It combines various types of data required to enhance the model's skills in areas such as text writing and editing, mathematics, coding, biology, medicine, finance, and multilingualism. 🤗 For effective use of the dataset, it is recommended to utilize only the "instruction," "input," and "output" columns and train the model for 1-3 epochs. The dataset does not include DPO or Instruct data, making it suitable for training various types of LLM models. ❇️ Ultraset is an excellent tool to improve your language model's skills in diverse knowledge areas.
liked
a model
3 months ago
oza75/whisper-bambara-asr-002
View all activity
Organizations
None yet
Lam77
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
upvoted
a
collection
about 1 month ago
Bambara Datasets
Collection
3 items
•
Updated
Nov 17, 2024
•
2