Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
lightonai
's Collections
ColBERT-Zero 🐶
LateOn-Code 💻
OriOn
LightOnOCR-2 🦉
PyLate 🐕
LightOnOCR 🦉
Embeddings datasets ⚡️
ModernBERT
PAGnol 🇫🇷
RITA 🧿
Mamba 🐍
ArabicWeb24-ablation-models
Embeddings datasets ⚡️
updated
1 day ago
This collection gather datasets for embeddings pre-training and fine-tuning.
Upvote
4
lightonai/embeddings-pre-training
Viewer
•
Updated
Jan 5
•
1.38B
•
1.7k
•
17
lightonai/nanobeir-multilingual
Viewer
•
Updated
Sep 16, 2025
•
522k
•
485
•
11
lightonai/embeddings_supervised
Viewer
•
Updated
Oct 23, 2025
•
3.43M
•
207
•
10
Upvote
4
Share collection
View history
Collection guide
Browse collections