These datasets are used to evaluate models on French performance using: https://github.com/EleutherAI/lm-evaluation-harness (from CroissantLLM paper)
Manuel Faysse
manu
AI & ML interests
NLP, Privacy, multi-modal DL
Recent Activity
upvoted
an
article
2 days ago
ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases
authored
a paper
about 1 month ago
EuroLLM-9B: Technical Report
authored
a paper
about 1 month ago
ModernVBERT: Towards Smaller Visual Document Retrievers