Aaron Chibb

aari1995

AI & ML interests

Multilinguality and German LLMs

Posts 3

Post

mLLM - first release:
orca_dpo_pairs by Intel (translated into 7 languages)

ARABIC CHINESE FRENCH GERMAN RUSSIAN SPANISH TURKISH

Upcoming:
- more datasets
- cleaning steps
- a blogpost
- stay updated at https://hf.co/multilingual

multilingual/orca_dpo_pairs
Post
Looking at the tokenizer and the naming (“_en”), Google Gemma is very likely to have a multilingual counterpart. 👀

Thoughts?