Samples from the ToPXGen-LLaMA-4-Scout English to Xhosa set augmented with intermediate information generated by LLaMA-4-Scout.
AI & ML interests
NLP, Digital Humanities
Recent Activity
View all activity
Papers
Gaperon: A Peppered English-French Generative Language Model Suite
LLM Reasoning for Machine Translation: Synthetic Data Generation over Thinking Tokens
Samples from the ToPXGen-LLaMA-4-Scout English to Xhosa set augmented with intermediate information generated by LLaMA-4-Scout.
Samples from the WMT19 English to Lithuanian set augmented with intermediate information generated by gemma-3-27b-it.
Collections of models trained on the TopXGen dataset.
Our French-English LLM suite (SFT models are coming soon)