Mangosteen, a 47 billion-token Thai corpus built with a Thai-adapted pipeline, improves language model performance on Thai benchmarks.
Wannaphong Phatthiyaphaibun PRO
wannaphong
AI & ML interests
None yet
Recent Activity
updated
a model
about 3 hours ago
wannaphong/KhanomTanLLM2-ThaiLLM-8B
published
a model
about 4 hours ago
wannaphong/KhanomTanLLM2-ThaiLLM-8B
updated
a Space
about 17 hours ago
pythainlp/api