Tokenization Standards for Linguistic Integrity: Turkish as a Benchmark • Paper • 2502.07057 • Published Feb 10
Setting Standards in Turkish NLP: TR-MMLU for Large Language Model Evaluation • Paper • 2501.00593 • Published Dec 31, 2024
MMLU-Pro Leaderboard • More advanced and challenging multi-task evaluation