Sinisa Stanivuk

Stopwolf

AI & ML interests

Multilingual LLMs, STT and TTS models

Organizations

Posts 1

🇷🇸 New Benchmark for Serbian Language 🇷🇸

@DjMel and I recently released a new benchmark for the Serbian language that measures the general knowledge of LLMs. We parsed over 20 years of entrance exams for the University of Belgrade, so the dataset is of high quality.

🥇 OpenAI models still hold the podium places, with a significant gap over open-source models
🤔 Qwen/Qwen2-7B-Instruct and VAGOsolutions/Llama-3-SauerkrautLM-8b-Instruct show promising results, considering they weren't trained on the Serbian language
📈 The best open-source model seems to be Stopwolf/Mustra-7B-Instruct-v0.2, a merge of gordicaleksa/YugoGPT and mistralai/Mistral-7B-Instruct-v0.2
📉 Some models, like google/gemma-2-9b-it, turned out to be a disappointment, with accuracy close to random guessing

See the full results on the dataset page:
DjMel/oz-eval

P.S. If you have any constructive criticism or ideas for improvement, feel free to use the dataset's Discussions page!

Papers 1

arxiv:2311.11628

Spaces 1

Speech To Speech Translation

Models 16

Stopwolf/embedic-m2v-large

64.2M • Updated Dec 10, 2025 • 2

Stopwolf/wav2vec2-large-mms-1b-por

Automatic Speech Recognition • 1.0B • Updated May 19, 2025 • 33

Stopwolf/distilhubert-gtzan

Audio Classification • 23.7M • Updated Jan 27, 2025 • 4

Stopwolf/whisper-small-sr

Automatic Speech Recognition • 0.2B • Updated Jan 23, 2025 • 2

Stopwolf/whisper-tiny-minds14

Automatic Speech Recognition • 37.8M • Updated Jan 20, 2025 • 3

Stopwolf/wav2vec2-base-960h-finetuned-gtzan

Audio Classification • 94.6M • Updated Nov 8, 2024 • 3

Stopwolf/Mislisa-1.5B-Instruct

Updated Jul 30, 2024

Stopwolf/Perucac-7B-slerp

Text Generation • 7B • Updated Jun 4, 2024 • 8

Stopwolf/Tito-7B-slerp

Text Generation • 7B • Updated Apr 22, 2024 • 94 • 4

Stopwolf/Cerberus-7B-slerp

Text Generation • 7B • Updated Mar 4, 2024 • 90

Datasets 2

Stopwolf/EQ-Bench-Serbian

Viewer • Updated May 22, 2024 • 171 • 24 • 3

Stopwolf/ms-marco-v2.1-sr-500k

Viewer • Updated Aug 15, 2023 • 503k • 14 • 1