view post Post 2338 I've run the open llm leaderboard evaluations + hellaswag on deepseek-ai/DeepSeek-R1-Distill-Llama-8B and compared to meta-llama/Llama-3.1-8B-Instruct and at first glance R1 do not beat Llama overall.If anyone wants to double check the results are posted here: https://github.com/csabakecskemeti/lm_eval_resultsAm I made some mistake, or (at least this distilled version) not as good/better than the competition?I'll run the same on the Qwen 7B distilled version too. See translation 7 replies · 👀 6 6 + Reply
Visual Language Models Collection Collection of OpenVINO optimized models for visual-language assistance • 9 items • Updated Jan 27 • 3
view post Post 1907 great blogpost! 🔥@wolfram https://huggingface.co/blog/wolfram/llm-comparison-test-2024-12-04 See translation 🔥 4 4 👍 1 1 + Reply