Flavio Catalani

fakezeta

AI & ML interests

None yet

fakezeta's activity

liked a model about 1 month ago

hexgrad/Kokoro-82M

Text-to-Speech • Updated 15 days ago • 1.83M • 3.92k

updated a model 2 months ago

fakezeta/DeepSeek-R1-Distill-Llama-8B-ov-int8

Text Generation • Updated Jan 31 • 19

published a model 2 months ago

fakezeta/DeepSeek-R1-Distill-Llama-8B-ov-int8

Text Generation • Updated Jan 31 • 19

liked 2 Spaces 2 months ago

What could possibly go wrong?

Think in Sync

An addictive AI-powered word puzzle.

reacted to csabakecskemeti's post with 👀 2 months ago

Post

2338

I've run the Open LLM Leaderboard evaluations + HellaSwag on deepseek-ai/DeepSeek-R1-Distill-Llama-8B and compared them to meta-llama/Llama-3.1-8B-Instruct, and at first glance R1 does not beat Llama overall.

If anyone wants to double-check, the results are posted here:
https://github.com/csabakecskemeti/lm_eval_results

Have I made a mistake, or is R1 (at least this distilled version) not as good as or better than the competition?

I'll run the same on the Qwen 7B distilled version too.

7 replies

upvoted a collection 2 months ago

Visual Language Models

Collection of OpenVINO optimized models for visual-language assistance • 9 items • Updated Jan 27 • 3

liked a Space 4 months ago

Hacker News Listener

Navigate and analyze Hacker News posts and comments.

liked a model 4 months ago

Nexusflow/Athene-V2-Chat

Text Generation • Updated Nov 26, 2024 • 2.59k • 289

reacted to lunarflu's post with 🔥 4 months ago

Post

1907

great blogpost! 🔥@wolfram
https://huggingface.co/blog/wolfram/llm-comparison-test-2024-12-04

liked 2 models 4 months ago

Xkev/Llama-3.2V-11B-cot

Image-Text-to-Text • Updated Dec 16, 2024 • 6.22k • 149

kaitchup/Qwen2.5-72B-Instruct-AutoRound-GPTQ-4bit

Text Generation • Updated Nov 26, 2024 • 66 • 6

liked a Space 6 months ago

FacePoke

Import a portrait, click to move the head!

New activity in mistralai/Mistral-Small-Instruct-2409 7 months ago

Please make it CLEAR, this is NOT an OPEN SOURCE MODEL license

#15 opened 7 months ago by

updated 3 models 7 months ago

fakezeta/gemma-2-9b-it-SimPO-ov-int4

Updated Sep 16, 2024 • 8

fakezeta/gemma-2-9b-it-SimPO-ov-int8

Updated Sep 16, 2024 • 8

fakezeta/gemma-2-9b-it-ov-int4

Text Generation • Updated Sep 15, 2024 • 5

updated a collection 7 months ago

Gemma 2

4 items • Updated Sep 15, 2024