I've run the Open LLM Leaderboard evaluations plus HellaSwag on deepseek-ai/DeepSeek-R1-Distill-Llama-8B and compared the results to meta-llama/Llama-3.1-8B-Instruct, and at first glance R1 does not beat Llama overall.

If anyone wants to double-check, the results are posted here: https://github.com/csabakecskemeti/lm_eval_results

Did I make a mistake somewhere, or is this distilled version really not as good as (or better than) the competition? I'll run the same evaluations on the Qwen 7B distilled version too.
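For anyone who wants to reproduce the comparison, here is a minimal sketch using EleutherAI's lm-evaluation-harness (the `lm_eval` package). The exact task list and settings used for the posted results are not given here, so the `leaderboard` task group, the dtype, and the batch size below are assumptions.

```python
# Minimal sketch with lm-evaluation-harness (lm_eval >= 0.4).
# Task selection and model settings are assumptions, not the exact
# configuration behind the posted results.
import lm_eval

MODELS = [
    "deepseek-ai/DeepSeek-R1-Distill-Llama-8B",
    "meta-llama/Llama-3.1-8B-Instruct",
]

for model_id in MODELS:
    results = lm_eval.simple_evaluate(
        model="hf",
        model_args=f"pretrained={model_id},dtype=bfloat16",
        tasks=["leaderboard", "hellaswag"],  # assumed task selection
        batch_size="auto",
    )
    # Print per-task aggregate metrics for a side-by-side comparison.
    for task, metrics in results["results"].items():
        print(model_id, task, metrics)
```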
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers (Paper, arXiv:2408.06195, published Aug 12, 2024)