91 247

Dayan Ruben

dayanruben

https://dayanruben.com

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

mistralai/Devstral-Small-2505_gguf

liked a model 1 day ago

mistralai/Devstral-Small-2507_gguf

liked a model 1 day ago

mistralai/Devstral-Small-2507

Organizations

upvoted a collection 5 days ago

Kimi-K2

Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intelligence • 2 items • Updated 5 days ago • 91

upvoted 2 collections 8 days ago

T5Gemma

32 items • Updated 7 days ago • 55

🧠 SmolLM3

Smol, multilingual, long-context reasoner • 10 items • Updated 6 days ago • 59

upvoted a collection 21 days ago

Gemma 3n

4 items • Updated 7 days ago • 182

upvoted a collection about 2 months ago

Gemma 3n Preview

4 items • Updated 7 days ago • 156

upvoted 3 collections 3 months ago

Mellum

Series of code models by JetBrains • 6 items • Updated 13 days ago • 28

Qwen3

72 items • Updated Jun 15 • 858

Llama 4

Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth! • 15 items • Updated 15 days ago • 46

upvoted an article 3 months ago

Article

Welcome Llama 4 Maverick & Scout on Hugging Face!

By

and 6 others •

Apr 5

• 145

upvoted 2 collections 3 months ago

Llama 4

Llama 4 release • 13 items • Updated Apr 29 • 573

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The models preserve quality similar to half precision while using 3x less memory • 15 items • Updated 7 days ago • 205

upvoted 3 articles 4 months ago

Article

Open R1: How to use OlympicCoder locally for coding?

By

and 4 others •

Mar 20

• 62

Article

Custom Vibe Coding Quest Part 1: The Quest Begins 🧙

Mar 26 • 9

Article

Custom Vibe Coding Quest Part 2: 🚙 Fine-Tuning Gemma 3 for Code Reasoning

Apr 1 • 25

upvoted 4 collections 4 months ago

Qwen2.5-Omni

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based on Qwen2.5 • 7 items • Updated May 21 • 149

TxGemma Release

Collection of open models to accelerate the development of therapeutics. • 5 items • Updated 7 days ago • 60

💫StarVector Models

StarVector is a multimodal LLM for Scalable Vector Graphics (SVG) generation, producing structured SVG code directly from images and text. • 2 items • Updated Mar 20 • 96

Gemma 3 Release

24 items • Updated 7 days ago • 413

upvoted 2 articles 5 months ago

Article

Introducing smolagents: simple agents that write actions in code.

By

and 2 others •

Dec 31, 2024

• 1.08k

Article

Open-source DeepResearch – Freeing our search agents

By

and 4 others •

Feb 4

• 1.27k