amenur (Aramis)

upvoted an article 9 months ago

Article

Welcome Llama 4 Maverick & Scout on Hugging Face

+5

Apr 5

•

146

upvoted 4 articles 10 months ago

Article

Open R1: Update #3

Mar 11

•

296

Article

SmolVLM2: Bringing Video Understanding to Every Device

+5

Feb 20

•

320

Article

Open R1: Update #2

Feb 10

•

218

Article

SigLIP 2: A better multilingual vision language encoder

+1

Feb 21

•

193

upvoted 4 articles 11 months ago

Article

Open-source DeepResearch – Freeing our search agents

+3

Feb 4

•

1.31k

Article

Introducing smolagents: simple agents that write actions in code.

+1

Dec 31, 2024

•

1.16k

Article

Open-R1: Update #1

Feb 2

•

305

Article

Open-R1: a fully open reproduction of DeepSeek-R1

+1

Jan 28

•

887

upvoted an article 12 months ago

Article

Superposition in Transformers: A Novel Way of Building Mixture of Experts

Jan 4

•

13

upvoted a paper about 1 year ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 376

upvoted a collection about 1 year ago

Scaling Test-Time Compute with Open Models

Collection

Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated Jan 6 • 27

upvoted a paper about 1 year ago

MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages

Paper • 2410.01036 • Published Oct 1, 2024 • 15

upvoted an article over 1 year ago

Article

Llama can now see and run on your device - welcome Llama 3.2

+5

Sep 25, 2024

•

191

upvoted a collection over 1 year ago

Moshi v0.1 Release

Collection

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 16 items • Updated 6 days ago • 242

upvoted 4 articles over 1 year ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

+4

Sep 18, 2024

•

272

Article

Scaling robotics datasets with video encoding

+1

Aug 27, 2024

•

40

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Jul 29, 2024

•

365

Article

Memory-efficient Diffusion Transformers with Quanto and Diffusers

Jul 30, 2024

•

68

upvoted a paper over 1 year ago

TextGrad: Automatic "Differentiation" via Text

Paper • 2406.07496 • Published Jun 11, 2024 • 31

Aramis

AI & ML interests

Organizations

Welcome Llama 4 Maverick & Scout on Hugging Face

Open R1: Update #3

SmolVLM2: Bringing Video Understanding to Every Device

Open R1: Update #2

SigLIP 2: A better multilingual vision language encoder

Open-source DeepResearch – Freeing our search agents

Introducing smolagents: simple agents that write actions in code.

Open-R1: Update #1

Open-R1: a fully open reproduction of DeepSeek-R1

Superposition in Transformers: A Novel Way of Building Mixture of Experts

Qwen2.5 Technical Report

Scaling Test-Time Compute with Open Models

MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages

Llama can now see and run on your device - welcome Llama 3.2

Moshi v0.1 Release

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Scaling robotics datasets with video encoding

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Memory-efficient Diffusion Transformers with Quanto and Diffusers

TextGrad: Automatic "Differentiation" via Text

Aramis

AI & ML interests

Organizations

amenur's activity

Welcome Llama 4 Maverick & Scout on Hugging Face

Open R1: Update #3

SmolVLM2: Bringing Video Understanding to Every Device

Open R1: Update #2

SigLIP 2: A better multilingual vision language encoder

Open-source DeepResearch – Freeing our search agents

Introducing smolagents: simple agents that write actions in code.

Open-R1: Update #1

Open-R1: a fully open reproduction of DeepSeek-R1

Superposition in Transformers: A Novel Way of Building Mixture of Experts

Llama can now see and run on your device - welcome Llama 3.2

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Scaling robotics datasets with video encoding

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Memory-efficient Diffusion Transformers with Quanto and Diffusers