Armaghan Shakir

geetu040

10 37 58

AI & ML interests

Vision, Language and Vision-Language Models

Recent Activity

liked a model about 11 hours ago

moonshotai/Kimi-K3

liked a model 1 day ago

sswwoo/SeeandSniff

liked a model 4 days ago

mlabonne/TwinLlama-3.1-8B

View all activity

Organizations

upvoted a collection 4 days ago

📙 LLM Engineer's Handbook

Collection

Models and datasets from my book. All the code is freely available at https://github.com/PacktPublishing/LLM-Engineers-Handbook • 6 items • Updated Apr 7, 2025 • 17

upvoted a collection 5 months ago

Qwen3.5

Collection

21 items • Updated Mar 9 • 1.74k

upvoted an article 8 months ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

lysandre, ArthurZ, cyrilvallez, reach-vb

•

Dec 1, 2025

• 313

upvoted a collection 12 months ago

👁️ LFM2-VL

Collection

LFM2-VL is our first series of vision-language models, designed for on-device deployment. • 10 items • Updated Jun 25 • 66

upvoted 2 articles 12 months ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

reach-vb, pcuenq, lewtun, clem, Rocketknight1, clefourrier, celinah, Wauplin, marcsun13, pagezyhf, ahadnagy, joaogante

•

Aug 5, 2025

• 514

Article

Introducing Command A Vision: Multimodal AI built for Business

CohereLabs

•

Jul 31, 2025

• 64

upvoted an article about 1 year ago

Article

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

ybelkada, timdettmers

•

Aug 17, 2022

• 136

upvoted 2 collections about 1 year ago

🐕Small-Doges

Collection

Doge family of small language models! • 18 items • Updated Apr 21, 2025 • 11

💧 LFM2

Collection

LFM2 is a new generation of hybrid models, designed for on-device deployment. • 28 items • Updated Jun 25 • 155

upvoted an article about 1 year ago

Article

cocogold: training Marigold for text-grounded segmentation

pcuenq

•

Jul 8, 2025

• 31

upvoted a paper about 1 year ago

How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks

Paper • 2507.01955 • Published Jul 2, 2025 • 36

upvoted a collection about 1 year ago

Gemma 3n

Collection

4 items • Updated 7 days ago • 274

upvoted an article about 1 year ago

Article

Introducing smolagents: simple agents that write actions in code.

m-ric, merve, thomwolf

•

Dec 31, 2024

• 1.21k

upvoted 2 papers about 1 year ago

SlimMoE: Structured Compression of Large MoE Models via Expert Slimming and Distillation

Paper • 2506.18349 • Published Jun 23, 2025 • 13

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16, 2025 • 278

upvoted 2 collections about 1 year ago

MiniMax-M1

Collection

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. • 6 items • Updated Apr 15 • 119

V-JEPA 2

Collection

A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13, 2025 • 227

upvoted an article about 1 year ago

Article

Fine-Tuning 1B LLaMA 3.2: A Comprehensive Step-by-Step Guide with Code

ImranzamanML

•

Oct 2, 2024

• 75

upvoted 2 papers about 1 year ago

MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder

Paper • 2505.07916 • Published May 12, 2025 • 138

SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning

Paper • 2504.08600 • Published Apr 11, 2025 • 33

Armaghan Shakir

AI & ML interests

Recent Activity

Organizations

geetu040's activity

Transformers v5: Simple model definitions powering the AI ecosystem

Welcome GPT OSS, the new open-source model family from OpenAI!

Introducing Command A Vision: Multimodal AI built for Business

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

cocogold: training Marigold for text-grounded segmentation

Introducing smolagents: simple agents that write actions in code.

Fine-Tuning 1B LLaMA 3.2: A Comprehensive Step-by-Step Guide with Code