Behrooz Azarkhalili's picture

51 486

Behrooz Azarkhalili

ermiaazarkhalili

·

AI & ML interests

LLMs, VLMs, PEFT, RL for LLMs and VLMs.

Recent Activity

upvoted an article 1 day ago

How I Built 7 Custom Gradio Components in Just 12 Days!

liked a model 2 days ago

facebook/dinov2-large

liked a dataset 6 days ago

Sp1786/multiclass-sentiment-analysis-dataset

View all activity

Organizations

upvoted an article 1 day ago

Article

How I Built 7 Custom Gradio Components in Just 12 Days!

By

•

2 days ago

• 5

upvoted an article 7 days ago

Article

Vision Language Model Alignment in TRL ⚡️

By

and 4 others •

7 days ago

• 44

upvoted a collection 13 days ago

Qwen3-MegaScience

Qwen3-MegaScience • 5 items • Updated 22 days ago • 3

upvoted a paper 13 days ago

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Paper • 2507.16812 • Published 23 days ago • 60

upvoted an article 15 days ago

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

By

and 4 others •

16 days ago

• 152

upvoted a collection about 1 month ago

Kimi-K2

Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intellegence • 2 items • Updated Jul 12 • 115

upvoted an article about 1 month ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

By

•

Feb 11

• 58

upvoted a collection about 2 months ago

Qwen3

84 items • Updated 8 days ago • 1.09k

upvoted a paper about 2 months ago

Understanding R1-Zero-Like Training: A Critical Perspective

Paper • 2503.20783 • Published Mar 26 • 57

upvoted an article about 2 months ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

By

and 3 others •

Dec 9, 2022

• 316

upvoted an article 4 months ago

Article

Multi-Label Classification Model From Scratch: Step-by-Step Tutorial

By

•

Jan 8, 2024

• 46

upvoted an article 5 months ago

Article

Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳

By

•

Aug 25, 2023

• 34

upvoted an article 7 months ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

By

and 2 others •

Jan 23

• 182

upvoted an article 9 months ago

Article

Introducing GGUF-my-LoRA

By

•

Nov 1, 2024

• 20

upvoted a collection 9 months ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5 • 281

upvoted 2 collections 10 months ago

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 40 items • Updated Jun 23 • 119

Quantized Qwen2.5

9 items • Updated Dec 9, 2024 • 4

upvoted an article 10 months ago

Article

Visually Multilingual: Introducing mcdse-2b

By

•

Oct 27, 2024

• 41

upvoted a collection 10 months ago

PEFT papers

A collection of methods that have been implemented in the 🤗 PEFT library • 12 items • Updated Jan 30, 2024 • 29

upvoted a paper 11 months ago

DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos

Paper • 2409.02095 • Published Sep 3, 2024 • 37