Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
43
470
Behrooz Azarkhalili
ermiaazarkhalili
Follow
Hamed324's profile picture
osanseviero's profile picture
Q-bert's profile picture
13 followers
·
38 following
b_azarkhalili
behroozazarkhalili
behroozazarkhalili
AI & ML interests
LLMs, VLMS, PEFT, RL for LLMs and VLMS.
Recent Activity
upvoted
a
paper
about 17 hours ago
Understanding R1-Zero-Like Training: A Critical Perspective
upvoted
an
article
2 days ago
Illustrating Reinforcement Learning from Human Feedback (RLHF)
liked
a model
10 days ago
microsoft/Phi-4-mini-instruct
View all activity
Organizations
models
6
Sort: Recently updated
ermiaazarkhalili/mistral-7b-instruct-v0.3-grpo-GSM8K
Text Generation
•
Updated
14 days ago
•
5
ermiaazarkhalili/llama-3.2-1b-instruct_grpo-GSM8K
Text Generation
•
Updated
14 days ago
•
5
ermiaazarkhalili/llama-3.2-3b-instruct_grpo-GSM8K
Text Generation
•
Updated
14 days ago
•
5
ermiaazarkhalili/qwen2.5-7b-instruct-trl-sft-ChartQA
Updated
18 days ago
ermiaazarkhalili/qwen2.5-7b-instruct-trl-sft-ChartQA-oop
Updated
19 days ago
ermiaazarkhalili/qwen2-7b-instruct-trl-sft-ChartQA
Updated
Feb 3
datasets
0
None public yet