16 6 201

Sourab Mangrulkar

smangrul

https://www.linkedin.com/in/sourab-m/

pacman100

AI & ML interests

Machine Learning, Deep Learning, Natural Language Processing, Natural Language Generation, Computer Vision, Reinforcement Learning

Recent Activity

liked a model 3 days ago

intfloat/multilingual-e5-large

liked a model 5 days ago

Qwen/Qwen3-32B

liked a model 14 days ago

openai/gpt-oss-20b

View all activity

Organizations

published an article almost 2 years ago

Article

GaLore: Advancing Large Model Training on Consumer-grade Hardware

Mar 20, 2024

•

published an article almost 2 years ago

Article

🤗 PEFT welcomes new merging methods

Feb 19, 2024

•

published an article about 2 years ago

Article

Mixture of Experts Explained

Dec 11, 2023

•

1.01k

published an article about 2 years ago

Article

Personal Copilot: Train Your Own Coding Assistant

Oct 27, 2023

•

published an article over 2 years ago

Article

Fine-tuning Llama 2 70B using PyTorch FSDP

Sep 13, 2023

•

published an article over 2 years ago

Article

The Falcon has landed in the Hugging Face ecosystem

Jun 5, 2023

•

published an article over 2 years ago

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

May 24, 2023

•

171

published an article almost 3 years ago

Article

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Mar 9, 2023

•

published an article almost 3 years ago

Article

Parameter-Efficient Fine-Tuning using 🤗 PEFT

Feb 10, 2023

•

110

published an article almost 3 years ago

Article

Parameter-Efficient Fine-Tuning using 🤗 PEFT

Feb 10, 2023

•

110

published an article over 3 years ago

Article

Accelerate Large Model Training using DeepSpeed

Jun 28, 2022

•

published an article over 3 years ago

Article

Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel

May 2, 2022

•

published an article over 3 years ago

Article

Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel

May 2, 2022

•

Sourab Mangrulkar

AI & ML interests

Recent Activity

Organizations

smangrul's activity

GaLore: Advancing Large Model Training on Consumer-grade Hardware

🤗 PEFT welcomes new merging methods

Mixture of Experts Explained

Personal Copilot: Train Your Own Coding Assistant

Fine-tuning Llama 2 70B using PyTorch FSDP

The Falcon has landed in the Hugging Face ecosystem

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Parameter-Efficient Fine-Tuning using 🤗 PEFT

Parameter-Efficient Fine-Tuning using 🤗 PEFT

Accelerate Large Model Training using DeepSpeed

Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel

Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel