1 14 16

Kartikey Rawat

carrycooldude

AI & ML interests

None yet

Recent Activity

upvoted an article 15 days ago

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

liked a Space about 2 months ago

webml-community/qwen3-webgpu

upvoted an article about 2 months ago

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

View all activity

Organizations

carrycooldude's activity

upvoted an article 15 days ago

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

and 8 others •

16 days ago

• 146

upvoted 4 articles about 2 months ago

Article

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

and 1 other •

Aug 17, 2022

• 94

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

and 4 others •

May 24, 2023

• 153

Article

Making LLMs lighter with AutoGPTQ and transformers

and 5 others •

Aug 23, 2023

• 55

Article

Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳

•

Aug 25, 2023

• 31

upvoted a collection 12 months ago

Instruction Pre-Training

Collection

8 items • Updated Jun 21, 2024 • 26

upvoted a collection about 1 year ago

Nemotron 4 340B

Collection

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 1 day ago • 163

upvoted a paper about 1 year ago

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Paper • 2406.06525 • Published Jun 10, 2024 • 71

upvoted 4 articles about 1 year ago

Article

A Dive into Pretraining Strategies for Vision-Language Models

and 1 other •

Feb 3, 2023

• 69

Article

Vision Language Models Explained

and 1 other •

Apr 11, 2024

• 387

Article

Fine-tune Llama 3 with ORPO

•

Apr 22, 2024

• 237

Article

CodeGemma - an official Google release for code LLMs

and 5 others •

Apr 9, 2024

• 101

upvoted a paper about 1 year ago

Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs

Paper • 2403.20041 • Published Mar 29, 2024 • 35

upvoted a collection over 1 year ago

Fellows Highlights Winter '23 (Dec) ❄️⛄️

Collection

14 items • Updated Dec 27, 2023 • 5