Marc Sun

marcsun13

AI & ML interests

LLM, Quantization, Training, Inference

Articles

Organizations

marcsun13's activity

upvoted an article about 1 month ago
view article
Article

Fixing Gradient Accumulation

• 40
upvoted 3 articles 2 months ago
view article
Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

• 199
view article
Article

SmolLM - blazingly fast and remarkably powerful

• 262
upvoted an article 3 months ago
view article
Article

XetHub is joining Hugging Face!

• 80
upvoted an article 6 months ago
view article
Article

Benchmarking Text Generation Inference

• 27
upvoted an article 6 months ago
view article
Article

License to Call: Introducing Transformers Agents 2.0

• 116
upvoted an article 7 months ago
view article
Article

Welcome Llama 3 - Meta's new open LLM

• 278
upvoted 9 articles 7 months ago
view article
Article

Vision Language Models Explained

• 214
view article
Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

• 166
view article
Article

Overview of natively supported quantization schemes in 🤗 Transformers

• 10
view article
Article

Making LLMs lighter with AutoGPTQ and transformers

• 33
view article
Article

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

• 63
view article
Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

• 91
view article
Article

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

• 63
view article
Article

quanto: a pytorch quantization toolkit

• 31
view article
Article

GaLore: Advancing Large Model Training on Consumer-grade Hardware

• 25