2 78 16

Pham Van Linh

phamvanlinh143

AI & ML interests

OCR, AI, DL

Recent Activity

liked a dataset about 14 hours ago

OleehyO/latex-formulas-80M

liked a dataset about 14 hours ago

OleehyO/latex-formulas

upvoted an article 2 days ago

VLM-OCR Recipes on GPU Infrastructure

View all activity

Organizations

None yet

liked 2 datasets about 14 hours ago

OleehyO/latex-formulas-80M

Viewer • Updated Aug 22, 2025 • 78.2M • 1.49k • 22

OleehyO/latex-formulas

Viewer • Updated Aug 13, 2025 • 1.56M • 990 • 99

upvoted 2 articles 2 days ago

Article

VLM-OCR Recipes on GPU Infrastructure

22 days ago

•

Article

My Journey Into Vision Models

Apr 12, 2025

•

liked a model 4 days ago

datalab-to/chandra

Image-to-Text • 9B • Updated Oct 21, 2025 • 392k • 474

upvoted 3 papers 5 days ago

Reinforcement Learning via Self-Distillation

Paper • 2601.20802 • Published 9 days ago • 36

Efficient Memory Management for Large Language Model Serving with PagedAttention

Paper • 2309.06180 • Published Sep 12, 2023 • 34

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14, 2025 • 138

upvoted an article 5 days ago

Article

Performant local mixture-of-experts CPU inference with GPU acceleration in llama.cpp

8 days ago

•

liked a model 5 days ago

nvidia/NVIDIA-Nemotron-Parse-v1.1

Image-Text-to-Text • Updated 9 days ago • 119k • 139

upvoted an article 11 days ago

Article

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family

18 days ago

•

upvoted 4 articles about 1 month ago

Article

The Optimal Architecture for Small Language Models

Dec 26, 2025

•

115

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

Dec 18, 2025

•

119

Article

Shrinking Giants: The Quantization Mathematics Making LLMs Accessible

May 3, 2025

•

Article

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

Aug 17, 2022

•

123

liked 2 Spaces about 1 month ago

The Smol Training Playbook

📚

2.96k

The secrets to building world-class LLMs

The Ultra-Scale Playbook

🌌

3.67k

The ultimate guide to training LLM on large GPU Clusters

upvoted 3 articles about 2 months ago

Article

Everything You Need to Know about Knowledge Distillation

Mar 6, 2025

•

Article

Mastering Tensor Dimensions in Transformers

Jan 12, 2025

•

134

Article

Understanding BigBird's Block Sparse Attention

Mar 31, 2021

•

Pham Van Linh

AI & ML interests

Recent Activity

Organizations

phamvanlinh143's activity

VLM-OCR Recipes on GPU Infrastructure

My Journey Into Vision Models

Performant local mixture-of-experts CPU inference with GPU acceleration in llama.cpp

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family

The Optimal Architecture for Small Language Models

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

Shrinking Giants: The Quantization Mathematics Making LLMs Accessible

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

The Smol Training Playbook

The Ultra-Scale Playbook

Everything You Need to Know about Knowledge Distillation

Mastering Tensor Dimensions in Transformers

Understanding BigBird's Block Sparse Attention