Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge • By NormalUhr • Feb 7 • 67
Teach Multimodal LLMs to Comprehend Electrocardiographic Images Paper • 2410.19008 • Published Oct 21, 2024 • 24
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report Paper • 2405.00732 • Published Apr 29, 2024 • 120
Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA • May 24, 2023 • 125
Understanding LLMs: A Comprehensive Overview from Training to Inference Paper • 2401.02038 • Published Jan 4, 2024 • 64
Mixture-of-Depths: Dynamically allocating compute in transformer-based language models Paper • 2404.02258 • Published Apr 2, 2024 • 104
Advancing LLM Reasoning Generalists with Preference Trees Paper • 2404.02078 • Published Apr 2, 2024 • 45
Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs Paper • 2403.20041 • Published Mar 29, 2024 • 35
Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction Paper • 2403.18795 • Published Mar 27, 2024 • 20
The case for 4-bit precision: k-bit Inference Scaling Laws Paper • 2212.09720 • Published Dec 19, 2022 • 3
Simple and Scalable Strategies to Continually Pre-train Large Language Models Paper • 2403.08763 • Published Mar 13, 2024 • 50
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training Paper • 2403.09611 • Published Mar 14, 2024 • 126