Dattu Sharma's picture

Dattu Sharma

imdatta0

·

https://datta0.github.io/

AI & ML interests

Everything ML. Specifically Deep Learning.

Recent Activity

liked a model 2 days ago

unsloth/Kimi-K2-Instruct-GGUF

liked a model 21 days ago

unsloth/gemma-3n-E2B-it

liked a model 21 days ago

unsloth/gemma-3n-E2B

View all activity

Organizations

upvoted a collection 21 days ago

Gemma 3n

Google Gemma 3n models, all versions including Dynamic GGUF, 4-bit, 16-bit and formats! • 10 items • Updated 15 days ago • 14

upvoted 9 collections 4 months ago

Llama 3.3 (All Versions)

Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions. • 3 items • Updated 15 days ago • 36

Qwen 2.5

32 items • Updated 15 days ago • 14

Load 4bit models 4x faster

Native bitsandbytes 4bit pre quantized models • 25 items • Updated 15 days ago • 57

Mistral Small 3 (All Versions)

A collection of Mistral's new Small 3.2 and 3 models including GGUF, 4-bit and more! • 20 items • Updated 15 days ago • 12

Phi-4 (All Versions)

Microsoft's Phi-4 models including Reasoning + Reasoning Plus & mini. Includes Dynamic 2.0 GGUF, 4-bit & 16-bit versions. Includes Unsloth's bug fixes • 20 items • Updated 15 days ago • 71

DeepSeek R1 (All Versions)

DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 37 items • Updated 15 days ago • 250

Deepseek V3 (All Versions)

Deepseek-V3-0324 and V3 - available in original, and Dynamic GGUF formats, with support for 2-8-bit quantized versions. • 7 items • Updated 15 days ago • 38

Gemma 3

All versions of Google's new multimodal models including QAT in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. • 50 items • Updated 15 days ago • 71

Unsloth 4-bit Dynamic Quants

Unsloths Dynamic 4bit Quants selectively skips quantizing certain parameters; greatly improving accuracy while only using <10% more VRAM than BnB 4bit • 28 items • Updated 15 days ago • 83

upvoted a paper almost 2 years ago

Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 242