Christoph Holthaus's picture

Christoph Holthaus

choltha

·

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

ByteDance-Seed/Seed-Coder-8B-Reasoning

liked a model 1 day ago

ByteDance-Seed/Seed-Coder-8B-Instruct

liked a model 4 days ago

nvidia/OpenCodeReasoning-Nemotron-32B

View all activity

Organizations

choltha's activity

upvoted a collection 7 days ago

D-FINE

State-of-the-art real-time object detection model with Apache 2.0 licence • 15 items • Updated 8 days ago • 52

upvoted a collection 14 days ago

Qwen3

Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 65 items • Updated about 18 hours ago • 140

upvoted a collection 2 months ago

Phi-4 (All Versions)

Microsoft's Phi-4 models including Reasoning + Reasoning Plus & mini. Includes Dynamic 2.0 GGUF, 4-bit & 16-bit versions. Includes Unsloth's bug fixes • 20 items • Updated 11 days ago • 68

upvoted 6 collections 4 months ago

Qwen 2.5 Coder

Complete collection of Code-specific model series for Qwen2.5 in bnb 4bit, 16bit and GGUF formats. • 35 items • Updated 12 days ago • 28

Llama 3.2 Vision

Meta's Llama 3.2 vision models 11B and 90B. Include 4-bit bnb and original versions. • 8 items • Updated 12 days ago • 7

Deepseek V3 (All Versions)

Deepseek-V3-0324 and V3 - available in original, and Dynamic GGUF formats, with support for 2-8-bit quantized versions. • 7 items • Updated 11 days ago • 37

Llama 3.3 (All Versions)

Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions. • 3 items • Updated 12 days ago • 37

Unsloth 4-bit Dynamic Quants

Unsloths Dynamic 4bit Quants selectively skips quantizing certain parameters; greatly improving accuracy while only using <10% more VRAM than BnB 4bit • 28 items • Updated 12 days ago • 80

DeepSeek R1 (All Versions)

DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 30 items • Updated 12 days ago • 222

upvoted a collection 5 months ago

Falcon3

Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated Feb 13 • 86

upvoted 2 collections 6 months ago

AIMv2

A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Nov 22, 2024 • 77

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 8 days ago • 258

upvoted a collection 11 months ago

Nomic Embed

Open Source Long Context Text Embedders • 8 items • Updated Feb 14, 2024 • 21

upvoted a collection 12 months ago

DiscoLeo 8B: Llama3 for German

Continued Pretraining on Llama3 8B to improve German linguistic capabilities. A collection of base and fine-tuned models and variants. • 5 items • Updated May 25, 2024 • 16

upvoted 3 papers about 1 year ago

Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

Paper • 2404.18796 • Published Apr 29, 2024 • 71

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 616

EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

Paper • 2402.17485 • Published Feb 27, 2024 • 195

upvoted 3 papers over 1 year ago

BiTA: Bi-Directional Tuning for Lossless Acceleration in Large Language Models

Paper • 2401.12522 • Published Jan 23, 2024 • 12

Platypus: Quick, Cheap, and Powerful Refinement of LLMs

Paper • 2308.07317 • Published Aug 14, 2023 • 24

From Sparse to Soft Mixtures of Experts

Paper • 2308.00951 • Published Aug 2, 2023 • 20