David Golchinfar's picture

David Golchinfar PRO

DavidGF

·

https://vago-solutions.ai

AI & ML interests

finetune llms, improve german language understanding and generated text of llms

Recent Activity

liked a model 1 day ago

DeepMount00/Ita-Search

liked a model 7 days ago

LiquidAI/LFM2-1.2B

View all activity

Organizations

upvoted a collection 2 months ago

Performance LLMs - Fine tuned

39 items • Updated Jun 6 • 6

upvoted 2 collections 3 months ago

Qwen3

72 items • Updated Jun 15 • 861

Llama 4

Llama 4 release • 13 items • Updated Apr 29 • 576

upvoted 2 articles 4 months ago

Article

Training and Finetuning Reranker Models with Sentence Transformers v4

By

•

Mar 26

• 145

Article

Open R1: Update #3

By

and 9 others •

Mar 11

• 295

upvoted 3 articles 6 months ago

Article

DeepSeek R1 on how to build conscious AGI

By

•

Jan 24

• 6

Article

Train 400x faster Static Embedding Models with Sentence Transformers

By

•

Jan 15

• 197

Article

The Large Language Model Course

By

•

Jan 16

• 196

upvoted an article 7 months ago

Article

Intelligence Potentiation: An Evolutionary Perspective on AI Agent Designs

By

•

Dec 19, 2024

• 4

upvoted an article 8 months ago

Article

SauerkrautLM's Multi-Phase Spectrum Training: A Technical Deep Dive

By

•

Nov 9, 2024

• 9

upvoted a collection 8 months ago

🇫🇷 Calme-3

Here you can find all the new Calme-3 models • 27 items • Updated Feb 9 • 16

upvoted a paper 10 months ago

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7, 2024 • 14

upvoted an article 12 months ago

Article

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

By

and 3 others •

Jul 31, 2024

• 59

upvoted a collection 12 months ago

VAGO solutions quants

Quantized version for the excellent german speaking models created by VAGO solutions. • 6 items • Updated Apr 20, 2024 • 2

upvoted 2 collections about 1 year ago

Qwen2

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Apr 28 • 368

📀 Dataset comparison models

1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated Jun 12, 2024 • 39

upvoted 2 articles about 1 year ago

Article

LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!)

By

•

Apr 24, 2024

• 63

Article

Fine-tune Llama 3 with ORPO

By

•

Apr 22, 2024

• 237

upvoted a collection about 1 year ago

Meta Llama 3

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 809

upvoted a collection over 1 year ago

🇩🇪German SFT and DPO datasets

Datasets that can be used for LLM training with axolotl, trl or llama_factory. • 33 items • Updated Jan 23 • 12