Simeon Emanuilov PRO

s-emanuilov

AI & ML interests

Software Engineer & Ph.D. candidate | Specializing in ML/DL system development & applying AI to solve real-world business problems.

Recent Activity

liked a dataset 22 minutes ago

HuggingFaceM4/FineVision

updated a dataset about 8 hours ago

llm-bg/bulgarian-history-complex

published a dataset about 14 hours ago

llm-bg/bulgarian-history-complex

View all activity

Organizations

upvoted a paper 2 days ago

Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth

Paper • 2509.03867 • Published 4 days ago • 182

upvoted a paper 3 days ago

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19 • 39

upvoted a collection 3 days ago

EmbeddingGemma

Collection

3 items • Updated 3 days ago • 58

upvoted an article 3 days ago

Article

Welcome EmbeddingGemma, Google's new efficient embedding model

and 5 others •

4 days ago

• 157

upvoted a collection about 2 months ago

Health AI Developer Foundations (HAI-DEF)

Collection

Groups models released for use in health AI by Google. Read more about HAI-DEF at https://developers.google.com/health-ai-developer-foundations • 15 items • Updated Jul 10 • 103

upvoted a collection 3 months ago

Tucan

Collection

A series of open-source Bulgarian language models fine-tuned for function calling and tool use. 2.6B, 9B, and 27B parameter variants. • 12 items • Updated Jul 1 • 1

upvoted an article 3 months ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

•

Jan 15

• 207

upvoted a paper 5 months ago

CoLLM: A Large Language Model for Composed Image Retrieval

Paper • 2503.19910 • Published Mar 25 • 15

upvoted 2 articles 5 months ago

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

•

May 28, 2024

• 246

Article

Training and Finetuning Reranker Models with Sentence Transformers v4

•

Mar 26

• 161

upvoted a paper 6 months ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14 • 117

upvoted 3 articles 7 months ago

Article

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

and 2 others •

Feb 19

• 72

Article

SigLIP 2: A better multilingual vision language encoder

and 2 others •

Feb 21

• 181

Article

Merge Large Language Models with mergekit

•

Jan 9, 2024

• 134

upvoted 3 papers 7 months ago

Fast Video Generation with Sliding Tile Attention

Paper • 2502.04507 • Published Feb 6 • 52

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 129

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 170

upvoted an article 7 months ago

Article

Open-source DeepResearch – Freeing our search agents

and 4 others •

Feb 4

• 1.29k

upvoted a collection 7 months ago

llama.vim

Collection

upvoted an article 7 months ago

Article

Finally, a Replacement for BERT: Introducing ModernBERT

and 14 others •

Dec 19, 2024

• 683

Simeon Emanuilov PRO

AI & ML interests

Recent Activity

Organizations

s-emanuilov's activity

Welcome EmbeddingGemma, Google's new efficient embedding model

Train 400x faster Static Embedding Models with Sentence Transformers

Training and Finetuning Embedding Models with Sentence Transformers v3

Training and Finetuning Reranker Models with Sentence Transformers v4

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

SigLIP 2: A better multilingual vision language encoder

Merge Large Language Models with mergekit

Open-source DeepResearch – Freeing our search agents

Finally, a Replacement for BERT: Introducing ModernBERT