Dominik Weckmüller's picture

Dominik Weckmüller

do-me

AI & ML interests

Making AI more accessible. Working on semantic search, embeddings and Geospatial AI applications. https://geo.rocks

Recent Activity

Organizations

Social Post Explorers's profile picture

do-me's activity

upvoted an article 15 days ago
view article
Article

Train 400x faster Static Embedding Models with Sentence Transformers

128
New activity in minishlab/README 15 days ago

Inferencing in Rust

5
#1 opened 17 days ago by
do-me
New activity in EuropeanParliament/Eurovoc 16 days ago

Keep new lines

#5 opened 16 days ago by
do-me
reacted to MoritzLaurer's post with 🔥 22 days ago
view post
Post
2208
🚀 Releasing a new zeroshot-classifier based on ModernBERT! Some key takeaways:

- ⚡ Speed & efficiency: It's multiple times faster and uses significantly less memory than DeBERTav3. You can use larger batch sizes and enabling bf16 (instead of fp16) gave me a ~2x speed boost as well
- 📉 Performance tradeoff: It performs slightly worse than DeBERTav3 on average across my zeroshot classification task collection
- 🧠 Use cases: I recommend using it for scenarios requiring speed and a larger context window (8k).
- 💡 What’s next? I’m preparing a newer version trained on better + longer synthetic data to fully leverage the 8k context window and improve upon the training mix of my older zeroshot-v2.0 models. I also hope that there will be a multilingual variant in the future.

Great work by https://huggingface.co/answerdotai !

If you’re looking for a high-speed zeroshot classifier, give it a try!

📄 Resources below: 👇
Base model: MoritzLaurer/ModernBERT-base-zeroshot-v2.0
Large model: MoritzLaurer/ModernBERT-large-zeroshot-v2.0
Updated zeroshot collection: MoritzLaurer/zeroshot-classifiers-6548b4ff407bb19ff5c3ad6f
ModernBERT collection with paper: answerdotai/modernbert-67627ad707a4acbf33c41deb
reacted to hexgrad's post with 🤗 about 1 month ago
reacted to hexgrad's post with 🔥 about 1 month ago
view post
Post
4029
Merry Christmas! 🎄 Open sourced a small TTS model at hexgrad/Kokoro-82M
  • 2 replies
·
upvoted an article about 2 months ago
liked a Space about 2 months ago
posted an update 3 months ago
New activity in minishlab/M2V_base_glove 3 months ago

ONNX weights

5
#1 opened 3 months ago by
do-me