66 2 42

No Name

Ainonake

AI & ML interests

None yet

Recent Activity

new activity about 1 month ago

TheDrummer/Valkyrie-49B-v1-GGUF:Crashes Ollama

new activity about 1 month ago

lodestones/Chroma:This model is going to be insane

new activity about 2 months ago

Undi95/QwQ-RP:I notice need Qwen2, but for SillyTavern, what I need to use?

View all activity

Organizations

None yet

New activity in TheDrummer/Valkyrie-49B-v1-GGUF about 1 month ago

Crashes Ollama

#2 opened about 1 month ago by

SeanUhTron

New activity in lodestones/Chroma about 1 month ago

This model is going to be insane

❤️ 4

#37 opened about 1 month ago by

Ainonake

New activity in Undi95/QwQ-RP about 2 months ago

I notice need Qwen2, but for SillyTavern, what I need to use?

#2 opened about 2 months ago by

RGTails

New activity in TheDrummer/Llama-3SOME-8B-v2 about 2 months ago

fp8

#6 opened about 2 months ago by

010O11

New activity in Undi95/MLewd-ReMM-L2-Chat-20B about 2 months ago

How to query for multi chat

#5 opened about 2 months ago by

Drseltsam

New activity in ByteDance-Seed/UI-TARS-1.5-7B about 2 months ago

Ollama deployment

👍 2

#7 opened 2 months ago by

sedatkaradag

New activity in unsloth/Qwen3-235B-A22B-GGUF 2 months ago

Ud quants please🥺

#5 opened 2 months ago by

Ainonake

New activity in second-state/Qwen3-32B-GGUF 2 months ago

Q8 is noticeably better than Q4_0. Both suffer from repetition in rp in non-english lanuages

#2 opened 2 months ago by

Ainonake

LMstudio error

#1 opened 2 months ago by

DmitryV

New activity in nyuuzyou/archiveofourown 2 months ago

I am very interested in this dataset

😔 👍 3

140

#3 opened 3 months ago by

Demanin

New activity in TheDrummer/Anubis-70B-v1 2 months ago

Gave it a whirl

#4 opened 2 months ago by

SkyStach

New activity in nari-labs/Dia-1.6B 2 months ago

Multiple language support

👀 1

#14 opened 2 months ago by

thunder-007

New activity in microsoft/MAI-DS-R1 2 months ago

Report: Nice job, you've ruined a nice model with censorship no one wants or asks for

❤️ 👍 10

#7 opened 2 months ago by

Ainonake

Can someone explain why this model is currently at top 3 of the trending list?

👍 10

#8 opened 2 months ago by

RobinFeng123

New activity in TheDrummer/Fallen-Command-A-111B-v1.1 3 months ago

🚩 Report: Illegal or restricted content

🤝 🔥 6

#1 opened 3 months ago by

clecho52

liked a model 3 months ago

canopylabs/orpheus-3b-0.1-ft

Text-to-Speech • 4B • Updated May 6 • 21k • • 576

liked a model 4 months ago

RekaAI/reka-flash-3

21B • Updated Mar 13 • 4.89k • 375

reacted to tomaarsen's post with ❤️ 4 months ago

Post

6828

An assembly of 18 European companies, labs, and universities have banded together to launch 🇪🇺 EuroBERT! It's a state-of-the-art multilingual encoder for 15 European languages, designed to be finetuned for retrieval, classification, etc.

🇪🇺 15 Languages: English, French, German, Spanish, Chinese, Italian, Russian, Polish, Portuguese, Japanese, Vietnamese, Dutch, Arabic, Turkish, Hindi
3️⃣ 3 model sizes: 210M, 610M, and 2.1B parameters - very very useful sizes in my opinion
➡️ Sequence length of 8192 tokens! Nice to see these higher sequence lengths for encoders becoming more common.
⚙️ Architecture based on Llama, but with bi-directional (non-causal) attention to turn it into an encoder. Flash Attention 2 is supported.
🔥 A new Pareto frontier (stronger *and* smaller) for multilingual encoder models
📊 Evaluated against mDeBERTa, mGTE, XLM-RoBERTa for Retrieval, Classification, and Regression (after finetuning for each task separately): EuroBERT punches way above its weight.
📝 Detailed paper with all details, incl. data: FineWeb for English and CulturaX for multilingual data, The Stack v2 and Proof-Pile-2 for code.

Check out the release blogpost here: https://huggingface.co/blog/EuroBERT/release
* EuroBERT/EuroBERT-210m
* EuroBERT/EuroBERT-610m
* EuroBERT/EuroBERT-2.1B

The next step is for researchers to build upon the 3 EuroBERT base models and publish strong retrieval, zero-shot classification, etc. models for all to use. I'm very much looking forward to it!

1 reply

New activity in Undi95/MistralThinker-v1.1 4 months ago

This shit is fire

#2 opened 4 months ago by

Ainonake

liked a model 4 months ago

tencent/HunyuanVideo-I2V

Image-to-Video • Updated Mar 13 • 3.66k • 321

No Name

AI & ML interests

Recent Activity

Organizations

Ainonake's activity

Crashes Ollama

This model is going to be insane

I notice need Qwen2, but for SillyTavern, what I need to use?

fp8

How to query for multi chat

Ollama deployment

Ud quants please🥺

Q8 is noticeably better than Q4_0. Both suffer from repetition in rp in non-english lanuages

LMstudio error

I am very interested in this dataset

Gave it a whirl

Multiple language support

Report: Nice job, you've ruined a nice model with censorship no one wants or asks for

Can someone explain why this model is currently at top 3 of the trending list?

🚩 Report: Illegal or restricted content

This shit is fire