Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

intfloat
/
e5-large-v2

Sentence Similarity
sentence-transformers
PyTorch
ONNX
Safetensors
OpenVINO
English
bert
mteb
Sentence Transformers
Eval Results
text-embeddings-inference
Model card Files Files and versions Community
23
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

How can I get large corpus dataset (over 200 Millions of records) in a tsv file format to encode with intfloat/e5-large-v2 as an embedding model ?

#15 opened about 1 year ago by
liorf95

Comparison with multilingual-e5-large

#14 opened over 1 year ago by
xuuxu

Single input vs Multiple inputs

1
#13 opened over 1 year ago by
innovationTony

Possible Vector Collaps Issue

1
#10 opened almost 2 years ago by
Banso

Changing the dimensions of the embeddings

1
#9 opened almost 2 years ago by
Suijhin

Adding ONNX file of this model

#5 opened almost 2 years ago by
asifanchor

Adding `safetensors` variant of this model

#4 opened about 2 years ago by
SFconvertbot

e5-large-v2 requirements for training in non english?

2
#3 opened about 2 years ago by
wilfoderek

Which embedding vector to use?

8
#2 opened about 2 years ago by
moooji

How can I support the max_length=2048

6
#1 opened about 2 years ago by
nlpdev3
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs