vprelovac
/

universal-sentence-encoder-large-5

Model card Files Files and versions

vprelovac commited on Apr 24, 2023

Commit

b1480d9

·

1 Parent(s): de9ea51

Update README.md

Files changed (1) hide show

README.md +24 -3

README.md CHANGED Viewed

@@ -1,3 +1,24 @@
----
-license: apache-2.0
----

+This is a part of the [MTEB test](https://huggingface.co/spaces/mteb/leaderboard).
+```
+# !pip install tensorflow_text
+import tensorflow_hub as hub
+from tensorflow_text import SentencepieceTokenizer
+import tensorflow as tf
+embedder=hub.load("https://tfhub.dev/google/universal-sentence-encoder-multilingual-large/3")
+class USE():
+    def encode(self, sentences, batch_size=32, **kwargs):
+        embeddings = []
+        for i in range(0, len(sentences), batch_size):
+            batch_sentences = sentences[i:i+batch_size]
+            batch_embeddings = embedder(batch_sentences)
+            embeddings.extend(batch_embeddings)
+        return embeddings
+model = USE()
+```