Hugging Face
David Dale
cointegrated
79 followers · 8 following
https://daviddale.ru/en
cointegrated
avidale
AI & ML interests
Research engineer at FAIR, Meta. Some pet projects on NLP for under-resourced languages. Interests: machine translation, chatbots, applied NLU, controllable text generation (in particular, text style transfer), and miniature models.
Organizations
cointegrated's activity
liked a dataset 10 days ago
rombodawg/Everything_Instruct_Multilingual
Viewer • Updated Oct 8, 2024 • 5.81M • 261 • 23
New activity in openlanguagedata/flores_plus 18 days ago
[DRAFT] Fix orthography in the Russian dev set • 4
#4 opened 3 months ago by cointegrated
Fix encoding at chv devtest • 4
#9 opened about 2 months ago by alexantonov
liked a dataset about 1 month ago
google/wmt24pp
Viewer • Updated 10 days ago • 54.9k • 6.04k • 32
New activity in slone/nllb-rus-tyv-v1 about 1 month ago
Adding `safetensors` variant of this model
#1 opened about 1 month ago by SFconvertbot
New activity in cointegrated/LaBSE-en-ru about 1 month ago
Warn Some weights of the model checkpoint at cointegrated/LaBSE-en-ru were not used when initializing BertModel: • 1
#4 opened 6 months ago by alashkov83
New activity in slone/LaBSE-shallow-distilled-bak about 2 months ago
Adding `safetensors` variant of this model
#1 opened about 2 months ago by SFconvertbot
New activity in cointegrated/SONAR_200_text_encoder about 2 months ago
can you please do the same for decoder • 1
#2 opened 3 months ago by damerajee
New activity in slone/finugorbib about 2 months ago
[bot] Conversion to Parquet
#1 opened about 2 months ago by parquet-converter
liked a dataset about 2 months ago
udmurtNLP/udmurt-russian-parallel-corpora
Viewer • Updated Feb 1 • 102k • 77 • 3
New activity in openlanguagedata/flores_plus about 2 months ago
Added Dargwa dev set to flores_plus • 2
#3 opened 4 months ago by Murtazali
published a dataset about 2 months ago
slone/finugorbib
Viewer • Updated Jan 27 • 849k • 176 • 1
updated a dataset about 2 months ago
slone/finugorbib
Viewer • Updated Jan 27 • 849k • 176 • 1
liked a dataset 2 months ago
alexantonov/chukot_russian_flores_sample
Viewer • Updated Jan 31 • 100 • 93 • 4
liked a model 2 months ago
Helsinki-NLP/opus-mt-tc-bible-big-mul-mul
Translation • Updated Oct 12, 2024 • 554 • 4
New activity in openlanguagedata/flores_plus 3 months ago
Add data integrity tests • 1
#7 opened 3 months ago by cointegrated
updated a dataset 3 months ago
openlanguagedata/flores_plus
Viewer • Updated 27 days ago • 434k • 2.16k • 29
New activity in openlanguagedata/flores_plus 3 months ago
Two sentences in the dev set (one Lombard and one Tamasheq-Tifinagh) seem to be missing
#6 opened 3 months ago by cointegrated
liked 2 datasets 3 months ago
aronlp/aromanian-romanian-MT-corpus
Viewer • Updated Jan 15 • 105k • 16 • 1
ontocord/fineweb-permissive-multilingual-2m
Viewer • Updated Oct 9, 2024 • 2.23M • 186 • 2