David Dale
cointegrated
AI & ML interests
Research engineer at FAIR, Meta. Some pet projects on NLP for under-resourced languages.
Interests: machine translation, chatbots, applied NLU, controllable text generation (in particular, text style transfer), miniature models.
Recent Activity
new activity
about 11 hours ago
openlanguagedata/flores_plus:(Beginner) Issues using the described method to load FLORES+ dataset
new activity
about 22 hours ago
openlanguagedata/flores_plus:machine
new activity
about 22 hours ago
openlanguagedata/flores_plus:Misalignments in the Aranese subset (aran1260)
cointegrated's activity
(Beginner) Issues using the described method to load FLORES+ dataset
5
#15 opened 1 day ago by jaecbc

machine
1
#8 opened about 2 months ago by maryamelboraie

Misalignments in the Aranese subset (aran1260)
1
#11 opened about 1 month ago by OrianeN

Fix misalignments in the Aranese subset (aran1260)
1
#13 opened 14 days ago by agaliano

Split dataset in subsets per language
1
#5 opened 3 months ago by thomas-ferraz

[DRAFT] Fix orthography in the Russian dev set
4
#4 opened 3 months ago by cointegrated

Fix encoding at chv devtest
4
#9 opened about 2 months ago by alexantonov

Adding `safetensors` variant of this model
#1 opened about 1 month ago by SFconvertbot

Adding `safetensors` variant of this model
#1 opened about 2 months ago by SFconvertbot

can you please do the same for decoder
1
#2 opened 3 months ago by damerajee

[bot] Conversion to Parquet
#1 opened about 2 months ago by parquet-converter

Added Dargwa dev set to flores_plus
2
#3 opened 4 months ago by Murtazali

Add data integrity tests
1
#7 opened 3 months ago by cointegrated

Two sentences in the dev set (one Lombard and one Tamasheq-Tifinagh) seem to be missing
#6 opened 3 months ago by cointegrated

Adding `safetensors` variant of this model
#1 opened 5 months ago by SFconvertbot

[bot] Conversion to Parquet
#1 opened 5 months ago by parquet-converter

Optimize the preprocessing and generation
#11 opened 6 months ago by cointegrated

More Details about the model
1
#1 opened 11 months ago by sanjay73

Training script?
1
#2 opened 10 months ago by vdmbrsv
