ali-issa/eng_filtered_short_sentences_less_than_5_words_training_data_for_opus_aya_xnli_with_vocab Viewer • Updated 19 days ago • 142M • 110
ali-issa/new_eng_custom_tokenizer_filtered_short_sentences_less_than_5_words Updated 20 days ago • 22
ali-issa/eng_custom_tokenizer_filtered_short_sentences_less_than_5_words_training_data_for_opus_aya_xnli Updated 20 days ago • 28
ali-issa/eng_filtered_short_sentences_less_than_5_words_training_data_for_opus_aya_xnli Viewer • Updated 21 days ago • 142M • 191
ali-issa/v2_arb_diacritized_tokenized_filtered_dataset_with_custom_tokenizer Viewer • Updated 21 days ago • 79.9M • 111
ali-issa/arb_diacritized_tokenized_filtered_dataset_with_custom_tokenizer Viewer • Updated 24 days ago • 53.9M • 107
ali-issa/arb_diacritized_tokenized_filtered_dataset_with_arb-bpe-tokenizer-32768 Viewer • Updated 29 days ago • 141M • 324
ali-issa/new_removed_none_values_arb_filtered_and_diacritized_short_sentences_less_than_5_words Viewer • Updated Feb 12 • 141M • 204