Joshua Nemecek

jnemecek

https://www.buymeacoffee.com/DCNemesis

AI & ML interests

NLP for low resource languages

Recent Activity

upvoted a paper about 16 hours ago

Biomed-Enriched: A Biomedical Dataset Enriched with LLMs for Pretraining and Extracting Rare and Hidden Content

commented on a paper 4 days ago

Optimizing Multilingual Text-To-Speech with Accents & Emotions

Organizations

commented 2 papers 4 days ago

Optimizing Multilingual Text-To-Speech with Accents & Emotions

Paper • 2506.16310 • Published 8 days ago • 22 •

commented 2 papers 4 months ago

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Paper • 2503.07920 • Published Mar 10 • 99 •

Magic 1-For-1: Generating One Minute Video Clips within One Minute

Paper • 2502.07701 • Published Feb 11 • 36 •

New activity in bible-nlp/biblenlp-corpus 7 months ago

Getting error 36

🤝 1

#5 opened almost 2 years ago by

sakthi07

Convert name keyword argument to an md5 hash

👍 1

#10 opened about 1 year ago by

ajland

commented 2 papers 9 months ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 179 •

MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

Paper • 2406.19680 • Published Jun 28, 2024 • 1 •

commented a paper 10 months ago

PeriodWave: Multi-Period Flow Matching for High-Fidelity Waveform Generation

Paper • 2408.07547 • Published Aug 14, 2024 • 8 •

commented a paper 11 months ago

BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation

Paper • 2407.17952 • Published Jul 25, 2024 • 33 •

New activity in sil-ai/wav2vec2-bloom-speech-sdk over 1 year ago

License link is dead

#1 opened over 1 year ago by

Lechasseur

New activity in bible-nlp/biblenlp-corpus over 1 year ago

[Suggestion] Parquet

👍 1

#6 opened almost 2 years ago by

christopher

Difference between this dataset and the OPUS version?

#7 opened over 1 year ago by

ZhaofengWu

commented a paper over 1 year ago

BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data

Paper • 2402.08093 • Published Feb 12, 2024 • 62 •

New activity in bible-nlp/biblenlp-corpus almost 2 years ago

corpus.json doesn't contain all the txt files

#4 opened almost 2 years ago by

charlie0608

New activity in sil-ai/bloom-lm about 2 years ago

Shipibo

#6 opened about 2 years ago by

Eklavya

New activity in sil-ai/bloom-speech over 2 years ago

load_dataset function returns an error

#3 opened over 2 years ago by

idar

New activity in woz4tetra/charged_up_2023 over 2 years ago

Labels aren't showing up

#1 opened over 2 years ago by

jnemecek

New activity in sil-ai/bloom-speech over 2 years ago

Problem loading data with datasets==2.7

#1 opened over 2 years ago by

cassiepowell

Fix for datasets 2.7

#2 opened over 2 years ago by

lhoestq
