afrideva

afri_deva

AI & ML interests

None yet

afrideva's activity

liked a model about 2 months ago

NousResearch/DeepHermes-3-Llama-3-3B-Preview

Text Generation • Updated Mar 13 • 2.62k • 28

reacted to hexgrad's post with 👍 3 months ago

Post

5797

I wrote an article about G2P: https://hf.co/blog/hexgrad/g2p

G2P is an underrated piece of small TTS models, like offensive linemen who do a bunch of work and get no credit.

Instead of relying on explicit G2P, larger speech models implicitly learn this task by eating many thousands of hours of audio data. They often use a 500M+ parameter LLM at the front to predict latent audio tokens over a learned codebook, then decode these tokens into audio.

Kokoro instead relies on G2P preprocessing, is 82M parameters, and thus needs less audio to learn. Because of this, we can cherrypick high fidelity audio for training data, and deliver solid speech for those voices. In turn, this excellent audio quality & lack of background noise helps explain why Kokoro is very competitive in single-voice TTS Arenas.
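To make the idea of explicit G2P preprocessing concrete, here is a minimal sketch of a grapheme-to-phoneme step that runs before a TTS model. It is not Kokoro's actual G2P. The lexicon `TOY_LEXICON`, the function name `g2p`, and the letter-by-letter fallback are all assumptions made for illustration.

```python
import re

# Hypothetical toy lexicon (assumption). Real G2P systems rely on large
# pronunciation dictionaries plus fallback rules or learned models.
TOY_LEXICON = {
    "hello": ["h", "ə", "l", "oʊ"],
    "world": ["w", "ɜː", "l", "d"],
}


def g2p(text, lexicon=TOY_LEXICON):
    """Convert text to a flat list of phoneme symbols.

    Known words are looked up in the lexicon. Unknown words fall back to
    their letters, which is a placeholder and not a real pronunciation rule.
    """
    phonemes = []
    for word in re.findall(r"[a-z']+", text.lower()):
        phonemes.extend(lexicon.get(word, list(word)))
        phonemes.append(" ")  # word boundary marker
    # Drop the trailing boundary marker, if any.
    return phonemes[:-1] if phonemes else phonemes


print(g2p("Hello, world!"))
```

In a pipeline of this shape, the TTS model receives phoneme symbols rather than raw spelling, so it does not have to learn pronunciation from audio alone.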

reacted to davanstrien's post with ❤️ 3 months ago

Post

1863

Why choose between strong LLM reasoning and efficient models?

Use DeepSeek to generate high-quality training data, then distil that knowledge into ModernBERT (https://huggingface.co/answerdotai/ModernBERT-base) for fast, efficient classification.

Blog post: https://danielvanstrien.xyz/posts/2025/deepseek/distil-deepseek-modernbert.html
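The pattern described in this post (a strong teacher labels data, a small student learns from those labels) can be sketched roughly as follows. This is not the blog post's code. The keyword-based `teacher_label` is a hypothetical stand-in for an LLM call, and a tiny Naive Bayes classifier is swapped in for ModernBERT so the example stays self-contained.

```python
import math
from collections import Counter, defaultdict


def teacher_label(text):
    """Hypothetical stand-in for an LLM call that returns a label."""
    cues = ("great", "love", "good")
    return "positive" if any(w in text.lower() for w in cues) else "negative"


def train_student(texts, labels):
    """Fit a tiny multinomial Naive Bayes model on teacher-provided labels."""
    word_counts = defaultdict(Counter)
    class_counts = Counter(labels)
    for text, label in zip(texts, labels):
        word_counts[label].update(text.lower().split())
    vocab = {w for counts in word_counts.values() for w in counts}
    return word_counts, class_counts, vocab


def predict(model, text):
    """Return the most likely label under the student model."""
    word_counts, class_counts, vocab = model
    total = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label, n in class_counts.items():
        score = math.log(n / total)
        denom = sum(word_counts[label].values()) + len(vocab)
        for w in text.lower().split():
            if w in vocab:  # ignore words the student never saw
                score += math.log((word_counts[label][w] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Unlabeled texts are labeled by the teacher, then used to train the student.
unlabeled = [
    "I love this model",
    "great results overall",
    "this is bad",
    "terrible and slow",
    "good value",
    "awful output",
]
labels = [teacher_label(t) for t in unlabeled]
student = train_student(unlabeled, labels)
print(predict(student, "love it"))
```

At inference time only the small student runs, which is where the speed and cost benefit comes from.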

liked a model 3 months ago

deepseek-ai/Janus-Pro-1B

Any-to-Any • Updated Feb 1 • 35.3k • 434

published a Space 3 months ago

Janus Pro 1b

🌍

A unified multimodal understanding and generation model.

updated a Space 3 months ago

Janus Pro 1b

🌍

A unified multimodal understanding and generation model.

liked a model 5 months ago

shafire/talktoaiZERO

Text Generation • Updated Dec 3, 2024 • 52 • 5

liked a model 6 months ago

shafire/SpectraMind

Updated Jan 20 • 70 • 3

liked a Space 7 months ago

154

Chat With Janus 1.3B

🌍

A unified multimodal understanding and generation model.

liked a model 7 months ago

Svngoku/ancient-africans

Text-to-Image • Updated 10 days ago • 96 • 11

updated a dataset 7 months ago

afrideva/stacks-mainnet-contracts

Viewer • Updated Oct 14, 2024 • 39.6k • 59

liked a Space 7 months ago

Edge TTS

📝

Microsoft Edge's Text To Speech

liked 2 Spaces 8 months ago

154

PDF OCR

📝

Convert PDF to text using OCR

PDF to Markdown

🚀

Extract text and metadata from PDF files

liked a dataset 8 months ago

sjyuxyz/web3mmlu

Viewer • Updated Jun 10, 2024 • 298 • 40 • 1

reacted to Tonic's post with 👀 8 months ago

Post

2530

🙋🏻‍♂️ Hey there folks,

✒️InkubaLM has been trained from scratch using 1.9 billion tokens of data for five African languages, along with English and French data, totaling 2.4 billion tokens of data. It is capable of understanding and generating content in five African languages: Swahili, Yoruba, Hausa, isiZulu, and isiXhosa, as well as English and French.

Model: lelapa/InkubaLM-0.4B
Demo: Tonic/Inkuba-0.4B

liked a Space 9 months ago

Phi 3 5 MoE

🖼

liked a dataset 10 months ago

jtatman/famous_movie_quotes

Viewer • Updated Jul 2, 2024 • 6.28k • 19 • 3