Masakhane NLP

community

https://www.masakhane.io/

MasakhaneNLP

masakhane-io

Activity Feed Request to join this org

AI & ML interests

NLP for African languages, MT, NER, POS, QA, ...

Recent Activity

Davlan updated a dataset about 1 month ago

masakhane/masakhanews

Davlan new activity about 1 month ago

masakhane/masakhanews:main

Waasii updated a collection about 1 month ago

MasakhaPOS

View all activity

Davlan

updated a dataset about 1 month ago

masakhane/masakhanews

Viewer • Updated Dec 3, 2025 • 31.1k • 2.28k • 12

Davlan

in masakhane/masakhanews about 1 month ago

main

#2 opened about 1 month ago by

Davlan

Waasii

updated a collection about 1 month ago

MasakhaPOS

Collection

dataset and models for part-of-speech tagging for African languages. • 23 items • Updated Dec 1, 2025

asimz

authored a paper 3 months ago

Is Multilingual LLM Watermarking Truly Multilingual? A Simple Back-Translation Solution

Paper • 2510.18019 • Published Oct 20, 2025 • 17

israel

updated a dataset 3 months ago

masakhane/AfriDocMT

Viewer • Updated Oct 14, 2025 • 28.2k • 523 • 6

kalilouisangare

updated a collection 3 months ago

MasakhaPOS

Collection

dataset and models for part-of-speech tagging for African languages. • 23 items • Updated Dec 1, 2025

Tonic

posted an update 4 months ago

Post

1113

the french ministry of culture releases their first conversation datasets on huggingface 👇🏻
ministere-culture/comparia-conversations

ajesujoba

updated a dataset 4 months ago

masakhane/AfriDocMT

Viewer • Updated Oct 14, 2025 • 28.2k • 523 • 6

Tonic

posted an update 4 months ago

Post

796

COMPUTER CONTROL IS ON-DEVICE !

🏡🤖 78 % of EU smart-home owners DON’T trust cloud voice assistants.

So we killed the cloud.

Meet Exté: a palm-sized Android device that sees, hears & speaks your language - 100 % offline, 0 % data sent anywhere.

🔓 We submitted our technologies for consideration to the Liquid AI hackathon.

📊 Dataset: 79 k UI-action pairs on Hugging Face (largest Android-control corpus ever) Tonic/android-operator-episodes

⚡ Model: 98 % task accuracy, 678MB compressed , fits on existing android devices ! Tonic/l-android-control

🛤️ Experiment Tracker : check out the training on our TrackioApp Tonic/l-android-control

🎮 Live Model Demo: Upload an Android Screenshot and instructions to see the model in action ! Tonic/l-operator-demo

Built in a garage, funded by pre-orders, no VC. Now we’re scaling to 1 k installer units.

We’re giving 50 limited-edition prototypes to investors , installers & researchers who want to co-design the sovereign smart home.

👇 Drop “EUSKERA” in the comments if you want an invite, tag a friend who still thinks Alexa is “convenient,” and smash ♥️ if AI should belong to people - not servers.

4 replies

Tonic

posted an update 4 months ago

Post

727

🙋🏻‍♂️ Hey there folks ,

Just wanted to annouce 🏭SmolFactory : it's the quickest and best way to finetune SmolLM3 and GPT-OSS-20B on huggingface !

Basicaly it's an app you can run on huggingface by duplicating the space and running your training directly on huggingface GPUs .

It will help you basically select datasets and models, fine tune your model , make an experiment tracker you can use on your mobile phone , push all your model card and even automatically make a demo for you on huggingface so you can directly test it out when it's done !

check out the blog to learn more : https://huggingface.co/blog/Tonic/smolfactory

or just try the app directly :
Tonic/SmolFactory

you can vibe check the cool models I made :
French SmolLM3 : Tonic/Petite-LLM-3
Medical GPT-OSS : Tonic/med-gpt-oss-20b-demo

check out the model cards :
multilingual reasoner (gpt-oss) - Tonic/gpt-oss-20b-multilingual-reasoner
med-gpt-oss : Tonic/med-gpt-oss-20b
petite-elle-l-aime : Tonic/petite-elle-L-aime-3-sft

github repo if you like command line more than gradio : https://github.com/josephrp/smolfactory

drop some likes on these links it's really much appreciated !

feedback and PRs are welcome !

Davlan

published a dataset 4 months ago

masakhane/africa-news

Updated Aug 27, 2025 • 9

ImranzamanML

posted an update 5 months ago

Post

652

# Runway Aleph: The Future of AI Video Editing

Runway’s new **Aleph** model lets you *transform*, *edit*, and *generate* video from existing footage using just text prompts.
You can remove objects, change environments, restyle shots, alter lighting, and even create entirely new camera angles, all in one tool.

## Key Links

- 🔬 [Introducing Aleph (Runway Research)](https://runwayml.com/research/introducing-runway-aleph)
- 📖 [Aleph Prompting Guide (Runway Help Center)](https://help.runwayml.com/hc/en-us/articles/43277392678803-Aleph-Prompting-Guide)
- 🎬 [How to Transform Videos (Runway Academy)](https://academy.runwayml.com/aleph/how-to-transform-videos)
- 📰 [Gadgets360 Coverage](https://www.gadgets360.com/ai/news/runway-aleph-ai-video-editing-generation-model-post-production-unveiled-8965180)
- 🎥 [YouTube Demo: ALEPH by Runway](https://www.youtube.com/watch?v=PPerCtyIKwA)
- 📰 [Runway Alpha dataset]( Rapidata/text-2-video-human-preferences-runway-alpha)

## Prompt Tips

1. Be clear and specific (e.g., _“Change to snowy night, keep people unchanged”_).
2. Use action verbs like _add, remove, restyle, relight_.
3. Add reference images for style or lighting.

Aleph shifts AI video from *text-to-video* to *video-to-video*, making post-production faster, more creative, and more accessible than ever.