Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
orasulย 
posted an update 2 days ago
Post
1580
hi, it is deki, and now I am open sourced.

An Android AI agent powered by open-source ML model, ๐—ฑ๐—ฒ๐—ธ๐—ถ, was fully open-sourced.

It understands whatโ€™s on your screen and can perform tasks based on your voice or text commands.

Some examples:
* "Write my friend "some_name" in WhatsApp that I'll be 15 minutes late"
* "Open Twitter in the browser and write a post about something"
* "Read my latest notifications"
* "Write a linkedin post about something"

Currently, it works only on Android โ€” but support for other OS is planned.

The ML and backend codes were also fully open-sourced.

Video prompt example:

"Open linkedin, tap post and write: hi, it is deki, and now I am open sourced. But don't send, just return"

License: GPLv3

You can find other AI agent demos or usage examples, like, code generation or object detection in github.

Github: https://github.com/RasulOs/deki

Thanks for a great work so far.
It is nice to see someone using more basic ML models to work more efficiently instead of just relying on big models.
I believe one good use-case would be the automatic conversion of diagram images (for engineering or SW) to mermaid diagrams for examples, if the raw text and json outputs are both provided to good coding LLM
Let me know if you are interested in something like that.
This can be a good project - with possible business application - to allow enterprises to make their existing documentation AI ready by doing the conversion.