Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
nicolay-rΒ 
posted an update 11 days ago
Post
474
πŸ“’ I am happy to share the bulk-translate 0.25.1. 🎊
This is a framework that allows you adapt your LM or use default (googletranslate API) for a quick translation of your dataset data.

⭐ https://github.com/nicolay-r/bulk-translate

bulk-translate is a tiny Python 🐍 no-string framework that allows translate a massive datasets of pre-annotated fixed-spans with related metadata that are invariant for translator (see picture below). It supports πŸ‘¨β€πŸ’» API for quick data translation with (optionaly) annotated objects in texts (see figure below) in Python 🐍

In new release we enahnce:
1. Fixed: sync type checking for spans representation
2. Compatibility with AREkit pipelines

πŸ€– The quick tutorial for applying it towards list of textual data with optional spans:
https://github.com/nicolay-r/nlp-thirdgate/blob/master/tutorials/translate_texts_with_spans_via_googletrans.ipynb
In this post