onnx support

#2
by thewh1teagle - opened

Hi!
Thank you for the model, it's very useful.
I would like to convert this model to ONNX so I can build a lightweight library around it; that would also make it possible to quantize the model.
I successfully converted the model to ONNX, and now it looks like I only need the tokenizer.
I'm not sure how to implement it: I see some config files in this repo, but I'm still not sure which of them are needed, how to use them, or whether anything else is required.
I created the repo dicta-onnx.
Can you help with that? Thank you.
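For reference, a minimal export sketch along these lines, assuming the checkpoint id, that it loads with trust_remote_code as a BERT-style encoder, and that the first field of its output holds the logits (adapt the names to the real graph):

import torch
from transformers import AutoModel, AutoTokenizer

# Assumption: this is the checkpoint behind this discussion.
model_id = "dicta-il/dictabert-large-char-menaked"
model = AutoModel.from_pretrained(model_id, trust_remote_code=True)
model.eval()

class ExportWrapper(torch.nn.Module):
    # Wrap the model so the traced graph returns a plain tensor
    # instead of a dict-like ModelOutput.
    def __init__(self, m):
        super().__init__()
        self.m = m
    def forward(self, input_ids, attention_mask):
        out = self.m(input_ids=input_ids, attention_mask=attention_mask)
        return out[0]  # assumption: first output field is the logits

tok = AutoTokenizer.from_pretrained(model_id)
dummy = tok("שלום", return_tensors="pt")

torch.onnx.export(
    ExportWrapper(model),
    (dummy["input_ids"], dummy["attention_mask"]),
    "model.onnx",
    input_names=["input_ids", "attention_mask"],
    output_names=["logits"],
    dynamic_axes={
        "input_ids": {0: "batch", 1: "seq"},
        "attention_mask": {0: "batch", 1: "seq"},
        "logits": {0: "batch", 1: "seq"},
    },
    opset_version=17,
)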

thewh1teagle changed discussion title from onnx inference support to onnx support
DICTA: The Israel Center for Text Analysis org

Love the idea! Can you use the HF library?

If so, it's as simple as:

from transformers import BertTokenizerFast

tok = BertTokenizerFast(tokenizer_file="tokenizer.json")
inputs = tok(texts, return_tensors="np")

I'll try it later and see if I can use this as the model inputs.
How can I decode the outputs?
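A minimal inference-and-decode sketch, assuming the exported graph takes input_ids/attention_mask and emits per-token class logits; mapping the predicted ids back to actual diacritic marks is model-specific:

import onnxruntime as ort
from transformers import BertTokenizerFast

tok = BertTokenizerFast(tokenizer_file="tokenizer.json")
sess = ort.InferenceSession("model.onnx")

inputs = tok(["שלום עולם"], return_tensors="np")
logits = sess.run(
    None,
    {"input_ids": inputs["input_ids"], "attention_mask": inputs["attention_mask"]},
)[0]

# Highest-scoring class per token position; translating these ids into
# diacritics depends on the model's label set.
pred_ids = logits.argmax(axis=-1)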

DICTA: The Israel Center for Text Analysis org

I successfully converted the model to ONNX and published a new Python library. It's faster, and I even created a quantized model (int8) whose weights are only ~300MB!
See https://github.com/thewh1teagle/dicta-onnx
Thank you!
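A dynamic int8 quantization like this can be done with onnxruntime's quantization tool; a minimal sketch (the file names are assumptions):

from onnxruntime.quantization import QuantType, quantize_dynamic

# Quantize the exported graph's weights to int8 in place of float32,
# which is what shrinks the file to a fraction of its original size.
quantize_dynamic(
    model_input="model.onnx",
    model_output="model.int8.onnx",
    weight_type=QuantType.QInt8,
)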

DICTA: The Israel Center for Text Analysis org

Very nice! We'll add it to the model card shortly!

I also created an HF Space that adds diacritics to text using the quantized model:
https://huggingface.co/spaces/thewh1teagle/add-diacritics-in-hebrew

DICTA: The Israel Center for Text Analysis org
Shaltiel changed discussion status to closed
