RT-DETR-v2 r50vd model fine-tuned on about 11k Manga, Webtoon, Manhua and Western Comic style Images for text and speech bubble detection.
Training Image Size = 640. Training Images were resized, not cropped.
Tall Webtoons were split vertically.
Classes are:
0: bubble
1: text_bubble (text inside bubbles)
2: text_free (text outside bubbles)
- Downloads last month
- 32,212
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
π
Ask for provider support