Multi Lingual OCR models
AI & ML interests
None defined yet.
Recent Activity
View all activity
Organization Card
Nayana - Vision AI for all

Enabling Vision Language Capabilites for Low resource langauges
Initiative by Cognitivelab
Problem Statement
Despite advancements in vision-language AI, a significant number of the world's languages remain underserved, leaving millions without tools to process documents in their native scripts.
Challenges Addressed by Nayana:
- Wide Language Gap: Lack of robust OCR solutions for a large spectrum of languages, particularly low-resource and rare languages.
- Script Complexity: Supporting diverse writing systems, including those with intricate scripts, cursive styles, or mixed-language content.
- Scalability: Need for adaptable models that can handle real-world multilingual document processing at scale.
Nayana is designed to tackle these challenges by fine-tuning cutting-edge OCR models for diverse languages across multiple regions, empowering users to extract actionable insights from their documents regardless of the language or script.
Vision
To democratize access to Vision-Language AI for all communities by empowering a wide range of languages, including low-resource and underrepresented ones, with cutting-edge OCR and document understanding capabilities.
Mission
- Enhance Accessibility: Build tools that enable equitable AI solutions for diverse linguistic groups worldwide.
- Expand Language Coverage: Support a vast range of languages and scripts, breaking barriers for multilingual document processing.
- Foster Collaboration: Provide an open-source platform where developers and researchers can enhance and expand multilingual OCR capabilities.
models
21

Nayana-cognitivelab/SectionOCR_SFT_v0_base_gemma
Image-Text-to-Text
•
4B
•
Updated
•
47

Nayana-cognitivelab/NayanaVQA
Image-Text-to-Text
•
8B
•
Updated
•
18

Nayana-cognitivelab/Full-SFT-v1-23000
Image-Text-to-Text
•
8B
•
Updated
•
12

Nayana-cognitivelab/Full-SFT-v1-3500
Image-Text-to-Text
•
8B
•
Updated
•
17

Nayana-cognitivelab/Full-SFT-v1-3000
Image-Text-to-Text
•
8B
•
Updated
•
18

Nayana-cognitivelab/NayanaVQA-archive
Image-Text-to-Text
•
8B
•
Updated
•
11

Nayana-cognitivelab/NayanaSectionOCR
Image-Text-to-Text
•
8B
•
Updated
•
85

Nayana-cognitivelab/DocOCR_SFT_v1_50
Image-Text-to-Text
•
8B
•
Updated
•
11
•
1

Nayana-cognitivelab/exp-colpali-merged-en-20k
3B
•
Updated
•
5

Nayana-cognitivelab/exp-colpali-trained-en-20k-lora
Updated
•
4
datasets
121
Nayana-cognitivelab/ViViD_arxiv
Viewer
•
Updated
•
95.4k
•
35
Nayana-cognitivelab/SectionOCR-SFT-augment
Viewer
•
Updated
•
226k
•
69
Nayana-cognitivelab/DocOCR-SFT-augment
Viewer
•
Updated
•
91.5k
•
40
Nayana-cognitivelab/DocOCR-SFT-v2
Viewer
•
Updated
•
1k
•
20
Nayana-cognitivelab/DocOCR-SFT-v1
Viewer
•
Updated
•
1k
•
15
Nayana-cognitivelab/SectionOCR-SFT-augment-archive
Viewer
•
Updated
•
226k
•
28
Nayana-cognitivelab/VQA-SFT
Viewer
•
Updated
•
557k
•
31
Nayana-cognitivelab/VQA-SFT-test
Viewer
•
Updated
•
377
•
54
Nayana-cognitivelab/DocOCR-SFT
Viewer
•
Updated
•
229k
•
10
Nayana-cognitivelab/SectionOCR-SFT
Viewer
•
Updated
•
656k
•
10