Models specialized in extracting structured information (JSON) from text, PDFs, scans, spreadsheets, etc.
AI & ML interests
Interactive NLP development
Recent Activity
Organization Card
We are a startup building the NuExtract Platform.
We also develop open-source Information Extraction foundation models that we share here. They are often SOTA in their category, and always under MIT license; use them without restrictions π.
spaces
6
Running
on
L40S
36
NuMarkdown 8b Thinking
π
Reasoning model specialized for OCR/Markdown generation.
Runtime error
13
NuExtract 2.0
π
Space for numind/NuExtract-2.0-4B
Runtime error
77
NuExtract 1.5
π
Playground for NuExtract-v1.5
Running
on
T4
36
NuNER_Zero
π»
Identify named entities in text
Paused
71
NuExtract
π
models
34
numind/NuExtract-2.0-2B
Image-Text-to-Text
β’
2B
β’
Updated
β’
3.51k
β’
32
numind/NuExtract-2.0-4B
Image-Text-to-Text
β’
4B
β’
Updated
β’
2.88k
β’
21
numind/NuExtract-2.0-8B
Image-Text-to-Text
β’
8B
β’
Updated
β’
2.7k
β’
41
numind/NuMarkdown-8B-Thinking-GGUF
8B
β’
Updated
β’
1.3k
β’
1
numind/NuExtract-2.0-8B-GGUF
Image-Text-to-Text
β’
8B
β’
Updated
β’
832
β’
1
numind/NuExtract-2.0-4B-GGUF
Image-Text-to-Text
β’
3B
β’
Updated
β’
133
β’
2
numind/NuExtract-2.0-2B-GGUF
Image-Text-to-Text
β’
2B
β’
Updated
β’
195
numind/NuMarkdown-8B-Thinking
Image-to-Text
β’
8B
β’
Updated
β’
2.03k
β’
216
numind/NuExtract-2.0-8B-GPTQ
Image-Text-to-Text
β’
3B
β’
Updated
β’
49
β’
4
numind/NuExtract-1.5
Text Generation
β’
4B
β’
Updated
β’
267k
β’
240