Hungarian PDF pages from Common Crawl. Annotated with synthetic QAs by Llama 3.3 70B.
Jonathan Li
jlli
AI & ML interests
None yet
Recent Activity
updated
a collection
6 days ago
Hungarian Document Datasets
updated
a collection
6 days ago
Hungarian Document Datasets
updated
a collection
6 days ago
Hungarian Document Datasets
Organizations
Collections
1
models
0
None public yet
datasets
10
jlli/HuDocVQA
Viewer
•
Updated
•
22.4k
•
67
jlli/HuDocVQA-manual
Viewer
•
Updated
•
54
•
45
jlli/HuCCPDF
Viewer
•
Updated
•
113k
•
239
jlli/Hungarian_CCPDF_SynQA_v2
Viewer
•
Updated
•
24.3k
•
60
•
1
jlli/Hungarian_CCPDF_SynQA
Viewer
•
Updated
•
19.3k
•
52
jlli/SynthDog_hu2
Viewer
•
Updated
•
40k
•
19
jlli/JDocQA-binary
Viewer
•
Updated
•
1.38k
•
26
jlli/JDocQA-nonbinary
Viewer
•
Updated
•
7.54k
•
44
jlli/HungarianDocQA-OCR
Viewer
•
Updated
•
54
•
22
•
1
jlli/SynthDog_hu
Viewer
•
Updated
•
20.5k
•
25