A high-quality Vietnamese pretraining dataset for LLMs
UET-IAI-NLP-ViEduQALLMs
community
Collections: 1
Models: 0 (none public yet)
Datasets: 13
group2sealion/15mil_milestone • 2.43M • 17
group2sealion/vnu_crawl • 47.6k • 40
group2sealion/4mil_milestone • 2.53M • 16
group2sealion/11mil_last • 1.85M • 15
group2sealion/8mil_last • 1.85M • 28
group2sealion/last_result • 1.82M • 50
group2sealion/8mil_last_domains • 338k • 39
group2sealion/8mil_clean • 1.73M • 133
group2sealion/11mil_clean • 1.73M • 86
group2sealion/11mil_milestone • 1.9M • 84