Large web-mined general corpus based on CommonCrawl.
Amir Hossein Kargaran
kargaranamir
AI & ML interests
#NLP, checkout https://huggingface.co/cis-lmu
Recent Activity
upvoted
a
collection
1 day ago
llm-urls-neurips
liked
a dataset
1 day ago
nhagar/glotcc-v1_urls
liked
a dataset
2 days ago
shachardon/ShareLM