NLP ZG Collection • Collection of all datasets, models, and demos created during the NLP course at the University of Zagreb. • 14 items • Updated 5 days ago • 2
Instruction Pre-Training: Language Models are Supervised Multitask Learners • Paper • 2406.14491 • Published Jun 20, 2024 • 87
The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only • Paper • 2306.01116 • Published Jun 1, 2023 • 33