NER4all or Context is All You Need: Using LLMs for low-effort, high-performance NER on historical texts. A humanities informed approach Paper • 2502.04351 • Published 17 days ago • 1 • 1
Hierarchical Autoregressive Transformers: Combining Byte- and Word-Level Processing for Robust, Adaptable Language Models Paper • 2501.10322 • Published Jan 17 • 1 • 2
Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published Jan 14 • 55 • 3
AfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African Languages Paper • 2501.08284 • Published Jan 14 • 6 • 2
Building Foundations for Natural Language Processing of Historical Turkish: Resources and Models Paper • 2501.04828 • Published Jan 8 • 11 • 3