A Multi-Labeled Dataset for Indonesian Discourse: Examining Toxicity, Polarization, and Demographics Information Paper • 2503.00417 • Published Mar 1
NusaAksara: A Multimodal and Multilingual Benchmark for Preserving Indonesian Indigenous Scripts Paper • 2502.18148 • Published Feb 25
IndoToxic2024: A Demographically-Enriched Dataset of Hate Speech and Toxicity Types for Indonesian Language Paper • 2406.19349 • Published Jun 27, 2024