ConECT Dataset: Overcoming Data Scarcity in Context-Aware E-Commerce MT Paper • 2506.04929 • Published 10 days ago
Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order Paper • 2404.00399 • Published Mar 30, 2024 • 43
BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing Paper • 2206.15076 • Published Jun 30, 2022 • 4
CSMeD: Bridging the Dataset Gap in Automated Citation Screening for Systematic Literature Reviews Paper • 2311.12474 • Published Nov 21, 2023
cartesinus/iva_mt_wslot-m2m100_418M-en-pl-plaintext Text2Text Generation • Updated Aug 21, 2023 • 11 • 1
Shushant/biobert-v1.1-biomedicalQuestionAnswering Question Answering • Updated Jan 16, 2022 • 4.7k • • 6