BLiSS 1.0: Evaluating Bilingual Learner Competence in Second Language Small Language Models Paper • 2510.19419 • Published 11 days ago • 1
Teacher Demonstrations in a BabyLM's Zone of Proximal Development for Contingent Multi-Turn Interaction Paper • 2510.20411 • Published 10 days ago • 2
BabyBabelLM: A Multilingual Benchmark of Developmentally Plausible Training Data Paper • 2510.10159 • Published 22 days ago • 2
Looking to Learn: Token-wise Dynamic Gating for Low-Resource Vision-Language Modelling Paper • 2510.08470 • Published 24 days ago • 1
Pico: A Modular Framework for Hypothesis-Driven Small Language Model Research Paper • 2509.16413 • Published Sep 19 • 1
Meta-Pretraining for Zero-Shot Cross-Lingual Named Entity Recognition in Low-Resource Philippine Languages Paper • 2509.02160 • Published Sep 2 • 1
ByteSpanTokenisers/fw57M-tied_finewebedu-20B_ByteSpanSurprisalGlobalIncrement_64000 Updated Jun 29 • 151
PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs Paper • 2503.09543 • Published Mar 12
Less is More: Pre-Training Cross-Lingual Small-Scale Language Models with Cognitively-Plausible Curriculum Learning Strategies Paper • 2410.22886 • Published Oct 30, 2024 • 1
Self-Training Large Language Models for Tool-Use Without Demonstrations Paper • 2502.05867 • Published Feb 9
Early-Exit and Instant Confidence Translation Quality Estimation Paper • 2502.14429 • Published Feb 20 • 4
Tending Towards Stability: Convergence Challenges in Small Language Models Paper • 2410.11451 • Published Oct 15, 2024
AnchorAL: Computationally Efficient Active Learning for Large and Imbalanced Datasets Paper • 2404.05623 • Published Apr 8, 2024 • 3