OWLS: Scaling Laws for Speech Recognition and Translation - a espnet Collection

espnet 's Collections

OpusLM

Codec Survey - Pre-trained Models

OWSM: Fully Open Speech Recognition and Translation Models

OWLS: Scaling Laws for Speech Recognition and Translation

OWSM-CTC: Ultra-Fast Speech Foundation Models

XEUS Model and Data

OWLS: Scaling Laws for Speech Recognition and Translation

updated May 3

🦉 A suite of Whisper-style models from 250M to 18B parameters. Trained on up to 360K hours of data. 16k sampling rate.