HuBERT - a facebook Collection

facebook 's Collections

Physics of Language Models: Part 4.2

Web-SSL

blt

Perception Encoder

DRAMA

Sparsh

Seamless Communication

MAGNeT

XLSR

XLS-R

Robust Wav2Vec 2.0

HuBERT

Fairseq S^2 TTS

DINOv2

MusicGen Stereo

Sapiens

OPT

FAIR's LayerSkip Llama models

HuBERT

updated Jan 16, 2024

A collection of checkpoints from the HuBERT release, a speech encoder that learns powerful representations from unlabelled audio data.

HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units

Paper • 2106.07447 • Published Jun 14, 2021 • 3

Note The HuBERT paper, accepted at IEEE/ACM Transactions on Audio, Speech and Language Processing Volume 29.
facebook/hubert-base-ls960

Feature Extraction • Updated Nov 5, 2021 • 531k • • 59

Note The "base" HuBERT model fine-tuned on 960 hours of LibriSpeech ASR data.
facebook/hubert-large-ll60k

Feature Extraction • Updated Nov 5, 2021 • 85.5k • 30

Note The "large" HuBERT model pre-trained on LibriVox 60k hours.
facebook/hubert-large-ls960-ft

Automatic Speech Recognition • Updated May 24, 2022 • 302k • 73

Note A fine-tuned version of hubert-large-ll60k, fine-tuned on 960 hours of LibriSpeech ASR data.
facebook/hubert-xlarge-ll60k

Feature Extraction • Updated Oct 20, 2021 • 2.04k • 5

Note The "extra-large" HuBERT model pre-trained on LibriVox 60k hours.
facebook/hubert-xlarge-ls960-ft

Automatic Speech Recognition • 1.0B • Updated Jun 27, 2023 • 54.9k • 14

Note A fine-tuned version of hubert-xlarge-ll60k, fine-tuned on 960 hours of LibriSpeech ASR data. This is the most performant HuBERT checkpoint in the release, achieving a WER of 1.8/2.9% on the LibriSpeech test clean/other subsets respectively.