nguyenvulebinh/AVSRCocktail

Automatic Speech Recognition
Transformers
Safetensors
PyTorch
English
avhubert_avsr
audio-visual-speech-recognition
multimodal
speech-recognition
lip-reading
cocktail-party
noise-robust
av-hubert
transformer
audio
video
english
lrs2
voxceleb2
ctc
attention
beam-search
multi-speaker
noisy-speech
  • 1 contributor
History: 3 commits
Latest commit: Update README.md (ae29b16, verified) by nguyenvulebinh, 7 days ago
  • .gitattributes (1.52 kB): initial commit, 11 days ago
  • README.md (9.91 kB): Update README.md, 7 days ago
  • config.json (4.44 kB): Upload AVHubertAVSR, 11 days ago
  • model.safetensors (1.72 GB, LFS): Upload AVHubertAVSR, 11 days ago