SpeechLMM v1

updated 19 days ago

1st generation of SpeechLMM models, capable of ingesting video, audio, and text and generating text as output. From the Meetween consortium (meetween.eu).

  • meetween/Llama-speechlmm-1.0-s

    Feature Extraction • 2B • Updated Mar 24 • 8

  • meetween/Llama-speechlmm-1.0-m

    Feature Extraction • 4B • Updated Mar 24 • 9

  • meetween/Llama-speechlmm-1.0-l

    Feature Extraction • 8B • Updated Jun 17 • 11

  • meetween/Llama-speechlmm-1.0-xl

    Feature Extraction • 1B • Updated Mar 12 • 7

  • meetween/Llama-speechlmm-1.0-l-ASR

    0.6B • Updated Jun 5 • 3

  • meetween/Llama-speechlmm-1.0-l-ST

    Translation • 9B • Updated Apr 30 • 6

  • meetween/Llama-speechlmm-1.0-l-MT

    Translation • 9B • Updated Jun 18 • 7

  • meetween/Llama-speechlmm-1.0-l-SLU

    9B • Updated Jun 19 • 4

  • meetween/Llama-speechlmm-1.0-l-LIPREAD

    Other • 9B • Updated May 23 • 4

  • meetween/Llama-speechlmm-1.0-l-SQA

    Translation • 9B • Updated May 22 • 7

  • meetween/Llama-speechlmm-1.0-l-SSUM

    9B • Updated Apr 22 • 3

  • meetween/Llama-speechlmm-1.0-l-TSUM

    9B • Updated Apr 22 • 3