Model Information

This is the version of meetween/Llama-speechlmm-1.0-l that was fine-tuned for Lip Reading.

License: see LICENSE

Model Architecture

Identical to the base model. The model was obtained by training LoRA and the modality adapter on the LLM. This repository contains the model weights with LoRA merged into the main weights.

How to Use

Identical to the base model.

Fine-tuning Data

This model has been fine-tuned on the same data from the training data of the base model.

Evaluation Results

Model Name	Word Error Rate
AV-Hubert	36.41
SpeechLMM_v1.0_L	45.44
SpeechLMM_v1.0_L_LIPREAD	43.06

Framework Versions

Transformers 4.45.0
Pytorch 2.3.1+cu124.post2
Datasets 3.2.0
Tokenizers 0.20.0

Downloads last month: 13

Safetensors

Model size

9B params

Tensor type

I64

BF16

Inference Providers NEW

Other

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for meetween/Llama-speechlmm-1.0-l-LIPREAD

Base model

meetween/Llama-speechlmm-1.0-l

Finetuned

(4)

this model

Collection including meetween/Llama-speechlmm-1.0-l-LIPREAD

SpeechLMM v1

Collection

1st generation of SpeechLMM models, capable of ingesting video, audio and text and generate text as output. From the Meetween consortium (meetween.eu) • 12 items • Updated 16 days ago