Translation
Transformers
Safetensors
speechlmm

Model Information

This is the version of meetween/Llama-speechlmm-1.0-l that was fine-tuned for Speech-to-Text Translation.

License: see LICENSE

Model Architecture

Identical to the base model. The model was obtained by training LoRA on the LLM. This repository contains the model weights with LoRA merged into the main weights.

How to Use

Identical to the base model.

Fine-tuning Data

This model has been fine-tuned on the same EuroParl-ST and CoVoST2 speech translation data ({en, fr, it, de, es} → {en, fr, it, de, es}) from the training data of the base model.

Evaluation Results

DATASET: CoVoST2 ACL 60/60 AVG
BLEU en-de de-en es-en fr-en it-en en-fr en-de
SeamlessM4T - - - - - 40.4 28.0 -
SpeechLMM_v1.0_L 31.1 36.2 41.1 39.0 32.5 29.1 27.6 33.8
SpeechLMM_v1.0_L_ST 33.7 36.7 41.5 39.0 32.4 29.6 28.5 34.5

Framework Versions

  • Transformers 4.45.0
  • Pytorch 2.3.1+cu124.post2
  • Datasets 3.2.0
  • Tokenizers 0.20.0
Downloads last month
9
Safetensors
Model size
8.98B params
Tensor type
I64
·
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for meetween/Llama-speechlmm-1.0-l-ST

Finetuned
(1)
this model

Dataset used to train meetween/Llama-speechlmm-1.0-l-ST