meetween/Llama-speechlmm-1.0-l-MT

Model Information

This is the version of meetween/Llama-speechlmm-1.0-l that was fine-tuned for Speech-to-Text Translation.

License: see LICENSE

Model Architecture

Identical to the base model. The model was obtained by training LoRA on the LLM. This repository contains the model weights with LoRA merged into the main weights.

How to Use

Identical to the base model.

Fine-tuning Data

This model has been fine-tuned on the same EuroParl-ST machine translation data ({en, fr, it, de, es} → {en, fr, it, de, es}) from the training data of the base model.

Evaluation Results

DATASET:	FLORES				ACL 60/60		AVG
BLEU	en-de	en-es	en-it	en-fr	en-fr	en-de	AVG
Llama3-instruct (D5)	28.1	24.4	25.0	41.2	48.8	34.2	33.6
NLLB (D5)	39.4	23.7	31.2	50.7	59.1	45.2	41.6
SpeechLMM_v1.0_L	29.4	22.3	20.1	31.9	35.5	32.8	28.7
Speech LMM v1.0_L-FT (LoRA)	20.0	16.0	11.6	21.8	24.9	20.7	19.2

Framework Versions

Transformers 4.45.0
Pytorch 2.3.1+cu124.post2
Datasets 3.2.0
Tokenizers 0.20.0

meetween
/

Llama-speechlmm-1.0-l-MT

Model Information

Model Architecture

How to Use

Fine-tuning Data

Evaluation Results

Framework Versions

Model tree for meetween/Llama-speechlmm-1.0-l-MT

Collection including meetween/Llama-speechlmm-1.0-l-MT

SpeechLMM v1