---
library_name: transformers
license: other
license_name: custom
license_link: LICENSE
model_index:
- name: Llama-speechlmm-1.0-l-ST
base_model:
- meetween/Llama-speechlmm-1.0-l
datasets:
- facebook/covost2
language:
- es
- it
- en
- fr
- de
metrics:
- bleu
pipeline_tag: translation
---

## Model Information

This is the version of [meetween/Llama-speechlmm-1.0-l](https://huggingface.co/meetween/Llama-speechlmm-1.0-l) that was fine-tuned for Speech-to-Text Translation.

**License:** see [LICENSE](LICENSE)

## Model Architecture

Identical to the base model. The model was obtained by training LoRA on the LLM. This repository contains the model weights with LoRA merged into the main weights.

## How to Use

Identical to the base model.

## Fine-tuning Data

This model has been fine-tuned on the same EuroParl-ST and CoVoST2 speech translation data ({en, fr, it, de, es} → {en, fr, it, de, es}) from the training data of the base model.

## Evaluation Results
Scores are BLEU.

| Model | CoVoST2 en-de | CoVoST2 de-en | CoVoST2 es-en | CoVoST2 fr-en | CoVoST2 it-en | ACL 60/60 en-fr | ACL 60/60 en-de | AVG |
|---|---|---|---|---|---|---|---|---|
| SeamlessM4T | - | - | - | - | - | 40.4 | 28.0 | - |
| SpeechLMM_v1.0_L | 31.1 | 36.2 | 41.1 | 39.0 | 32.5 | 29.1 | 27.6 | 33.8 |
| SpeechLMM_v1.0_L_ST | 33.7 | 36.7 | 41.5 | 39.0 | 32.4 | 29.6 | 28.5 | 34.5 |
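
The card does not say how the AVG column is computed. The reported values are consistent with an unweighted mean over the seven language-pair scores, rounded to one decimal. The sketch below checks that assumption using the numbers from the table; the helper name `macro_average` is illustrative and not part of any released code.

```python
# BLEU scores copied from the table above, in column order:
# CoVoST2 en-de, de-en, es-en, fr-en, it-en, then ACL 60/60 en-fr, en-de.
scores = {
    "SpeechLMM_v1.0_L": [31.1, 36.2, 41.1, 39.0, 32.5, 29.1, 27.6],
    "SpeechLMM_v1.0_L_ST": [33.7, 36.7, 41.5, 39.0, 32.4, 29.6, 28.5],
}


def macro_average(values):
    """Unweighted mean of the scores, rounded to one decimal place.

    Assumption: this is how the AVG column was derived; the model card
    does not state it explicitly.
    """
    return round(sum(values) / len(values), 1)


averages = {name: macro_average(vals) for name, vals in scores.items()}

for name, avg in averages.items():
    print(f"{name}: {avg}")
```

Under this assumption the computed averages match the table (33.8 and 34.5). SeamlessM4T is left out because it only has scores for the two ACL 60/60 pairs.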