--- library_name: transformers tags: [ ] model_index: - name: Llama-speechlmm-1.0-l-SSUM results: [ ] --- ## Model Information This is the version of [meetween/Llama-speechlmm-1.0-l](https://huggingface.co/meetween/Llama-speechlmm-1.0-l) that was fine-tuned for Speech Summarization. **License:** see [LICENSE](LICENSE) ## Model Architecture Identical to base model. This model does not include a video adapter. This model was obtained by fine-tuning the adapter and a LoRA on the decoder. This repository contains the weights with the LoRA merged into the main weights. ## How to Use Identical to base model. ## Training Data This model has been fine-tuned on the same AMI and ICSI speech summarization data from the training data of the base model. ## Evaluation Results

Model Name	Topic Segmentation	Summary of Summaries	ICSI
R-1	R-2	R-L
Cascade (Whisper + Textual summ.)
Base Model	No	No	27.6	3.8	25.3
Base Model	Yes	No	25.9	5.3	23.8
Base Model	Yes	Yes	21.1	2.3	18.3
meetween/Llama-speechlmm-1.0-l-TSUM	No	No	31.0	4.3	27.6
This Model	No	No	28.9	3.8	26.0

end-to-end directly from audio
Base Model	N/A	N/A	26.6	3.5	23.9
+LoRA decoder	N/A	N/A	27.9	3.3	25.2
+adapter finetune +LoRA decoder (this model)	No	No	32.1	4.1	29.1

## Framework versions - Transformers 4.45.0 - Pytorch 2.3.1+cu124.post2 - Datasets 3.2.0 - Tokenizers 0.20.0