Anime-Llasa-3B

Overview

This is the Anime-Llasa-3B, a Text-to-Speech (TTS) model fine-tuned for Japanese. This model is based on HKUSTAudio/Llasa-3B.

Demo

You can try a demo on Hugging Face Spaces: Anime-Llasa-3B-Demo

What's New?

The primary improvement in this version is a significant increase in the training data. The amount of training data has been increased from approximately 14,000 hours (3 epochs) to approximately 33,000 hours (1 epoch).

This enhancement aims to further improve the model's expressiveness and overall stability.

Old Version

Version 3 model

Version 1 model

License

This model is licensed under the CC-BY-NC-4.0.

Downloads last month
6,968
Safetensors
Model size
3B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for NandemoGHS/Anime-Llasa-3B

Finetuned
(4)
this model
Finetunes
1 model
Quantizations
4 models

Datasets used to train NandemoGHS/Anime-Llasa-3B

Space using NandemoGHS/Anime-Llasa-3B 1