Generate speech from text using adjustable rate and pitch
Qwen-Qwen2.5-7B-Instruct
Generate captions for images