metadata
library_name: transformers
base_model:
- Plachta/Seed-VC
pipeline_tag: audio-to-audio
tags:
- voice-conversion
- seed-vc
- audio
Seed-VC seed-uvit-whisper-base Finetune
Introduction
This model is a fine-tuned version of Plachta/Seed-VC's seed-uvit-whisper-base with 168 hours of clean singing audios in korean.
It demonstrates significant improvements in naturalness and voice quality.
π― Reference Audio
π§ Audio Comparison
Model | ===================Converted Singing Audio (25 steps)=================== |
---|---|
Original | |
Base | |
Finetune |