-
DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation
Paper • 2311.07965 • Published • 1 -
CP-EB: Talking Face Generation with Controllable Pose and Eye Blinking Embedding
Paper • 2311.08673 • Published -
CLN-VC: Text-Free Voice Conversion Based on Fine-Grained Style Control and Contrastive Learning with Negative Samples Augmentation
Paper • 2311.08670 • Published -
Stock Volatility Prediction Based on Transformer Model Using Mixed-Frequency Data
Paper • 2309.16196 • Published
AI & ML interests
Large Audio Model、Text to Speech (TTS)、Voice Conversion、Talking Face、Music AI、Speech Security、Infant Acoustic
-
DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation
Paper • 2311.07965 • Published • 1 -
CP-EB: Talking Face Generation with Controllable Pose and Eye Blinking Embedding
Paper • 2311.08673 • Published -
CLN-VC: Text-Free Voice Conversion Based on Fine-Grained Style Control and Contrastive Learning with Negative Samples Augmentation
Paper • 2311.08670 • Published -
Stock Volatility Prediction Based on Transformer Model Using Mixed-Frequency Data
Paper • 2309.16196 • Published