Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Lab of Large Audio Model

community
https://largeaudiomodel.com/
Activity Feed Request to join this org

AI & ML interests

Large Audio Model、Text to Speech (TTS)、Voice Conversion、Talking Face、Music AI、Speech Security、Infant Acoustic

Xulong Zhang's profile picture Greg's profile picture wave's profile picture Niucheng's profile picture

llam 's collections 1

Papers
  • DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation

    Paper • 2311.07965 • Published Nov 14, 2023 • 1
  • CP-EB: Talking Face Generation with Controllable Pose and Eye Blinking Embedding

    Paper • 2311.08673 • Published Nov 15, 2023
  • CLN-VC: Text-Free Voice Conversion Based on Fine-Grained Style Control and Contrastive Learning with Negative Samples Augmentation

    Paper • 2311.08670 • Published Nov 15, 2023
  • Stock Volatility Prediction Based on Transformer Model Using Mixed-Frequency Data

    Paper • 2309.16196 • Published Sep 28, 2023
Papers
  • DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation

    Paper • 2311.07965 • Published Nov 14, 2023 • 1
  • CP-EB: Talking Face Generation with Controllable Pose and Eye Blinking Embedding

    Paper • 2311.08673 • Published Nov 15, 2023
  • CLN-VC: Text-Free Voice Conversion Based on Fine-Grained Style Control and Contrastive Learning with Negative Samples Augmentation

    Paper • 2311.08670 • Published Nov 15, 2023
  • Stock Volatility Prediction Based on Transformer Model Using Mixed-Frequency Data

    Paper • 2309.16196 • Published Sep 28, 2023
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs