Identify speakers in an audio file
Generate personalized images with a face preservation
Generate audio from text with voice synthesis