contains 3 state-of-the-art models
Transcribe audio to text in Eastern languages
Create and launch a voice synthesis interface
Visualize camera simulations and E.T. datasets
Generate a video animating a source image to match a given audio