Generate text responses with various agents
Search arXiv papers and read them aloud with a TTS voice
Generate spoken audio from text
Apply style transfer to audio using YouTube links