Interact with an agent to complete web-based tasks
Transcribe audio from microphone, files, or YouTube