Large Language Models
audio-arena
Generate text responses to chat messages
VoxCPM
Generate answers from images and text