arxiv:2501.06282
zhihaodu
zhihaodu
AI & ML interests
Audio Generation, Audio Understanding, Speech Enhancement
Recent Activity
upvoted
a
paper
1 day ago
MinMo: A Multimodal Large Language Model for Seamless Voice Interaction
authored
a paper
4 days ago
MinMo: A Multimodal Large Language Model for Seamless Voice Interaction
liked
a dataset
about 1 month ago
laion/laions_got_talent