Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)
Joya Chen PRO
chenjoya
AI & ML interests
Video LLM
Recent Activity
liked
a dataset
about 19 hours ago
keithito/lj_speech
liked
a model
about 20 hours ago
bosonai/higgs-audio-v2-generation-3B-base
liked
a model
22 days ago
google/gemma-3n-E4B-it