Article Speculative Decoding for 2x Faster Whisper Inference By sanchit-gandhi • Dec 20, 2023 • 30
Light of Normals: Unified Feature Representation for Universal Photometric Stereo Paper • 2506.18882 • Published Jun 23 • 88
CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models Paper • 2411.18613 • Published Nov 27, 2024 • 59
Article Fine-Tune W2V2-Bert for low-resource ASR with 🤗 Transformers By ylacombe • Jan 19, 2024 • 39
Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models Paper • 2503.01774 • Published Mar 3 • 45
VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction Paper • 2501.01957 • Published Jan 3 • 48