VoxServe: Streaming-Centric Serving System for Speech Language Models Paper • 2602.00269 • Published Jan 30 • 6
DualTune: Decoupled Fine-Tuning for On-Device Agentic Systems Paper • 2510.00229 • Published Sep 30, 2025 • 1
Redundancy, Isotropy, and Intrinsic Dimensionality of Prompt-based Text Embeddings Paper • 2506.01435 • Published Jun 2, 2025
ConsumerBench: Benchmarking Generative AI Applications on End-User Devices Paper • 2506.17538 • Published Jun 21, 2025 • 7
TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval Paper • 2502.20969 • Published Feb 28, 2025 • 11
LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation Paper • 2502.20583 • Published Feb 27, 2025 • 13
kotoba-tech/kotoba-whisper-v2.2 Automatic Speech Recognition • 0.8B • Updated Oct 23, 2024 • 18.2k • 95
kotoba-tech/kotoba-whisper-v2.1 Automatic Speech Recognition • 0.8B • Updated Oct 23, 2024 • 2.63k • 19