RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale Paper • 2505.03005 • Published 10 days ago • 27
Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 • 178
AM-RADIO: Agglomerative Model -- Reduce All Domains Into One Paper • 2312.06709 • Published Dec 10, 2023 • 2
RWKV-7 "Goose" with Expressive Dynamic State Evolution Paper • 2503.14456 • Published Mar 18 • 147
T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings Paper • 2406.19223 • Published Jun 27, 2024 • 11
OWLS: Scaling Laws for Speech Recognition and Translation Collection 🦉 A suite of Whisper-style models from 250M to 18B parameters. Trained on up to 360K hours of data. 16 kHz sampling rate. • 8 items • Updated 13 days ago • 6
DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning Paper • 2305.10005 • Published May 17, 2023 • 3
Eager Updates For Overlapped Communication and Computation in DiLoCo Paper • 2502.12996 • Published Feb 18 • 7
Deepseek Papers Collection Deepseek papers collection • 22 items • Updated about 20 hours ago • 196
QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation Paper • 2502.05178 • Published Feb 7 • 10
NEST: Self-supervised Fast Conformer as All-purpose Seasoning to Speech Processing Tasks Paper • 2408.13106 • Published Aug 23, 2024 • 1
Text to Speech (TTS) Collection Text to Speech (TTS) models compatible with txtai's TextToSpeech pipeline. • 7 items • Updated Jan 26 • 6
Emilia: A Large-Scale, Extensive, Multilingual, and Diverse Dataset for Speech Generation Paper • 2501.15907 • Published Jan 27 • 17
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps Paper • 2501.09732 • Published Jan 16 • 72
Sound Datasets Collection Sound Datasets for ASR/ASV or some other tasks • 12 items • Updated Aug 28, 2024 • 1