ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing Paper • 2412.14711 • Published Dec 19, 2024 • 16
facebook/wav2vec2-large-960h-lv60-self Automatic Speech Recognition • Updated May 23, 2022 • 227k • 143
Aria: An Open Multimodal Native Mixture-of-Experts Model Paper • 2410.05993 • Published Oct 8, 2024 • 108
Article • Introducing RWKV - An RNN with the advantages of a transformer • May 15, 2023 • 14