M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models
Paper • 2504.10449
Mixture of Experts, Branch Merge Train, International Cooperation, Reuse, https://github.com/ontocord/MDEL