view article Article FuseO1-Preview: System-II Reasoning Fusion of LLMs By Wanfq β’ 9 days ago β’ 10
Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models Paper β’ 2501.11873 β’ Published 9 days ago β’ 61
view article Article πΊπ¦ββ¬ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs By wolfram β’ Dec 4, 2024 β’ 76