MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining Paper • 2505.07608 • Published May 12 • 81
Smoothie Qwen3 Collection For more details, please visit https://github.com/dnotitia/smoothie-qwen • 8 items • Updated May 30 • 5
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math Paper • 2504.21233 • Published Apr 30 • 48
TxGemma Release Collection Collection of open models to accelerate the development of therapeutics. • 5 items • Updated 7 days ago • 60
SYNTHETIC-1 Collection A collection of tasks & verifiers for reasoning datasets • 9 items • Updated 3 days ago • 62
Gemma-APS Release Collection Gemma models for text-to-propositions segmentation. The models are distilled from a fine-tuned Gemini Pro model applied to multi-domain synthetic data. • 3 items • Updated 7 days ago • 22
Gemma 2 JPN Release Collection A Gemma 2 2B model fine-tuned on Japanese text. It supports the Japanese language at the same level of performance as English-only queries on Gemma 2. • 3 items • Updated 7 days ago • 29
DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated 7 days ago • 86
Reranking Model Collection A collection of Korean-specific reranking models • 2 items • Updated 18 days ago • 3