MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining Paper • 2505.07608 • Published May 12 • 80
Smoothie Qwen3 Collection For more details, please visit https://github.com/dnotitia/smoothie-qwen • 8 items • Updated 28 days ago • 5
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math Paper • 2504.21233 • Published Apr 30 • 47
TxGemma Release Collection Collection of open models to accelerate the development of therapeutics. • 5 items • Updated 28 days ago • 58
SYNTHETIC-1 Collection A collection of tasks & verifiers for reasoning datasets • 9 items • Updated 2 days ago • 62
Gemma-APS Release Collection Gemma models for text-to-propositions segmentation. The models are distilled from fine-tuned Gemini Pro model applied to multi-domain synthetic data. • 3 items • Updated 28 days ago • 22
Gemma 2 JPN Release Collection A Gemma 2 2B model fine-tuned on Japanese text. It supports the Japanese language the same level of performance of EN only queries on Gemma 2. • 3 items • Updated 28 days ago • 29
DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated 28 days ago • 86
Reranking Model Collection A collection of Korean-specific reranking models • 2 items • Updated Aug 16, 2024 • 3