Lemon Mint

lemon-mint


AI & ML interests

None yet

Recent Activity

liked a dataset 6 days ago

stepfun-ai/Step-3.5-Flash-SFT

liked a model about 2 months ago

arcee-ai/Trinity-Large-Preview

new activity 2 months ago

lemon-mint/gemma-ko-7b-it-v0.40:Update README.md

Organizations

upvoted a collection 10 months ago

Kanana-1.5

Open Source Kanana-1.5 • 16 items • Updated Dec 1, 2025 • 29

upvoted a paper 10 months ago

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12, 2025 • 82

upvoted 2 collections 11 months ago

Smoothie Qwen3

For more details, please visit https://github.com/dnotitia/smoothie-qwen • 9 items • Updated Jan 26 • 7

Mellum

Series of code models by JetBrains • 12 items • Updated Oct 1, 2025 • 37

upvoted a paper 11 months ago

Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

Paper • 2504.21233 • Published Apr 30, 2025 • 49

upvoted an article 11 months ago

PipelineRL

Article • Published Apr 25, 2025 • 43

upvoted a collection 12 months ago

TxGemma Release

Collection of open models to accelerate the development of therapeutics. • 5 items • Updated 9 days ago • 68

upvoted 5 collections about 1 year ago

R1-like Datasets

19 items • Updated May 27, 2025 • 6

Korean Reasoning Datasets 한국어 추론 데이터셋

4 items • Updated 19 days ago • 3

Korean Instructions 한국어 인스트럭션 데이터셋

3 items • Updated Feb 28, 2025 • 4

SYNTHETIC-1

A collection of tasks & verifiers for reasoning datasets • 9 items • Updated Oct 7, 2025 • 67

R1 Multilingual

5 items • Updated Jan 31, 2025 • 11

upvoted 8 collections over 1 year ago

Gemma-APS Release

Gemma models for text-to-propositions segmentation. The models are distilled from a fine-tuned Gemini Pro model applied to multi-domain synthetic data. • 3 items • Updated 9 days ago • 24

Gemma 2 JPN Release

A Gemma 2 2B model fine-tuned on Japanese text. It supports Japanese at the same level of performance as English-only queries on Gemma 2. • 3 items • Updated 9 days ago • 31

Molmo

Artifacts for open multimodal language models. • 5 items • Updated Dec 23, 2025 • 309

DataGemma Release

A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated 9 days ago • 87

DeepSeek-V2.5

2 items • Updated Nov 27, 2025 • 47

DeepSeek-V2

8 items • Updated Nov 27, 2025 • 35

Reranker Model

A collection of Korean-specific reranking models • 2 items • Updated Jul 19, 2025 • 3

Hermes 3

The Hermes 3 Series of Models • 11 items • Updated Sep 8, 2025 • 133