WAON
WAON: Large-Scale and High-Quality Japanese Image-Text Pair Dataset for Vision-Language Models Paper • 2510.22276 • Published 19 days ago • 3
llm-jp/WAON-Bench Viewer • Updated 16 days ago • 1.87k • 240
llm-jp/waon-siglip2-base-patch16-256 Zero-Shot Image Classification • 0.4B • Updated 11 days ago • 15 • 1
llm-jp/WAON Updated 8 days ago • 356 • 6
Optimal Sparsity Code
Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks
llm-jp/optimal-sparsity-code-d512-E8-k2-320M-A170M Text Generation • 0.3B • Updated Sep 3 • 4
llm-jp/optimal-sparsity-code-d512-E16-k2-520M-A170M Text Generation • 0.5B • Updated Sep 3 • 3
llm-jp/optimal-sparsity-code-d512-E32-k2-920M-A170M Text Generation • 0.9B • Updated Sep 3 • 4
llm-jp/optimal-sparsity-code-d512-E64-k2-1.7B-A170M Text Generation • 2B • Updated Sep 3 • 3
LLM-jp-3.1 Fine-tuned Models
Fine-tuned models in the LLM-jp-3 model series
llm-jp/llm-jp-3.1-8x13b-instruct4 Text Generation • 73B • Updated May 30 • 120 • 4
llm-jp/llm-jp-3.1-13b-instruct4 Text Generation • 14B • Updated May 30 • 7.35k • 14
llm-jp/llm-jp-3.1-1.8b-instruct4 Text Generation • 2B • Updated May 30 • 18.1k • 12
Open Japanese LLM leaderboard
Open Japanese LLM Leaderboard 🌸 Explore and compare LLM models with interactive filters and visualizations • Running on CPU Upgrade • 101
llm-jp/leaderboard-requests Viewer • Updated 21 days ago • 3 • 23.8k • 2
llm-jp/leaderboard-contents Viewer • Updated 21 days ago • 862 • 9.46k • 1
llm-jp/leaderboard-results Updated 21 days ago • 13.6k • 1
Drop-Upcycling
llm-jp/FS-8x1.5B 9B • Updated Feb 27 • 1
llm-jp/BTX-8x1.5B 9B • Updated Feb 27
llm-jp/FS-8x3.7B 19B • Updated Feb 27 • 2
llm-jp/NU-8x1.5B 9B • Updated Feb 27
LLM-jp-3.1 Pre-trained Models
Pre-trained models in the LLM-jp-3.1 model series
llm-jp/llm-jp-3.1-8x13b Text Generation • 73B • Updated May 30 • 11
llm-jp/llm-jp-3.1-13b Text Generation • 14B • Updated May 30 • 1.97k • 2
llm-jp/llm-jp-3.1-1.8b Text Generation • 2B • Updated May 30 • 1.17k • 6
LLM-jp ver2.0 Models
Models in the LLM-jp ver2.0 model series
llm-jp/llm-jp-13b-v2.0 Text Generation • Updated Apr 30, 2024 • 178 • 15
llm-jp/llm-jp-13b-instruct-full-dolly-ichikara_004_001_single-oasst-oasst2-v2.0 Text Generation • 14B • Updated Apr 30, 2024 • 302
llm-jp/llm-jp-13b-instruct-full-ac_001-dolly-ichikara_004_001_single-oasst-oasst2-v2.0 Text Generation • 14B • Updated Apr 30, 2024 • 232 • 1
llm-jp/llm-jp-13b-instruct-full-ac_001_16x-dolly-ichikara_004_001_single-oasst-oasst2-v2.0 Text Generation • 14B • Updated Apr 30, 2024 • 285 • 3
LLM-jp ver1.0 Models
Models in the LLM-jp ver1.0 model series
llm-jp/llm-jp-13b-v1.0 Text Generation • Updated Oct 20, 2023 • 1.21k • 41
llm-jp/llm-jp-13b-instruct-full-jaster-v1.0 Text Generation • Updated Oct 20, 2023 • 1.07k • 15
llm-jp/llm-jp-13b-instruct-full-jaster-dolly-oasst-v1.0 Text Generation • Updated Oct 20, 2023 • 1.14k • 8
llm-jp/llm-jp-13b-instruct-full-dolly-oasst-v1.0 Text Generation • Updated Oct 20, 2023 • 1.05k • 4
Llama-Mimi
Llama-Mimi: Speech Language Models with Interleaved Semantic and Acoustic Tokens Paper • 2509.14882 • Published Sep 18 • 1
llm-jp/Llama-Mimi-1.3B Audio-to-Audio • 1B • Updated Oct 2 • 120 • 6
llm-jp/Llama-Mimi-8B Audio-to-Audio • 8B • Updated Sep 19 • 25 • 8
Optimal Sparsity Math
Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks
llm-jp/optimal-sparsity-math-d512-E8-k2-320M-A170M Text Generation • 0.3B • Updated Sep 3 • 3
llm-jp/optimal-sparsity-math-d512-E16-k2-520M-A170M Text Generation • 0.5B • Updated Sep 3 • 4
llm-jp/optimal-sparsity-math-d512-E32-k2-920M-A170M Text Generation • 0.9B • Updated Sep 3 • 4
llm-jp/optimal-sparsity-math-d512-E64-k2-1.7B-A170M Text Generation • 2B • Updated Sep 3 • 4
LLM-jp-3 Fine-tuned Models
Fine-tuned models in the LLM-jp-3 model series
llm-jp/llm-jp-3-8x13b-instruct3 Text Generation • 73B • Updated Apr 1 • 118 • 8
llm-jp/llm-jp-3-172b-instruct3 Text Generation • 172B • Updated Jan 20 • 19 • 10
llm-jp/llm-jp-3-13b-instruct3 Text Generation • 14B • Updated Feb 4 • 103 • 8
llm-jp/llm-jp-3-8x1.8b-instruct3 Text Generation • 9B • Updated Apr 1 • 19 • 3
Multi Modal Models
llm-jp/llm-jp-3-vila-14b Image-Text-to-Text • Updated Nov 18, 2024 • 35 • 11
llm-jp/llm-jp-clip-vit-base-patch16 Zero-Shot Image Classification • Updated Apr 30 • 91 • 1
llm-jp/llm-jp-clip-vit-large-patch14 Zero-Shot Image Classification • Updated Apr 30 • 69 • 2
llm-jp/relaion2B-en-research-safe-japanese-translation Viewer • Updated Apr 30 • 2.1B • 252 • 3
Sparse Autoencoders
llm-jp/llm-jp-3-1.8b-sae-l12-k32-16x-c988240 0.1B • Updated Mar 12 • 3
llm-jp/llm-jp-3-1.8b-sae-l12-k32-16x-c100000 Updated Mar 18
llm-jp/llm-jp-3-1.8b-sae-l12-k32-16x-c10000 Updated Mar 18
llm-jp/llm-jp-3-1.8b-sae-l12-k32-16x-c1000 Updated Mar 18
LLM-jp-3 Pre-trained Models
Pre-trained models in the LLM-jp-3 model series
llm-jp/llm-jp-3-8x13b Text Generation • 73B • Updated Mar 27 • 2.16k
llm-jp/llm-jp-3-172b Text Generation • 172B • Updated Dec 23, 2024 • 4
llm-jp/llm-jp-3-8x1.8b Text Generation • 9B • Updated Mar 27
llm-jp/llm-jp-3-13b Text Generation • 14B • Updated Sep 26, 2024 • 7.43k • 13
LLM-jp ver1.1 Models
Models in the LLM-jp ver1.1 model series
llm-jp/llm-jp-13b-dpo-lora-hh_rlhf_ja-v1.1 Text Generation • Updated Mar 12, 2024 • 1
llm-jp/llm-jp-13b-instruct-full-dolly_en-dolly_ja-ichikara_003_001-oasst_en-oasst_ja-v1.1 Text Generation • 13B • Updated Feb 7, 2024 • 654 • 2
llm-jp/llm-jp-13b-instruct-lora-dolly_en-dolly_ja-ichikara_003_001-oasst_en-oasst_ja-v1.1 Text Generation • Updated Mar 12, 2024 • 1