SwallowMath - a tokyotech-llm Collection

tokyotech-llm 's Collections

Llama-3.1-Swallow-v0.5

Gemma-2-Swallow

Llama-3.3-Swallow

Llama-3.1-Swallow

Llama-3-Swallow

Swallow

Swallow-instruct

Swallow-MS-instruct

SwallowMath

updated May 7

Rewriting Pre-Training Data Boosts LLM Performance in Math and Code

tokyotech-llm/swallow-math

Viewer • Updated May 10 • 4.33M • 1.95k • 26
tokyotech-llm/Llama-3.1-8B-math-ablation-exp2-LR2.5e-5-WD0.1-iter0002500

8B • Updated May 7 • 9
tokyotech-llm/Llama-3.1-8B-math-ablation-exp2-LR2.5e-5-WD0.1-iter0005000

8B • Updated May 7 • 8
tokyotech-llm/Llama-3.1-8B-math-ablation-exp2-LR2.5e-5-WD0.1-iter0007500

8B • Updated May 7 • 8
tokyotech-llm/Llama-3.1-8B-math-ablation-exp2-LR2.5e-5-WD0.1-iter0010000

8B • Updated May 7 • 26
tokyotech-llm/Llama-3.1-8B-math-ablation-exp2-LR2.5e-5-WD0.1-iter0012500

8B • Updated May 7 • 9
tokyotech-llm/Llama-3.1-8B-math-ablation-exp1-LR2.5e-5-WD0.1-iter0002500

8B • Updated May 7 • 10
tokyotech-llm/Llama-3.1-8B-math-ablation-exp1-LR2.5e-5-WD0.1-iter0005000

8B • Updated May 7 • 11
tokyotech-llm/Llama-3.1-8B-math-ablation-exp1-LR2.5e-5-WD0.1-iter0007500

8B • Updated May 7 • 8
tokyotech-llm/Llama-3.1-8B-math-ablation-exp1-LR2.5e-5-WD0.1-iter0010000

8B • Updated May 7 • 8
tokyotech-llm/Llama-3.1-8B-math-ablation-exp1-LR2.5e-5-WD0.1-iter0012500

8B • Updated May 7 • 8