tokyotech-llm's Collections
SwallowMath
Updated 3 days ago
Rewriting Pre-Training Data Boosts LLM Performance in Math and Code
tokyotech-llm/swallow-math • Updated 3 days ago • 4.33M • 1.11k • 9
tokyotech-llm/Llama-3.1-8B-math-ablation-exp2-LR2.5e-5-WD0.1-iter0002500 • Updated 3 days ago • 1
tokyotech-llm/Llama-3.1-8B-math-ablation-exp2-LR2.5e-5-WD0.1-iter0005000 • Updated 3 days ago
tokyotech-llm/Llama-3.1-8B-math-ablation-exp2-LR2.5e-5-WD0.1-iter0007500 • Updated 3 days ago
tokyotech-llm/Llama-3.1-8B-math-ablation-exp2-LR2.5e-5-WD0.1-iter0010000 • Updated 3 days ago • 2
tokyotech-llm/Llama-3.1-8B-math-ablation-exp2-LR2.5e-5-WD0.1-iter0012500 • Updated 3 days ago • 7
tokyotech-llm/Llama-3.1-8B-math-ablation-exp1-LR2.5e-5-WD0.1-iter0002500 • Updated 3 days ago • 8
tokyotech-llm/Llama-3.1-8B-math-ablation-exp1-LR2.5e-5-WD0.1-iter0005000 • Updated 3 days ago • 8
tokyotech-llm/Llama-3.1-8B-math-ablation-exp1-LR2.5e-5-WD0.1-iter0007500 • Updated 3 days ago • 7
tokyotech-llm/Llama-3.1-8B-math-ablation-exp1-LR2.5e-5-WD0.1-iter0010000 • Updated 3 days ago • 8
tokyotech-llm/Llama-3.1-8B-math-ablation-exp1-LR2.5e-5-WD0.1-iter0012500 • Updated 3 days ago • 10