SwallowMath

updated 3 days ago

Rewriting Pre-Training Data Boosts LLM Performance in Math and Code

  • tokyotech-llm/swallow-math

    Viewer • Updated 3 days ago • 4.33M • 1.11k • 9

  • tokyotech-llm/Llama-3.1-8B-math-ablation-exp2-LR2.5e-5-WD0.1-iter0002500

    Updated 3 days ago • 1

  • tokyotech-llm/Llama-3.1-8B-math-ablation-exp2-LR2.5e-5-WD0.1-iter0005000

    Updated 3 days ago

  • tokyotech-llm/Llama-3.1-8B-math-ablation-exp2-LR2.5e-5-WD0.1-iter0007500

    Updated 3 days ago

  • tokyotech-llm/Llama-3.1-8B-math-ablation-exp2-LR2.5e-5-WD0.1-iter0010000

    Updated 3 days ago • 2

  • tokyotech-llm/Llama-3.1-8B-math-ablation-exp2-LR2.5e-5-WD0.1-iter0012500

    Updated 3 days ago • 7

  • tokyotech-llm/Llama-3.1-8B-math-ablation-exp1-LR2.5e-5-WD0.1-iter0002500

    Updated 3 days ago • 8

  • tokyotech-llm/Llama-3.1-8B-math-ablation-exp1-LR2.5e-5-WD0.1-iter0005000

    Updated 3 days ago • 8

  • tokyotech-llm/Llama-3.1-8B-math-ablation-exp1-LR2.5e-5-WD0.1-iter0007500

    Updated 3 days ago • 7

  • tokyotech-llm/Llama-3.1-8B-math-ablation-exp1-LR2.5e-5-WD0.1-iter0010000

    Updated 3 days ago • 8

  • tokyotech-llm/Llama-3.1-8B-math-ablation-exp1-LR2.5e-5-WD0.1-iter0012500

    Updated 3 days ago • 10
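
The checkpoint names above follow a regular pattern. As a minimal sketch, the snippet below splits such a name into its parts, assuming (this is not stated on the page) that `exp` is an experiment index, `LR` a learning rate, `WD` a weight decay, and `iter` a training iteration. The `Checkpoint` type and `parse_checkpoint_name` helper are hypothetical names introduced only for illustration.

```python
import re
from typing import NamedTuple


class Checkpoint(NamedTuple):
    """Fields read off a repository name; the meanings are assumptions."""
    experiment: int       # assumed: ablation experiment index (exp1, exp2)
    learning_rate: float  # assumed: the value after "LR"
    weight_decay: float   # assumed: the value after "WD"
    iteration: int        # assumed: training iteration of the checkpoint


# Matches the tail of names such as
# "...-math-ablation-exp2-LR2.5e-5-WD0.1-iter0002500".
_NAME_PATTERN = re.compile(
    r"math-ablation-exp(?P<exp>\d+)"
    r"-LR(?P<lr>\d+(?:\.\d+)?(?:e-?\d+)?)"
    r"-WD(?P<wd>\d+(?:\.\d+)?)"
    r"-iter(?P<it>\d+)$"
)


def parse_checkpoint_name(repo_id: str) -> Checkpoint:
    """Parse a repository id from the list above into its numeric parts."""
    match = _NAME_PATTERN.search(repo_id)
    if match is None:
        raise ValueError(f"unrecognized checkpoint name: {repo_id!r}")
    return Checkpoint(
        experiment=int(match.group("exp")),
        learning_rate=float(match.group("lr")),
        weight_decay=float(match.group("wd")),
        iteration=int(match.group("it")),  # int() drops the zero padding
    )


if __name__ == "__main__":
    name = (
        "tokyotech-llm/"
        "Llama-3.1-8B-math-ablation-exp2-LR2.5e-5-WD0.1-iter0002500"
    )
    print(parse_checkpoint_name(name))
```

Parsing the names this way makes it straightforward to sort or group the ten checkpoints by experiment and iteration before comparing them.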