Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
yamatazen 's Collections
GGUF tools
AGI
Model merging
Multilingual LLMs
Japanese LLMs
AI censorship
LLM leaderboards
Grokking

Grokking

updated Jun 27
Upvote
1

  • Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

    Paper • 2405.15071 • Published May 23, 2024 • 42

  • Grokking at the Edge of Numerical Stability

    Paper • 2501.04697 • Published Jan 8 • 2

  • Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test

    Paper • 2506.21551 • Published Jun 26 • 28
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs