-
No More Adam: Learning Rate Scaling at Initialization is All You Need
Paper • 2412.11768 • Published • 44 -
SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training
Paper • 2501.06842 • Published • 16 -
The GAN is dead; long live the GAN! A Modern GAN Baseline
Paper • 2501.05441 • Published • 93
nDimensional
nDimensional
AI & ML interests
Computer Vision, Diffusers, Transformers, ML, NLP, Diffusion Models, Unsupervised Learning, JAX, Neural Networks
Recent Activity
upvoted
a
paper
about 2 hours ago
MMSearch-R1: Incentivizing LMMs to Search
liked
a Space
about 2 hours ago
ilcve21/Sparc3D
liked
a Space
about 2 hours ago
Qwen/Qwen3-Demo
Organizations
None yet