arxiv:2501.12370
Vimal Thilak
vimalthilak
AI & ML interests
None yet
Recent Activity
authored a paper 2 days ago
Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models
upvoted a paper 3 months ago
Controlling Language and Diffusion Models by Transporting Activations
Organizations
None yet
Papers
1
Models
None public yet
Datasets
None public yet