Gausson Tschen
Gausson
AI & ML interests
LLM Architecture, Pre-training, Deep Neural Network Optimization, Sparsity
Recent Activity
upvoted a paper about 23 hours ago
UniPool: A Globally Shared Expert Pool for Mixture-of-Experts
updated a model 9 months ago
Gausson/sep_cache