Matthias Seeger
mseeger
AI & ML interests
None yet
Recent Activity
new activity
about 11 hours ago
deepseek-ai/DeepSeek-V2: Exact computations for multi-head latent attention
new activity
about 2 months ago
Isotonic/gpt_neox_225M: hidden_size % num_attention_heads != 0
Organizations
None yet
models
None public yet
datasets
None public yet