
mengfanxu

fxmeng
https://fxmeng.github.io
  • fxmeng

AI & ML interests

None yet

Recent Activity

liked a dataset 13 days ago
nvidia/Llama-Nemotron-VLM-Dataset-v1
authored a paper 13 days ago
TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill & Decode Inference
commented on a paper 14 days ago
TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill & Decode Inference

Organizations

None yet

commented on a paper 14 days ago

TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill & Decode Inference

Paper • 2508.15881 • Published 18 days ago • 8 •
2
commented 3 papers 7 months ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11 • 57 •
9

New activity in MMMU/MMMU almost 2 years ago

Question about "Text as Input"

#4 opened almost 2 years ago by
fxmeng