
mengfanxu

fxmeng
https://fxmeng.github.io
  • fxmeng

AI & ML interests

None yet

Recent Activity

liked a dataset about 1 month ago
nvidia/Llama-Nemotron-VLM-Dataset-v1
authored a paper about 1 month ago
TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill & Decode Inference
commented on a paper about 1 month ago
TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill & Decode Inference

Organizations

None yet

authored a paper about 1 month ago

TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill & Decode Inference

Paper • 2508.15881 • Published Aug 21 • 8
authored 2 papers 8 months ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11 • 57

CLOVER: Constrained Learning with Orthonormal Vectors for Eliminating Redundancy

Paper • 2411.17426 • Published Nov 26, 2024
authored a paper over 1 year ago

PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models

Paper • 2404.02948 • Published Apr 3, 2024 • 2