Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Jiaming Tang's picture
5 13 14

Jiaming Tang

Sakits
21world's profile picture drkylj's profile picture SteveSHEN's profile picture
·
https://jiamingtang.me
  • jmtang42
  • Sakits

AI & ML interests

Efficient Machine Learning

Recent Activity

updated a model about 20 hours ago
Sakits/so101_kick_the_ball_goal_static_pi05_with_state_proj_async_32_random_state_cond_010000
published a model about 20 hours ago
Sakits/so101_kick_the_ball_goal_static_pi05_with_state_proj_async_32_random_state_cond_010000
updated a model 1 day ago
Sakits/so101_kick_the_ball_goal_static_pi05_with_state_proj_async_32_random_state_cond_002000
View all activity

Organizations

Massachusetts Institute of Technology's profile picture MIT HAN Lab's profile picture Efficient-Large-Model's profile picture hanlabrobot's profile picture

authored a paper 12 months ago

DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Paper • 2410.10819 • Published Oct 14, 2024 • 8
authored 3 papers about 1 year ago

AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Paper • 2306.00978 • Published Jun 1, 2023 • 11

Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference

Paper • 2406.10774 • Published Jun 16, 2024 • 3

OliVe: Accelerating Large Language Models via Hardware-friendly Outlier-Victim Pair Quantization

Paper • 2304.07493 • Published Apr 15, 2023 • 1
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs