Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
BinghengWu's picture
3 6 10

BinghengWu

wubingheng
minpeter's profile picture yangfa's profile picture JingzeShi's profile picture
·
https://github.com/wubingheng111
  • HangWu19938
  • wubingheng111

AI & ML interests

I like to fine-tune the small models of the Doge series.

Recent Activity

authored a paper about 1 month ago
Trainable Dynamic Mask Sparse Attention
upvoted an article about 1 month ago
Trainable Dynamic Mask Sparse Attention: Bridging Efficiency and Effectiveness in Long-Context Language Models
published an article about 1 month ago
Trainable Dynamic Mask Sparse Attention: Bridging Efficiency and Effectiveness in Long-Context Language Models
View all activity

Organizations

Hugging Face Party @ PyTorch Conference's profile picture Doge Face's profile picture

authored a paper about 1 month ago

Trainable Dynamic Mask Sparse Attention

Paper • 2508.02124 • Published Aug 4 • 16
authored a paper about 2 months ago

Concise Reasoning, Big Gains: Pruning Long Reasoning Trace with Difficulty-Aware Prompting

Paper • 2505.19716 • Published May 26 • 5
authored 2 papers 9 months ago

Cheems: Wonderful Matrices More Efficient and More Effective Architecture

Paper • 2407.16958 • Published Jul 24, 2024 • 4

Wonderful Matrices: Combining for a More Efficient and Effective Foundation Model Architecture

Paper • 2412.11834 • Published Dec 16, 2024 • 8
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs