AlbertShi

AI & ML interests

None yet

Recent Activity

reacted to JingzeShi's post with 🤗 about 1 month ago

Trainable selective sampling and sparse attention kernels are indispensable in the era of context engineering. We hope our work will be helpful to everyone! 🤗 https://huggingface.co/papers/2508.02124

upvoted an article 24 days ago

Trainable Dynamic Mask Sparse Attention: Bridging Efficiency and Effectiveness in Long-Context Language Models

Article • Aug 5 • 6

upvoted a paper about 1 month ago

Trainable Dynamic Mask Sparse Attention

Paper • 2508.02124 • Published Aug 4 • 16

upvoted a collection 5 months ago

🐕Small-Doges

Doge family of small language models! • 18 items • Updated Apr 21 • 10

upvoted a paper 7 months ago

Wonderful Matrices: Combining for a More Efficient and Effective Foundation Model Architecture

Paper • 2412.11834 • Published Dec 16, 2024 • 8

upvoted a collection 7 months ago

Doge

Doge family of small language models. • 12 items • Updated Mar 28 • 6