Songlin Yang

sonta7
https://sustcsonglin.github.io/
  • SonglinYang4
  • sustcsonglin

AI & ML interests

None yet

Organizations

  • fla-hub
  • Gated Linear Attention (ICML'24)
  • sonta
  • HoPE

sonta7's activity

upvoted a paper 5 months ago

Gated Delta Networks: Improving Mamba2 with Delta Rule

Paper • 2412.06464 • Published Dec 9, 2024 • 11
upvoted 2 papers 8 months ago

A Controlled Study on Long Context Extension and Generalization in LLMs

Paper • 2409.12181 • Published Sep 18, 2024 • 45

Gated Slot Attention for Efficient Linear-Time Sequence Modeling

Paper • 2409.07146 • Published Sep 11, 2024 • 21
upvoted a paper 11 months ago

Parallelizing Linear Transformers with the Delta Rule over Sequence Length

Paper • 2406.06484 • Published Jun 10, 2024 • 3
upvoted a collection about 1 year ago

based

Collection
These language model checkpoints are trained at the 360M and 1.3Bn parameter scales for up to 50Bn tokens on the Pile corpus, for research purposes. • 15 items • Updated Oct 18, 2024 • 9