Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Aramis's picture
39 7

Aramis

amenur
21world's profile picture ltim's profile picture
·
  • amenur

AI & ML interests

None yet

Organizations

None yet

amenur 's collections 3

Attention
  • System 2 Attention (is something you might need too)

    Paper • 2311.11829 • Published Nov 20, 2023 • 43
  • Transformers are Multi-State RNNs

    Paper • 2401.06104 • Published Jan 11, 2024 • 39
  • The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

    Paper • 2402.17764 • Published Feb 27, 2024 • 620
Transformers
  • Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs

    Paper • 2403.20041 • Published Mar 29, 2024 • 35
Mamba+Transformers
  • Jamba: A Hybrid Transformer-Mamba Language Model

    Paper • 2403.19887 • Published Mar 28, 2024 • 111
Attention
  • System 2 Attention (is something you might need too)

    Paper • 2311.11829 • Published Nov 20, 2023 • 43
  • Transformers are Multi-State RNNs

    Paper • 2401.06104 • Published Jan 11, 2024 • 39
  • The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

    Paper • 2402.17764 • Published Feb 27, 2024 • 620
Mamba+Transformers
  • Jamba: A Hybrid Transformer-Mamba Language Model

    Paper • 2403.19887 • Published Mar 28, 2024 • 111
Transformers
  • Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs

    Paper • 2403.20041 • Published Mar 29, 2024 • 35
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs