Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Shivaen Ramshetty's picture
5 65

Shivaen Ramshetty

shivr
21world's profile picture
·
  • sramshetty

AI & ML interests

NLP, CV, Multimodal

Organizations

fastai X Hugging Face Group 2022's profile picture Aurora-M/MDEL's profile picture

commented 4 papers over 1 year ago

The Unreasonable Ineffectiveness of the Deeper Layers

Paper • 2403.17887 • Published Mar 26, 2024 • 81 •
14

The Unreasonable Ineffectiveness of the Deeper Layers

Paper • 2403.17887 • Published Mar 26, 2024 • 81 •
14

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect

Paper • 2403.03853 • Published Mar 6, 2024 • 66 •
21

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect

Paper • 2403.03853 • Published Mar 6, 2024 • 66 •
21
New activity in shivr/gpt2-xl_local-narratives-reduced-overlap_lora almost 2 years ago

Librarian Bot: Add base_model information to model

#1 opened almost 2 years ago by
librarian-bot
New activity in shivr/gpt2-xl_grit_and_local-narratives_lora almost 2 years ago

Librarian Bot: Add base_model information to model

#1 opened almost 2 years ago by
librarian-bot
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs