Kalle Hilsenbek

Bachstelze

AI & ML interests

Combining BERT with instructions for explainable AI: gitlab.com/Bachstelze/instructionbert

Recent Activity

Organizations

BabyLM Challenge's profile picture

Bachstelze's activity

commented on Announcing AI Energy Score Ratings 3 months ago
view reply

Thanks for your effort in energy efficiency. You worked up my curiosity!
Why do smolLM-135m and smolLm-1.7B nearly have the same score besides a 10 times model size difference? Does the identical context size mostly cause it?
Could you please enable encoder-decoder models? They should be in theory more efficient because the input has to be encoded only once and can be reused in every decoding step.

upvoted an article 3 months ago
view article
Article

Is Attention Interpretable in Transformer-Based Large Language Models? Let’s Unpack the Hype

4