Samuel Lima Braz

samuellimabraz

Recent Activity

updated a collection 1 day ago
Quantum Instruct
updated a dataset 3 days ago
tech4humans/clinc_oos_pt
commented on an article 5 days ago
KV Cache from scratch in nanoVLM

Organizations

Tech4Humans, Hugging Face Discord Community, Chunkr

samuellimabraz's activity

commented on KV Cache from scratch in nanoVLM 5 days ago

Thanks for this great article! I'm learning a lot from the nanoVLM project.
I'm not an expert in generative AI, but I noticed that the attention calculation example seems to be missing the 1/√(d_k) scaling factor. Is this intentional, for simplification?

import math

d_k = K.shape[-1]  # key dimension
attention_scores = (Q @ K.T) / math.sqrt(d_k)  # scaled dot-product scores

From my understanding, this scaling prevents the dot products from growing too large as d_k increases and keeps the softmax out of its saturated region, where gradients become very small.
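
To illustrate the point, here is a minimal sketch in plain Python that compares the softmax output with and without the scaling. The random query and keys, the choice of d_k = 64, the 8 keys, and the helper names `softmax` and `dot` are assumptions made for this illustration only; none of it is taken from the article or from nanoVLM.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


random.seed(0)  # fixed seed so repeated runs use the same vectors
d_k = 64        # assumed key dimension for this illustration
num_keys = 8    # assumed number of keys

# One query and several keys with unit-variance entries (illustrative data).
q = [random.gauss(0.0, 1.0) for _ in range(d_k)]
keys = [[random.gauss(0.0, 1.0) for _ in range(d_k)] for _ in range(num_keys)]

# Raw dot products have a variance that grows with d_k;
# dividing by sqrt(d_k) brings them back to roughly unit variance.
raw_scores = [dot(q, k) for k in keys]
scaled_scores = [s / math.sqrt(d_k) for s in raw_scores]

raw_probs = softmax(raw_scores)
scaled_probs = softmax(scaled_scores)

print("max prob without scaling:", max(raw_probs))
print("max prob with scaling:   ", max(scaled_probs))
```

Dividing the scores by √(d_k) acts like raising the softmax temperature, so the largest attention weight with scaling is never higher than without it, and the distribution stays less peaked.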

upvoted an article 27 days ago

Vision Language Models (Better, Faster, Stronger)

By merve and 4 others
423