1 17 6

Abdulhakeem Adefioye

kokolamba

AI & ML interests

None yet

Recent Activity

upvoted a paper about 15 hours ago

Estimating Knowledge in Large Language Models Without Generating a Single Token

updated a dataset 13 days ago

kokolamba/keen_popqa_gpt2xl_generations

published a dataset 13 days ago

kokolamba/keen_popqa_gpt2xl_generations

View all activity

Organizations

upvoted a paper about 15 hours ago

Estimating Knowledge in Large Language Models Without Generating a Single Token

Paper • 2406.12673 • Published Jun 18, 2024 • 9

updated a dataset 13 days ago

kokolamba/keen_popqa_gpt2xl_generations

Viewer • Updated 13 days ago • 19.2k • 12

published a dataset 13 days ago

kokolamba/keen_popqa_gpt2xl_generations

Viewer • Updated 13 days ago • 19.2k • 12

upvoted a collection about 1 month ago

LMEnt

Collection

14 items • Updated Sep 14 • 6

upvoted 2 papers 2 months ago

ReplaceMe: Network Simplification via Layer Pruning and Linear Transformations

Paper • 2505.02819 • Published May 5 • 26

Share Your Attention: Transformer Weight Sharing via Matrix-based Dictionary Learning

Paper • 2508.04581 • Published Aug 6 • 5

updated a model 2 months ago

kokolamba/moe-mha

Updated Oct 19 • 3

published a model 2 months ago

kokolamba/moe-mha

Updated Oct 19 • 3

updated a model 2 months ago

kokolamba/moe-kv-128

Updated Oct 19 • 8

published a model 2 months ago

kokolamba/moe-kv-128

Updated Oct 19 • 8

updated a model 2 months ago

kokolamba/moe-o-192

Updated Oct 19 • 12

published a model 2 months ago

kokolamba/moe-o-192

Updated Oct 19 • 12

upvoted an article 3 months ago

Article

Sparse Mixture of Experts Language Model from Scratch: Extending makeMoE with Expert Capacity

Mar 18, 2024

•

updated 7 models 3 months ago

Abdulhakeem Adefioye

AI & ML interests

Recent Activity

Organizations

kokolamba's activity

Sparse Mixture of Experts Language Model from Scratch: Extending makeMoE with Expert Capacity