view article Article Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques ๐ ๐ By Isayoften โข Aug 26, 2024 โข 68
Granite 3.0 Language Models Collection A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. โข 8 items โข Updated May 2 โข 97
Transformer Explainer: Interactive Learning of Text-Generative Models Paper โข 2408.04619 โข Published Aug 8, 2024 โข 161
Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts Paper โข 2406.12034 โข Published Jun 17, 2024 โข 16
Granite Code Models: A Family of Open Foundation Models for Code Intelligence Paper โข 2405.04324 โข Published May 7, 2024 โข 23
Granite Code Models Collection A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. โข 23 items โข Updated May 2 โข 194