Article • Topic 33: Slim Attention, KArAt, XAttention and Multi-Token Attention Explained – What's Really Changing in Transformers? • Apr 4, 2025 • 16
Article • π0 and π0-FAST: Vision-Language-Action Models for General Robot Control • Feb 4, 2025 • 186
Article • ColPali: Efficient Document Retrieval with Vision Language Models • Jul 5, 2024 • 306
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 16 items • Updated 11 days ago • 242
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent Paper • 2402.09844 • Published Feb 15, 2024 • 21