Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning Paper • 2502.06060 • Published 28 days ago • 34
GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression Paper • 2407.12077 • Published Jul 16, 2024 • 56