Arcee's MergeKit: A Toolkit for Merging Large Language Models Paper • 2403.13257 • Published Mar 20, 2024 • 20 • 4
From Tools to Teammates: Evaluating LLMs in Multi-Session Coding Interactions Paper • 2502.13791 • Published Feb 19 • 5 • 3
Linking In-context Learning in Transformers to Human Episodic Memory Paper • 2405.14992 • Published May 23, 2024 • 1 • 3
Bridging the Data Provenance Gap Across Text, Speech and Video Paper • 2412.17847 • Published Dec 19, 2024 • 9 • 2
Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models Paper • 2412.02980 • Published Dec 4, 2024 • 15 • 3
$\infty$Bench: Extending Long Context Evaluation Beyond 100K Tokens Paper • 2402.13718 • Published Feb 21, 2024 • 1 • 2
Consent in Crisis: The Rapid Decline of the AI Data Commons Paper • 2407.14933 • Published Jul 20, 2024 • 12 • 3
The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI Paper • 2310.16787 • Published Oct 25, 2023 • 5 • 2
Data Authenticity, Consent, & Provenance for AI are all broken: what will it take to fix them? Paper • 2404.12691 • Published Apr 19, 2024 • 2 • 2
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models Paper • 2404.18796 • Published Apr 29, 2024 • 71 • 3
Enabling Natural Zero-Shot Prompting on Encoder Models via Statement-Tuning Paper • 2404.12897 • Published Apr 19, 2024 • 2 • 2
Augmenting Language Models with Long-Term Memory Paper • 2306.07174 • Published Jun 12, 2023 • 18 • 5
Ring Attention with Blockwise Transformers for Near-Infinite Context Paper • 2310.01889 • Published Oct 3, 2023 • 13 • 3
Self-attention Does Not Need $O(n^2)$ Memory Paper • 2112.05682 • Published Dec 10, 2021 • 3 • 2
VisionLLaMA: A Unified LLaMA Interface for Vision Tasks Paper • 2403.00522 • Published Mar 1, 2024 • 47 • 4