Multimodal Autoregressive Pre-training of Large Vision Encoders Paper β’ 2411.14402 β’ Published Nov 21, 2024 β’ 47
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders Paper β’ 2408.15998 β’ Published Aug 28, 2024 β’ 88
Text2SQL is Not Enough: Unifying AI and Databases with TAG Paper β’ 2408.14717 β’ Published Aug 27, 2024 β’ 27
Attention Overflow: Language Model Input Blur during Long-Context Missing Items Recommendation Paper β’ 2407.13481 β’ Published Jul 18, 2024 β’ 10
Fast Matrix Multiplications for Lookup Table-Quantized LLMs Paper β’ 2407.10960 β’ Published Jul 15, 2024 β’ 13
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities Paper β’ 2407.14482 β’ Published Jul 19, 2024 β’ 27
Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study Paper β’ 2406.07057 β’ Published Jun 11, 2024 β’ 17
E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS Paper β’ 2406.18009 β’ Published Jun 26, 2024 β’ 23
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B Paper β’ 2406.07394 β’ Published Jun 11, 2024 β’ 30
Improve Mathematical Reasoning in Language Models by Automated Process Supervision Paper β’ 2406.06592 β’ Published Jun 5, 2024 β’ 30
Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models Paper β’ 2406.06563 β’ Published Jun 3, 2024 β’ 20
The Prompt Report: A Systematic Survey of Prompting Techniques Paper β’ 2406.06608 β’ Published Jun 6, 2024 β’ 68
ShareGPT4Video: Improving Video Understanding and Generation with Better Captions Paper β’ 2406.04325 β’ Published Jun 6, 2024 β’ 76
Mixture-of-Agents Enhances Large Language Model Capabilities Paper β’ 2406.04692 β’ Published Jun 7, 2024 β’ 60
Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning Paper β’ 2406.06469 β’ Published Jun 10, 2024 β’ 30
Husky-v1 Collection A unified language agent that addresses numerical, tabular and knowledge-based reasoning tasks. β’ 6 items β’ Updated Jun 11, 2024 β’ 8
mistralai_hackathon Collection Synthetic datasets and fine-tuned Mistral models used in MistralAI Hackathon β’ 21 items β’ Updated Feb 4 β’ 4