HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems Paper • 2411.02959 • Published Nov 5, 2024 • 71
Article From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate By muellerzr and 3 others • Jun 13, 2024 • 54
Article Our Transformers Code Agent beats the GAIA benchmark! By m-ric and 1 other • Jul 1, 2024 • 87
Article BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S*⚡ By xhluca • Jul 9, 2024 • 53
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning Paper • 2406.08973 • Published Jun 13, 2024 • 90
Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts Paper • 2405.11273 • Published May 18, 2024 • 17
WildChat: 1M ChatGPT Interaction Logs in the Wild Paper • 2405.01470 • Published May 2, 2024 • 63
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework Paper • 2404.14619 • Published Apr 22, 2024 • 128
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale Paper • 2211.07636 • Published Nov 14, 2022 • 1