DataComp-LM: In search of the next generation of training sets for language models Paper β’ 2406.11794 β’ Published Jun 17, 2024 β’ 51
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model Paper β’ 2405.04434 β’ Published May 7, 2024 β’ 14
mlx-community/CodeLlama-70b-Instruct-hf-4bit-MLX Text Generation β’ Updated Jan 30, 2024 β’ 18 β’ 25
DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing Paper β’ 2312.07409 β’ Published Dec 12, 2023 β’ 22
SoTaNa: The Open-Source Software Development Assistant Paper β’ 2308.13416 β’ Published Aug 25, 2023 β’ 11
OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models Paper β’ 2308.13137 β’ Published Aug 25, 2023 β’ 17