Optimizing LLMs for Italian: Reducing Token Fertility and Enhancing Efficiency Through Vocabulary Adaptation Paper β’ 2504.17025 β’ Published 18 days ago β’ 16
PROMPTEVALS: A Dataset of Assertions and Guardrails for Custom Production Large Language Model Pipelines Paper β’ 2504.14738 β’ Published 21 days ago β’ 5
Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction Paper β’ 2504.15266 β’ Published 20 days ago β’ 3
SilVar-Med: A Speech-Driven Visual Language Model for Explainable Abnormality Detection in Medical Imaging Paper β’ 2504.10642 β’ Published 27 days ago β’ 2
EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models Paper β’ 2504.15133 β’ Published 21 days ago β’ 21
BookWorld: From Novels to Interactive Agent Societies for Creative Story Generation Paper β’ 2504.14538 β’ Published 22 days ago β’ 28
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning Paper β’ 2504.17192 β’ Published 18 days ago β’ 108
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. β’ 46 items β’ Updated 13 days ago β’ 608
π§ Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community β’ 23 items β’ Updated 4 days ago β’ 137
Big-Math Collection This collection contains assets associated with the Big-Math dataset, a high-quality collection of over 250,000 math questions with verifiable answers β’ 4 items β’ Updated 26 days ago β’ 4
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data Paper β’ 2309.11235 β’ Published Sep 20, 2023 β’ 15
Recent models: last 100 repos, sorted by creation date Collection The last 100 repos I have created. Sorted by creation date descending, so the most recently created repos appear at the top. β’ 121 items β’ Updated Jan 31, 2024 β’ 534