The FinBen: An Holistic Financial Benchmark for Large Language Models Paper ā¢ 2402.12659 ā¢ Published Feb 20, 2024 ā¢ 21
Tiny Time Mixers (TTMs): Fast Pre-trained Models for Enhanced Zero/Few-Shot Forecasting of Multivariate Time Series Paper ā¢ 2401.03955 ā¢ Published Jan 8, 2024 ā¢ 7
Advancing Single- and Multi-task Text Classification through Large Language Model Fine-tuning Paper ā¢ 2412.08587 ā¢ Published Dec 11, 2024 ā¢ 1
Adaptive Margin Global Classifier for Exemplar-Free Class-Incremental Learning Paper ā¢ 2409.13275 ā¢ Published Sep 20, 2024 ā¢ 1
Protoformer: Embedding Prototypes for Transformers Paper ā¢ 2206.12710 ā¢ Published Jun 25, 2022 ā¢ 1
Overcoming catastrophic forgetting in neural networks Paper ā¢ 1612.00796 ā¢ Published Dec 2, 2016 ā¢ 1
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper ā¢ 2501.04519 ā¢ Published 10 days ago ā¢ 230
Chain of Code: Reasoning with a Language Model-Augmented Code Emulator Paper ā¢ 2312.04474 ā¢ Published Dec 7, 2023 ā¢ 31
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 ā¢ 9 items ā¢ Updated Nov 27, 2024 ā¢ 103
indic-evals Collection Translated versions of popular LLM benchmarks. ā¢ 4 items ā¢ Updated Oct 23, 2024 ā¢ 2
Writing in the Margins: Better Inference Pattern for Long Context Retrieval Paper ā¢ 2408.14906 ā¢ Published Aug 27, 2024 ā¢ 139
Fact, Fetch, and Reason: A Unified Evaluation of Retrieval-Augmented Generation Paper ā¢ 2409.12941 ā¢ Published Sep 19, 2024 ā¢ 23
Re-Reading Improves Reasoning in Language Models Paper ā¢ 2309.06275 ā¢ Published Sep 12, 2023 ā¢ 3
Training Language Models to Self-Correct via Reinforcement Learning Paper ā¢ 2409.12917 ā¢ Published Sep 19, 2024 ā¢ 136
view article Article Fine-tuning LLMs to 1.58bit: extreme quantization made easy Sep 18, 2024 ā¢ 216
Patched RTC: evaluating LLMs for diverse software development tasks Paper ā¢ 2407.16557 ā¢ Published Jul 23, 2024 ā¢ 1
Evaluating Pre-trained Language Models for Repairing API Misuses Paper ā¢ 2310.16390 ā¢ Published Oct 25, 2023 ā¢ 1