view article Article Making LLMs Smaller Without Breaking Them: A GLU-Aware Pruning Approach By oopere • Nov 24, 2024 • 10
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 By tomaarsen • Mar 26 • 161
rank1 Collection rank1 is the first test-time compute reasoning model in IR • 18 items • Updated Apr 8 • 4
Retentive Network: A Successor to Transformer for Large Language Models Paper • 2307.08621 • Published Jul 17, 2023 • 172