Certified Mitigation of Worst-Case LLM Copyright Infringement Paper β’ 2504.16046 β’ Published Apr 22 β’ 13
Beyond RAG: Task-Aware KV Cache Compression for Comprehensive Knowledge Reasoning Paper β’ 2503.04973 β’ Published Mar 6 β’ 24
Rank1: Test-Time Compute for Reranking in Information Retrieval Paper β’ 2502.18418 β’ Published Feb 25 β’ 27
MMTEB: Massive Multilingual Text Embedding Benchmark Paper β’ 2502.13595 β’ Published Feb 19 β’ 36
Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering Paper β’ 2502.13962 β’ Published Feb 19 β’ 29
The GAN is dead; long live the GAN! A Modern GAN Baseline Paper β’ 2501.05441 β’ Published Jan 9 β’ 93
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper β’ 2412.13663 β’ Published Dec 18, 2024 β’ 150
Compressed Chain of Thought: Efficient Reasoning Through Dense Representations Paper β’ 2412.13171 β’ Published Dec 17, 2024 β’ 36
Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements Paper β’ 2410.08968 β’ Published Oct 11, 2024 β’ 14
Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models Paper β’ 2409.11136 β’ Published Sep 17, 2024 β’ 24