IA2: Alignment with ICL Activations Improves Supervised Fine-Tuning Paper • 2509.22621 • Published 12 days ago • 8
The Flaw of Averages: Quantifying Uniformity of Performance on Benchmarks Paper • 2509.25671 • Published 9 days ago • 6
mmBERT: A Modern Multilingual Encoder with Annealed Language Learning Paper • 2509.06888 • Published about 1 month ago • 12
The Trickle-down Impact of Reward (In-)consistency on RLHF Paper • 2309.16155 • Published Sep 28, 2023 • 1
Jailbreak Distillation: Renewable Safety Benchmarking Paper • 2505.22037 • Published May 28 • 1
Jointly Reinforcing Diversity and Quality in Language Model Generations Paper • 2509.02534 • Published Sep 2 • 25
Seq vs Seq: An Open Suite of Paired Encoders and Decoders Paper • 2507.11412 • Published Jul 15 • 28
The Translation Barrier Hypothesis: Multilingual Generation with Large Language Models Suffers from Implicit Translation Failure Paper • 2506.22724 • Published Jun 28 • 10
Medical World Model: Generative Simulation of Tumor Evolution for Treatment Planning Paper • 2506.02327 • Published Jun 2 • 20
Feedback Friction: LLMs Struggle to Fully Incorporate External Feedback Paper • 2506.11930 • Published Jun 13 • 54
BiomedSQL: Text-to-SQL for Scientific Reasoning on Biomedical Knowledge Bases Paper • 2505.20321 • Published May 23 • 5
Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find Paper • 2505.18148 • Published May 23 • 5
Certified Mitigation of Worst-Case LLM Copyright Infringement Paper • 2504.16046 • Published Apr 22 • 13
ICL CIPHERS: Quantifying "Learning'' in In-Context Learning via Substitution Ciphers Paper • 2504.19395 • Published Apr 28 • 5
Rank1: Test-Time Compute for Reranking in Information Retrieval Paper • 2502.18418 • Published Feb 25 • 28
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published Dec 18, 2024 • 156
Compressed Chain of Thought: Efficient Reasoning Through Dense Representations Paper • 2412.13171 • Published Dec 17, 2024 • 36
Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements Paper • 2410.08968 • Published Oct 11, 2024 • 14
RATIONALYST: Pre-training Process-Supervision for Improving Reasoning Paper • 2410.01044 • Published Oct 1, 2024 • 37