DataComp-LM: In search of the next generation of training sets for language models Paper • 2406.11794 • Published Jun 17, 2024 • 54
In-Context Learning with Long-Context Models: An In-Depth Exploration Paper • 2405.00200 • Published Apr 30, 2024
Efficient Long-Text Understanding with Short-Text Models Paper • 2208.00748 • Published Aug 1, 2022 • 1
ZeroSCROLLS: A Zero-Shot Benchmark for Long Text Understanding Paper • 2305.14196 • Published May 23, 2023
DoG is SGD's Best Friend: A Parameter-Free Dynamic Step Size Schedule Paper • 2302.12022 • Published Feb 8, 2023 • 1
Scaling Laws Under the Microscope: Predicting Transformer Performance from Small Scale Experiments Paper • 2202.06387 • Published Feb 13, 2022
Beyond Importance Scores: Interpreting Tabular ML by Visualizing Feature Semantics Paper • 2111.05898 • Published Nov 10, 2021
Achieving Model Robustness through Discrete Adversarial Training Paper • 2104.05062 • Published Apr 11, 2021
Scene Graph to Image Generation with Contextualized Object Layout Refinement Paper • 2009.10939 • Published Sep 23, 2020