FELM: Benchmarking Factuality Evaluation of Large Language Models Paper • 2310.00741 • Published Oct 1, 2023
Evaluating Factual Consistency of Summaries with Large Language Models Paper • 2305.14069 • Published May 23, 2023
Composing Parameter-Efficient Modules with Arithmetic Operations Paper • 2306.14870 • Published Jun 26, 2023 • 3
FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios Paper • 2307.13528 • Published Jul 25, 2023
Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs Paper • 2502.12982 • Published Feb 18 • 15
SkyLadder: Better and Faster Pretraining via Context Window Scheduling Paper • 2503.15450 • Published 4 days ago • 10
SkyLadder: Better and Faster Pretraining via Context Window Scheduling Paper • 2503.15450 • Published 4 days ago • 10