When Models Reason in Your Language: Controlling Thinking Trace Language Comes at the Cost of Accuracy Paper • 2505.22888 • Published about 1 month ago • 6 • 2
The SIFo Benchmark: Investigating the Sequential Instruction Following Ability of Large Language Models Paper • 2406.19999 • Published Jun 28, 2024 • 4 • 1