XReasoning - models Collection https://arxiv.org/abs/2505.22888 ds means continued post-training on DeepSeek-distilled Qwen Math 7B; limo-{language}-{amount of data} • 19 items • Updated 24 days ago • 1
When Models Reason in Your Language: Controlling Thinking Trace Language Comes at the Cost of Accuracy Paper • 2505.22888 • Published about 1 month ago • 6
XReasoning Collection multilingualness - Dataset for XReasoning https://arxiv.org/abs/2505.22888 XReasoning • 8 items • Updated 24 days ago • 1
Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models Paper • 2505.10554 • Published May 15 • 119
Cross-Lingual Consistency of Factual Knowledge in Multilingual Language Models Paper • 2310.10378 • Published Oct 16, 2023 • 1
Likelihood as a Performance Gauge for Retrieval-Augmented Generation Paper • 2411.07773 • Published Nov 12, 2024 • 1
Article What We Learned About LLM/VLMs in Healthcare AI Evaluation: By shanchen • Nov 8, 2024 • 13
The SIFo Benchmark: Investigating the Sequential Instruction Following Ability of Large Language Models Paper • 2406.19999 • Published Jun 28, 2024 • 4
Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation Paper • 2406.13663 • Published Jun 19, 2024 • 7
Resonance RoPE: Improving Context Length Generalization of Large Language Models Paper • 2403.00071 • Published Feb 29, 2024 • 25