LLM Reasoning for Machine Translation: Synthetic Data Generation over Thinking Tokens Paper • 2510.11919 • Published 19 days ago • 4
From Data to Rewards: a Bilevel Optimization Perspective on Maximum Likelihood Estimation Paper • 2510.07624 • Published 24 days ago • 5
TopXGen: Topic-Diverse Parallel Data Generation for Low-Resource Machine Translation Paper • 2508.08680 • Published Aug 12 • 3
GReaTer: Gradients over Reasoning Makes Smaller Language Models Strong Prompt Optimizers Paper • 2412.09722 • Published Dec 12, 2024 • 5
Tree of Problems: Improving structured problem solving with compositionality Paper • 2410.06634 • Published Oct 9, 2024 • 9
In-Context Example Selection via Similarity Search Improves Low-Resource Machine Translation Paper • 2408.00397 • Published Aug 1, 2024 • 12
OctoPack: Instruction Tuning Code Large Language Models Paper • 2308.07124 • Published Aug 14, 2023 • 30