Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
haritzpuertoΒ 
posted an update May 28
Post
310
πŸ“œ Accepted at ACL 2025! Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs
We propose to fine-tune LLMs to generate diverse chains of thought (DCoT) in a single inference step. This enables within-inference refinement of the cots, no external feedback needed!
πŸ”— https://arxiv.org/abs/2407.03181
In this post