@haritzpuerto on Hugging Face: "📜 Accepted at ACL 2025! Fine-Tuning on Diverse Reasoning Chains Drives…"

Post

310

📜 Accepted at ACL 2025! Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs
We propose to fine-tune LLMs to generate diverse chains of thought (DCoT) in a single inference step. This enables within-inference refinement of the cots, no external feedback needed!
🔗 https://arxiv.org/abs/2407.03181

Join the conversation