Markovian Transformers for Informative Language Modeling
Abstract
Chain-of-Thought (CoT) reasoning often fails to faithfully reflect a language model's underlying decision process. We address this by making the CoT text causally essential in a "Markovian" language model: next-token prediction is factored through an intermediate CoT, and the model is trained to predict future tokens from the CoT alone, independently of the original prompt. We formalize this via an "informativeness" objective that quantifies how much a trained CoT improves next-token prediction over a baseline. Trained with policy gradient, Llama 3.1 8B achieves a 33.2% absolute accuracy improvement on GSM8K. Perturbation tests confirm stronger reliance on the CoT, while cross-model transfers indicate that these reasoning traces generalize across interpreters. Our approach enhances both accuracy and interpretability, potentially extending CoT reasoning to arbitrarily long contexts and diverse tasks.
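A minimal sketch of the objective described above, in assumed notation (the symbols x, z, y, R, and the policy/predictor parameterizations are illustrative, not quoted from the paper): given a prompt x, a sampled CoT z, and future tokens y, the informativeness reward measures how much conditioning on z alone improves the predictor's likelihood of y over a CoT-free baseline, and the CoT generator is updated with a REINFORCE-style policy gradient on that reward:

$$
R(z) \;=\; \log p_\theta(y \mid z) \;-\; \log p_\theta(y \mid \varnothing),
\qquad
\nabla_\phi J(\phi) \;=\; \mathbb{E}_{z \sim \pi_\phi(\cdot \mid x)}\!\left[\, R(z)\, \nabla_\phi \log \pi_\phi(z \mid x) \,\right].
$$

Because the predictor $p_\theta(y \mid z)$ never sees the prompt $x$, the reward is high only when the CoT itself carries the information needed to predict $y$, which is what makes the CoT causally essential rather than decorative.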