0 citations

Self-Consistency Improves Chain of Thought Reasoning in Language Models

arXiv (Cornell University)2022

Citations Over Time

Xuezhi Wang, Jason Lee, Dale Schuurmans, Quoc V. Le, Ed H., Narang, Sharan, Chowdhery, Aakanksha, Zhou, Denny

Abstract

Chain-of-thought prompting combined with pre-trained large language models has achieved encouraging results on complex reasoning tasks. In this paper, we propose a new decoding strategy, self-consistency, to replace the naive greedy decoding used in chain-of-thought prompting. It first samples a diverse set of reasoning paths instead of only taking the greedy one, and then selects the most consistent answer by marginalizing out the sampled reasoning paths. Self-consistency leverages the intuition that a complex reasoning problem typically admits multiple different ways of thinking leading to its unique correct answer. Our extensive empirical evaluation shows that self-consistency boosts the performance of chain-of-thought prompting with a striking margin on a range of popular arithmetic and commonsense reasoning benchmarks, including GSM8K (+17.9%), SVAMP (+11.0%), AQuA (+12.2%), StrategyQA (+6.4%) and ARC-challenge (+3.9%).

Related Papers

→ On the Role of Intuition in Decision Making and Some Ways of Multicriteria Aid of Intuition(1997)5 cited
Intensive Margin and Extensive Margin Adjustments of Labor Market : Turkey versus United States(2013)
On Editors' Intuition of Academic Journal(2001)
On the Necessity and Feasibility of Constructing Big Water Margin Cultural System——Taking Big Folk Water Margin by Fan Chaoyang and Zhang Qingjian as an Example(2009)