Back to today's list

The Periodic Table of LLM Reasoning: A Structured Survey of Reasoning Paradigms, Methods, and Failure Modes

Avinash Anand, Mahisha Ramesh, Avni Mittal, Ashutosh Kumar, Erik Cambria, Zhengkui Wang, Timothy Liu, Aik Beng Ng, Simon See, Rajiv Ratn Shah

Published Jun 11, 2026
Editorial review6.8
Relevance0.502
Freshness0.192

Why It Matters

What makes this one worth your time

Understanding the reasoning capabilities and limitations of LLMs is crucial for developing more robust and interpretable AI systems.

A comprehensive survey of reasoning paradigms and failure modes in LLMs.

Summary

The paper surveys over 300 recent papers to analyze reasoning capabilities in large language models (LLMs), categorizing reasoning paradigms and identifying failure modes and limitations.

Key contributions

  • A structured taxonomy of LLM reasoning research.
  • Analysis of methodological trends across reasoning paradigms.
  • Synthesis of recurring limitations and failure modes in LLM reasoning.

Notable insights

  • The paper introduces a structured taxonomy of reasoning paradigms in LLMs.
  • It identifies recurring limitations such as reasoning hallucinations and weak causal abstraction.

Possible limitations

  • Not stated in the abstract

Abstract

arXiv:2606.11470v1 Announce Type: new Abstract: Large Language Models (LLMs) have achieved strong performance across natural language processing tasks, yet reliable reasoning remains an open challenge. Although modern LLMs show progress in structured inference, multi-step problem solving, and contextual understanding, their reasoning behavior is often inconsistent and sensitive to prompting strategies, task design, and model scale. This survey provides a systematic analysis of more than 300 recent papers from arXiv, Semantic Scholar, Google Scholar, Papers with Code, and the ACL Anthology to examine how reasoning capabilities emerge in LLMs and where they fail. We make three main contributions. First, we introduce a structured taxonomy of LLM reasoning research, covering Chain-of-Thought reasoning, multi-hop reasoning, mathematical reasoning, common sense reasoning, visual and temporal reasoning, code and algorithmic reasoning, retrieval-augmented reasoning, tool-augmented and agentic reasoning, and reinforcement learning-based reasoning. Second, we analyze methodological trends across these paradigms, including prompting methods, model architectures, training objectives, reward modeling, and evaluation benchmarks. Third, we synthesize recurring limitations and failure modes, such as reasoning hallucinations, brittle multi-step inference, weak causal abstraction, and poor cross-domain generalization. By organizing a rapidly expanding literature, this survey offers a unified view of the current capabilities and limitations of reasoning in LLMs. We also identify emerging research directions, including meta-reasoning, self-evolving reasoning frameworks, multimodal reasoning, and socially grounded reasoning. Overall, this work aims to serve as a reference for developing more robust, interpretable, and generalizable reasoning systems in future language models.