From Consumption to Reflection: Designing Human-AI Relations for Stable Reasoning
Rikard Rosenbacke, Carl Rosenbacke, Victor Rosenbacke, Martin McKee
Why It Matters
What makes this one worth your time
This work addresses critical cognitive limitations in AI interactions, potentially improving decision-making processes in applications reliant on LLMs.
RRI transforms human-AI interaction into a structured joint reasoning system.
Summary
The paper introduces Relational Reflective Intelligence (RRI), a governance layer designed to enhance reasoning in human-AI interactions by implementing auditable reasoning loops that address cognitive vulnerabilities shared by both humans and large language models.
Key contributions
- Introduction of the Relational Reflective Intelligence (RRI) framework.
- Development of the Rose-Frame for identifying reasoning breakdowns.
- Creation of the Architect's Pen for introducing targeted reflection steps.
Notable insights
- The concept of relational drift highlights the compounding errors in human-AI interactions, emphasizing the need for structured reasoning.
- RRI's approach of embedding reflection into inference-time workflows offers a novel method to enhance reasoning without retraining models.
Possible limitations
- Not stated in the abstract.
Abstract
arXiv:2606.11195v1 Announce Type: cross Abstract: Large language models (LLMs) have transformed how humans access information, but not how we reason with it. Their fluency accelerates consumption while bypassing the slow, reflective processes that underpin sound judgment. This paper introduces Relational Reflective Intelligence (RRI), an inference-time governance layer that operationalizes reflection through auditable reasoning loops. RRI operates not inside the model but around it, providing a practical structure for stable, auditable reasoning between humans and LLMs. The core premise is that LLMs inherit cognitive vulnerabilities similar to those that shape human thought: reliance on intuitive shortcuts, confusion between representation and reality, and a preference for coherence over falsification. When humans and models share these tendencies, their errors compound. We refer to this as relational drift, a failure that arises from interaction rather than from the model alone. Addressing this requires a shift from modeling relations between words to structuring relations between model outputs and human reasoning. RRI provides this missing layer through three components: the Rose-Frame, which identifies likely breakdowns in reasoning; the Architect's Pen, which introduces targeted reflection steps at critical moments; and an inference-time workflow that embeds these steps without retraining the model. Together, these elements transform human-AI interaction into a joint reasoning system with explicit checkpoints, conflict surfacing, and an auditable trail of assumptions. Rather than making machines think like humans or forcing humans to reason like machines, RRI creates a structured interaction in which both compensate for each other's limitations. It reframes AI safety as a cognitive architecture problem, where reliable decisions depend on embedding reflection directly into the interaction process.