| MIND · MATTER · MEANING | No. 41 · May 2026 |
What the Inner Voice Is For
Deflating the inner theater doesn’t silence the inner voice.
| An essay | mindmatterandmeaning.com |
A reader who has followed the chapter this far might draw a deflating conclusion the chapter does not intend. If the inner voice is only internalized social speech, and the self only a construction, and the vocabulary of inner life only a public inheritance — then the running commentary in your head starts to sound like an epiphenomenal echo, a leftover noise from childhood that signifies nothing and does no work. Strip away the inner theater and perhaps the inner monologue goes with it, demoted to mental tinnitus.
That conclusion would be a mistake, and a revealing one. It mistakes deflating the picture for deflating the phenomenon — exactly the error the chapter has been warning against in every other domain. The inner theater is a bad picture of what the inner voice is. It is not a denial that the inner voice does anything. And the inner voice does a great deal.
The Puzzle the Voice Poses
Start with a question sharper than it first appears, raised by Marta Jorba and Agustín Vicente: why would we speak to ourselves at all?1 The puzzle has teeth in this form: if a thought is already in your mind — if you already, in some sense, know what you think — why bother clothing it in words, silently, for an audience of one who supposedly already has the information? Overt speech communicates a thought to someone who lacks it. Inner speech seems to communicate a thought to someone who already has it. Stated that way it looks pointless, a redundant broadcast to a listener who was never out of the loop.
The puzzle dissolves the moment you stop assuming the thought was fully formed before the words arrived. That assumption is the inner-theater picture wearing new clothes — the idea that finished thoughts sit in an inner storehouse and language merely fetches and displays them. Drop it, and a different possibility opens: that putting the thought into words is part of having it in a usable form. I will commit to the stronger reading of that possibility — that the verbalizing is not the read-out of a completed cognition but a stage of the cognition — while flagging it as a commitment rather than a forced conclusion.2 Dropping the inner-theater assumption clears the way for this view; it does not by itself compel it. The chief rival — that inner speech merely expresses a thought already complete in some prelinguistic medium — has able defenders, and the constitutive reading owes it an answer.3 What recommends the constitutive reading over its rival is the work the inner voice turns out to do, which the rest of this essay lays out.
What the Voice Actually Does
The empirical literature, descending from Vygotsky, lists the jobs. We talk to ourselves, Jorba and Vicente summarize, in order to monitor and control our behavior, to plan, to self-evaluate, to motivate ourselves.4 None of these is idle. Each names a real cognitive achievement that the silent voice helps bring about.
Consider control. A child learning to resist the marshmallow narrates the rule aloud — don’t touch, wait for the bell — and the narration is not a report on self-control already achieved; it is the mechanism by which the control is achieved. Vygotsky’s insight was precisely that this regulatory speech does not vanish when it goes quiet. It internalizes, and goes on regulating from the inside.5 The adult who silently tells himself one more page, then a break is running the same machinery the child ran aloud, now under the breath.
Consider planning. Holding a sequence of intended actions in order — first the bank, then the pharmacy, then call the contractor — is work that the inner voice does by rehearsing the sequence in language, parking it in the phonological loop of working memory where it can be held and re-read.6 Strip the verbal rehearsal away and the sequence frays. The voice is not narrating a plan that exists elsewhere; it is, in part, where the plan is kept.
Consider self-evaluation, the silent that came out wrong, try again after a clumsy sentence in a meeting. The verbal formulation makes the lapse available to be worked on. A wordless sense of having erred is hard to operate on; a sentence about the error can be inspected, revised, learned from. Putting the thing into words is what makes it a candidate for correction.
There is a fourth job, quieter than the others, that the constitutive reading makes vivid. Much of the mind runs in pieces that do not naturally talk to each other — the part that reads a face, the part that recalls a name, the part that weighs a risk. Saying a thing to yourself, even half-formed, drags those scattered outputs into one sentence the whole system can then read off of. Henry will be at the party, and Henry is exhausting. The voice does not announce a verdict already reached somewhere in the dark; it is the place the pieces get assembled into a verdict at all. Skeptics of the constitutive reading will say the sentence merely makes an antecedent thought available — and they have a point worth taking seriously. But a thought no part of you can hold or inspect does no work either way, so even the deflationary version concedes the voice a real job; and where the sentence is what binds the scattered pieces, there was no single antecedent thought for it to be the mere clothing of.
In each case the same structural fact recurs. Verbalizing does not transmit a finished thought to a redundant audience. It constitutes the thought in a form the rest of the system can use — hold, rehearse, inspect, correct, sustain across time. The inner voice earns its keep by doing the holding.
Why Deflation Was Never the Threat
Return now to the worry that opened the essay. The fear was that explaining the inner voice as internalized social speech would explain it away — that origin in the social world somehow demotes it to a noise. But notice that the social-origins story is what makes the function intelligible in the first place. The reason inner speech can monitor, plan, and evaluate is that it inherited those powers from outer speech, which did exactly those jobs interpersonally before they were turned inward. A parent’s wait becomes the child’s aloud wait becomes the adult’s silent wait. The regulatory power was real at every stage. Internalization did not dilute it; it relocated it.
This is the general lesson the book keeps relearning, applied once more. Showing that a capacity has humble, external, naturalistic origins does not show that the capacity is unreal or impotent. It shows where the power came from. The transparency of experience did not abolish phenomenal character — the felt quality of what it is like to undergo an experience; it relocated it from inner paint to world-directed content. The social origin of inner speech does not abolish the inner voice’s cognitive work; it relocates the source of that work from a Cartesian inner faculty — a self-standing mental power lodged inside, owing nothing to the world outside — to an internalized public practice.7
So the chapter’s title — the illusion of interiority — should be heard with care, and this essay is the warning label. What is illusory is the picture: the sealed inner chamber, the pre-linguistic self, the finished thoughts on inner display. What is entirely real is the inner voice and everything it accomplishes. You do think in words, often, and the thinking is genuine work, not a shadow play. The voice plans your afternoon, talks you through the hard parts, catches your errors, keeps your resolve from drifting. It is one of the most useful things you do.
It simply does all of that out here, in a borrowed language, in the open — not in a private theater that was never there.
Notes
- Marta Jorba and Agustín Vicente, “Cognitive Phenomenology, Access to Contents, and Inner Speech,” Journal of Consciousness Studies 21, nos. 9–10 (2014): 74–99. The authors frame the question explicitly — “why would we speak to ourselves at all?” — and note the special form it takes for thoughts already possessed: why “put in words thoughts that are already in our mind, and therefore apparently need not be expressed at all.” Their answer requires giving up the assumption that the thought is fully constituted prior to its verbalization. ↩
- I state this as the essay’s working commitment rather than a demonstrated result. The contrast is the one Jorba and Vicente draw between an activity view of inner speech — on which inner speech “is used in having conscious thoughts, not in having thoughts about those” — and a format view, on which inner speech merely supplies a vehicle or display-format for a thought constituted elsewhere (Jorba and Vicente, “Cognitive Phenomenology,” §5). The constitutive reading endorsed here is the activity view, which they note is “clearly more congenial to the defense of cognitive phenomenology than the format view.” Stronger versions are available: Christopher Gauker (Words and Images, Oxford, 2011; “Inner Speech as the Internalization of Outer Speech,” in Langland-Hassan and Vicente, eds., Inner Speech, Oxford, 2018) holds that all conceptual thought occurs in inner speech. I do not need Gauker’s universal thesis — only the local claim that, for the regulatory and planning functions canvassed below, the verbalizing partly constitutes the cognition rather than reporting it. ↩
- The rival is the expressive (format) view: inner speech is the audible-in-imagination clothing of a thought already complete in a prelinguistic medium — Fodor’s language of thought is the standard candidate. The objection presses that the functional facts I cite are equally well explained if inner speech merely makes accessible an antecedent thought rather than constituting it. Two replies. First, accessibility is not idle even on the expressive view — a thought that cannot be held, rehearsed, or inspected does no work, so the expressive theorist must still grant inner speech a load-bearing functional role, which is all the anti-deflationary argument of this essay requires. Second, the constitutive reading earns its extra commitment from the integration cases: Peter Carruthers (“The Cognitive Functions of Language,” Behavioral and Brain Sciences 25, no. 6 [2002]: 657–74) argues that the combinatorial production of inner speech is what integrates the outputs of otherwise encapsulated modules into a single trackable content — a binding the prelinguistic thought, ex hypothesi already unified, could not be performing. Where inner speech does the integrating, it is not displaying a finished thought; it is assembling one. See also Keith Frankish (in Langland-Hassan and Vicente, eds., Inner Speech, 2018) on inner speech as a means of decomposing a problem into sub-problems tractable by lower-level processes — again a doing, not a read-out. ↩
- Jorba and Vicente, “Cognitive Phenomenology,” summarizing the Vygotskyan literature: “Vygotskyans highlight the role of inner speech in self-regulation and executive” function, and hold that we use inner speech “to monitor and control our behavior, to plan, to self-evaluate, to motivate ourselves,” among other functions. The functional inventory is well attested across developmental and cognitive psychology; the philosophical payoff is that none of it requires, or is helped by, the inner-theater picture. ↩
- Lev Vygotsky, Thought and Language, trans. and ed. Alex Kozulin (Cambridge, MA: MIT Press, 1986; orig. 1934), esp. ch. 7, on the transition from egocentric (overt, self-directed) speech to inner speech — Vygotsky’s claim that “egocentric speech is a stage of development preceding inner speech,” the two fulfilling the same intellectual and self-regulatory function across the change in form — and on the decreasing vocalization by which outer speech goes silent without ceasing to regulate. The point that inner speech internalizes outer social speech, rather than expressing a faculty independent of it, is the developmental backbone of the present argument. The marshmallow illustration draws on the broader self-regulation literature (Mischel and successors) read through a Vygotskyan lens; the philosophical point does not depend on any particular experimental paradigm but on the general developmental claim that self-regulatory speech internalizes rather than disappears. ↩
- The “phonological loop” is Alan Baddeley’s term for the verbal-rehearsal component of working memory (Baddeley and Hitch, “Working Memory,” in G. Bower, ed., The Psychology of Learning and Motivation, vol. 8, Academic Press, 1974; Baddeley, Working Memory, Thought, and Action, Oxford, 2007). The claim that verbal rehearsal sustains ordered sequences in working memory is mainstream cognitive science; I borrow the construct without committing to the full multi-component model, which has rivals (e.g., embedded-process accounts). ↩
- The anti-eliminativist moral here is consonant with one reading of Dennett’s position — anti-substantialist about the self without being eliminativist about the cognitive work the self-model performs (Consciousness Explained, Boston: Little, Brown, 1991, on the self as “Center of Narrative Gravity”). That reading is contested: Galen Strawson (“The Self,” Journal of Consciousness Studies 4, nos. 5–6 [1997]: 405–28) takes Dennett’s narrative-gravity construction to be profoundly anti-realist about the self, not merely anti-substantialist. I adopt the deflationary-but-not-eliminativist reading because it is the one the present argument needs and, I think, the more defensible; the dispute does not affect the point made here, which concerns the cognitive work of inner speech rather than the metaphysics of the self. See “The Self That Wasn’t a Story” (Ch. 7 §II) for the parallel point about narrative self-constitution. ↩