| MIND · MATTER · MEANING | May 2026 |
The Origins of Inner Speech
The inner voice isn’t where thought begins — it’s speech turned inward.
| An essay | mindmatterandmeaning.com |
Right now, as you read this, something happens that probably feels like the most private thing in the world. The words on the page get pulled into your head and they sound — silently — in something you would call your own voice. If I stop a sentence mid-thought, like this one — you finish it. The completion arrives in that same inner voice. You can shout in it without moving. You can argue with it. People who have lost their hearing late in life often report that the voice continues, sometimes even with the accent they used to have.[1] Whatever else this thing is, it feels like the deepest interior we have. A private room with the door shut. The last place where the world doesn’t get in.
This essay is about why that picture is almost exactly upside down.
The bad picture goes like this. There is an outside world, full of public language. People talk to each other. There is also an inside, where each of us has an inner voice — a kind of personal narrator who comments on what we see, rehearses what we’ll say, and works problems out under the breath of the soul. The outer language is social, learned, full of conventions. The inner voice is mine — first-person, immediate, the one thing I have that no one else can hear. On this picture, public speech is the noisy externalization of an already-private inner monologue. The thoughts come first, in the head; the words come later, when the thoughts need company.
This picture comes naturally. It has also done more than almost any other to lock the Cartesian theater into modern philosophy of mind. Once you accept that the inner voice is the thing, you have already conceded that there exists an interior space, with its own contents, accessible only to its owner. The mug on the table starts to recede. The outside world becomes a stage that your real life merely watches.
Here is the alternative.
The inner voice is not a private soliloquy that we sometimes externalize. It runs the other direction. Public speech came first — historically in the species and developmentally in each child — and the inner voice consists of that public activity turned inward. What you experience as silent thinking is, very largely, silent speaking: the same activity, with the same meanings, drawn from the same shared language, but with the motor signals turned down so far that nobody else can hear it. The voice in your head has accents because the voice out of your mouth used to. It uses words you learned from other people because all the words you have, you learned from other people. There is no separate inner lexicon. There is only the one lexicon — public, shared, social — being used in two different modes.
Once you see this, several long-running puzzles relax their grip.
Take the question of meaning. If the inner voice were a private soliloquy in a private language, then the meanings of its words would have to be fixed somewhere inside the head, by the speaker’s own lights, with no public check. This is what Wittgenstein was attacking when he sketched the famous case of someone trying to give a private name to a private sensation: there is no way to tell whether the next use of the word follows the rule or breaks it, because there is no public criterion of correct use.[2] The argument generalizes. Meaning never gets fixed by what goes on inside a single skull. It gets fixed in the social practices where words have uses people can correct, share, and inherit. The inner voice borrows those meanings from outside. It does not generate them.
That conclusion sounds counterintuitive only until you ask the obvious question: where would the inner voice get its meanings from, otherwise? It is not as though the language module in your head wakes up one morning with semantics pre-installed. You learned every word you have. You learned them from speakers around you, in contexts where their uses could be corrected. When that public competence later runs silently in your head, it does not shed its public character. It is still the same competence, drawing on the same word-uses, anchored in the same shared world.
This is also, incidentally, why the inner voice is not the private chamber the bad picture says it is. It is the public voice gone quiet. Your phenomenology bears this out: when you “speak to yourself” you are not having direct contact with raw meaning. You are running through words — words with accents, with grammar, with the cadence of speech.[3] You can describe your inner monologue, transcribe it, slow it down, translate it. None of that would be possible if it were not made of the same stuff as the speech you exchange with other people.
A useful diagnostic question follows from this. When you find yourself reaching for the inner voice as evidence of some essentially private inner life, ask: would my best guess about what I am thinking really be wrong if I just said it out loud? Almost always, the answer is no. Saying it out loud is what the inner voice would have been, before we learned to turn the volume off.
This brings us to what we owe Sellars and a long tradition after him. Sellars’s Jonesean myth in “Empiricism and the Philosophy of Mind” posits inner episodes — what we call thoughts — as theoretically modelled on overt verbal utterances rather than directly observed.[4] Tim Crane and Katalin Farkas, working in this lineage, gloss the picture vividly: thoughts are “inner episodes, called ‘thoughts,’ which are conceived on the model of overt verbal utterances, but happen silently in the head.”[5] They treat the inner stream less as a private screening and more as something we model on the basis of our public commerce with each other. The order of explanation runs from public to private, not the other way. We learn what thinking is by learning to talk; then we learn to call the silent version of that activity “thinking.”
Two further consequences fall out.
The first concerns AI. If the inner voice is internalized public speech, then producing fluent inner-voice-like outputs — sentences that sound like thinking — is not, by itself, evidence of an inner life behind the sentences. A large language model can produce streams of fluent prose without any of the social, embodied history that gave human speech its meanings in the first place. The model has the surface of inner speech and none of its provenance.[6] The temptation to see understanding behind its outputs comes precisely from the inverted picture I started with: we assume the words must be coming from an interior, because that’s where our own words seem to come from. They aren’t. Our own words come from a long history of public language that ours simply continues.
The second consequence concerns ourselves. If the inner voice is public speech gone silent, then the most private-feeling activity we have is, at its root, a social inheritance. You think with words you did not invent, in a language you did not design, using meanings calibrated by a community you mostly never met. The Cartesian sense that thinking is yours alone survives only because we forget where the equipment came from. Strip away the borrowed vocabulary and grammar and there would be very little left in the inner room. There would barely be an inner room.
Now an obvious objection. Surely, the objector says, there is more to thinking than silent speech. Mathematicians solve problems without verbalizing them. Musicians compose without inner narration. Animals without language clearly think. The claim that thinking just is internalized public speech overreaches.
Take this seriously, because it is mostly right. The claim worth defending is not that every act of cognition consists in inner monologue. Pre-linguistic infants think; non-human animals think; expert performance often runs faster than any inner narrator could keep up with. The claim is narrower and survives: the experience of thinking-in-words — the inner voice, the one that feels like the inmost private chamber — is best understood as internalized public speech. There is non-verbal cognition, certainly, but it is not what feels private. It is what runs below the level of phenomenology. The phenomenologically vivid voice in your head, the one this essay is mostly about, is the silent residue of conversations you have had and conversations you could have. That narrower claim is what the picture I am offering needs, and it is what survives the objection.[7]
So the inner voice is not nothing. It runs as a real, structured activity, with phenomenal presence. It also does not constitute a private chamber, does not consist of a separate language, and does not anchor the place where meaning gets started. It consists of public speech, well-rehearsed and turned inward — quiet enough that no one else hears, audible enough that you do. The room you thought was private always had the door open. You just never noticed the draft.
Footnotes
[1] The persistence of late-deafened speakers’ inner voice in their pre-deafness accent is reported anecdotally and in clinical literature on inner speech across hearing loss; for a careful treatment of inner-speech phenomenology more generally see Hurlburt, Heavey & Kelsey (2013), “Toward a Phenomenology of Inner Speaking,” Consciousness and Cognition 22(4): 1477–1494, whose Descriptive Experience Sampling reveals large individual differences in the frequency and texture of inner speaking. The picture I draw on these data — that inner speech inherits its phenomenal character from prior outer speech rather than the other way round — runs slightly ahead of Hurlburt et al.’s own conclusions and is the present author’s reading, consistent with their findings rather than directly argued by them.
[2] Philosophical Investigations §§243–315, especially §258. The thrust is not the often-misread “no one could secretly invent a private code,” but rather that the very notion of following a rule requires the possibility of public correction. Without that, there is no fact of the matter about whether the next application of a term agrees with the prior ones. Inner-voice meaning, if it floated free of any such corrective practice, would not be meaning at all. See McDowell (1996, Lecture VI) for an interpretation of Wittgenstein’s target that frames the private-language argument as part of a wider attack on the very idea of a self-sufficient inner standpoint.
[3] This phenomenological observation has been pressed hardest in the cognitive phenomenology literature, where authors like Strawson, Pitt, and Siewert argue that occurrent thought has a proprietary phenomenal character distinct from sensory imagery. The position I am defending here is compatible with that claim about what the phenomenology is like, while differing on its explanatory direction: the linguistic character of inner thought episodes is not evidence of a private language but of the public language doing its work silently.
[4] Sellars, “Empiricism and the Philosophy of Mind” (1956), §§48ff. In the Jonesean myth, the genius Jones develops a theory according to which overt utterances are the culmination of a process beginning with inner episodes — theoretical posits modelled on the antecedent practice of public discourse, not items discovered through inward inspection. The relevant moral for the present essay: thought-talk is conceptually downstream from speech-talk, even when its referents are silent.
[5] Crane, T. and K. Farkas (2022), “Mental Fact and Mental Fiction,” in T. Demeter, T. Parent and A. Toon (eds.), Mental Fictionalism: Philosophical Explorations (Routledge), p. 14 (ms.) — the quoted formulation is theirs, not Sellars’s; they gloss the Jonesean picture in this sentence. Crane and Farkas’s own positive thesis concerns standing mental states (beliefs, desires) as modelled via public ascription rather than directly inspected; the parallel I draw in the body extends this picture from standing states to occurrent inner speech episodes, and the extension belongs to the present essay, not to them.
[6] The point that fluent linguistic surface is consistent with the absence of grounded meaning is developed at length in Jung, K. (2025), “Augustine, AI, and the Two Models of Language,” Journal of Religious Ethics 53(2): 217–238 — particularly Jung’s deployment of Wittgenstein’s meaning is use to argue that large language models succeed at the rule-governed dimensions of language game-play while failing to instantiate the non-linguistic, world-engaged dimensions that make use the right kind of use. Cf. also semantic externalism more broadly: Putnam (1975), Burge (1979).
[7] The objection — that there is non-verbal thought — is sometimes pressed as if it refuted the broader anti-private-language line. It does not. The point about the private language argument is about the constitution of meaning, not the medium of cognition. Non-verbal animals and pre-linguistic infants can have intentional states whose contents are externalist in exactly the relevant sense: fixed by causal-historical relations to the environment, not by inner verbal labelling. The story I am telling about inner speech is a story specifically about the phenomenology of linguistic thinking — the inner voice as such — not a reduction of all cognition to inner monologue. For background on the relation between cognitive phenomenology and conceptual content, see the chapters collected in Bayne and Montague (eds.) (2011), Cognitive Phenomenology, OUP.
References
Bayne, T. and M. Montague (eds.) (2011). Cognitive Phenomenology. Oxford University Press.
Burge, T. (1979). “Individualism and the Mental.” Midwest Studies in Philosophy 4: 73–121.
Crane, T. and K. Farkas (2022). “Mental Fact and Mental Fiction.” In T. Demeter, T. Parent and A. Toon (eds.), Mental Fictionalism: Philosophical Explorations. Routledge.
Hurlburt, R. T., C. L. Heavey and J. M. Kelsey (2013). “Toward a Phenomenology of Inner Speaking.” Consciousness and Cognition 22(4): 1477–1494.
Jung, K. (2025). “Augustine, AI, and the Two Models of Language.” Journal of Religious Ethics 53(2): 217–238.
Mathiesen, K. (2005). “Collective Consciousness, Collective Intentionality, and Phenomenology.” In D. W. Smith and A. L. Thomasson (eds.), Phenomenology and Philosophy of Mind. Oxford University Press.
McDowell, J. (1996). Mind and World. Harvard University Press.
Putnam, H. (1975). “The Meaning of ‘Meaning.’” In Mind, Language and Reality: Philosophical Papers, Volume 2. Cambridge University Press.
Sellars, W. (1956). “Empiricism and the Philosophy of Mind.” In H. Feigl and M. Scriven (eds.), Minnesota Studies in the Philosophy of Science, Vol. 1. University of Minnesota Press.
Wittgenstein, L. (1953). Philosophical Investigations. Trans. G. E. M. Anscombe. Blackwell.