Mind, Matter, and Meaning

Tag: semantic externalism

The Origins of Inner Speech

MIND · MATTER · MEANING May 2026

The Origins of Inner Speech

The inner voice isn’t where thought begins — it’s speech turned inward.

An essay mindmatterandmeaning.com

Right now, as you read this, something happens that probably feels like the most private thing in the world. The words on the page get pulled into your head and they sound — silently — in something you would call your own voice. If I stop a sentence mid-thought, like this one — you finish it. The completion arrives in that same inner voice. You can shout in it without moving. You can argue with it. People who have lost their hearing late in life often report that the voice continues, sometimes even with the accent they used to have.^[1] Whatever else this thing is, it feels like the deepest interior we have. A private room with the door shut. The last place where the world doesn’t get in.

This essay is about why that picture is almost exactly upside down.

The bad picture goes like this. There is an outside world, full of public language. People talk to each other. There is also an inside, where each of us has an inner voice — a kind of personal narrator who comments on what we see, rehearses what we’ll say, and works problems out under the breath of the soul. The outer language is social, learned, full of conventions. The inner voice is mine — first-person, immediate, the one thing I have that no one else can hear. On this picture, public speech is the noisy externalization of an already-private inner monologue. The thoughts come first, in the head; the words come later, when the thoughts need company.

This picture comes naturally. It has also done more than almost any other to lock the Cartesian theater into modern philosophy of mind. Once you accept that the inner voice is the thing, you have already conceded that there exists an interior space, with its own contents, accessible only to its owner. The mug on the table starts to recede. The outside world becomes a stage that your real life merely watches.

Here is the alternative.

The inner voice is not a private soliloquy that we sometimes externalize. It runs the other direction. Public speech came first — historically in the species and developmentally in each child — and the inner voice consists of that public activity turned inward. What you experience as silent thinking is, very largely, silent speaking: the same activity, with the same meanings, drawn from the same shared language, but with the motor signals turned down so far that nobody else can hear it. The voice in your head has accents because the voice out of your mouth used to. It uses words you learned from other people because all the words you have, you learned from other people. There is no separate inner lexicon. There is only the one lexicon — public, shared, social — being used in two different modes.

Once you see this, several long-running puzzles relax their grip.

Take the question of meaning. If the inner voice were a private soliloquy in a private language, then the meanings of its words would have to be fixed somewhere inside the head, by the speaker’s own lights, with no public check. This is what Wittgenstein was attacking when he sketched the famous case of someone trying to give a private name to a private sensation: there is no way to tell whether the next use of the word follows the rule or breaks it, because there is no public criterion of correct use.^[2] The argument generalizes. Meaning never gets fixed by what goes on inside a single skull. It gets fixed in the social practices where words have uses people can correct, share, and inherit. The inner voice borrows those meanings from outside. It does not generate them.

That conclusion sounds counterintuitive only until you ask the obvious question: where would the inner voice get its meanings from, otherwise? It is not as though the language module in your head wakes up one morning with semantics pre-installed. You learned every word you have. You learned them from speakers around you, in contexts where their uses could be corrected. When that public competence later runs silently in your head, it does not shed its public character. It is still the same competence, drawing on the same word-uses, anchored in the same shared world.

This is also, incidentally, why the inner voice is not the private chamber the bad picture says it is. It is the public voice gone quiet. Your phenomenology bears this out: when you “speak to yourself” you are not having direct contact with raw meaning. You are running through words — words with accents, with grammar, with the cadence of speech.^[3] You can describe your inner monologue, transcribe it, slow it down, translate it. None of that would be possible if it were not made of the same stuff as the speech you exchange with other people.

A useful diagnostic question follows from this. When you find yourself reaching for the inner voice as evidence of some essentially private inner life, ask: would my best guess about what I am thinking really be wrong if I just said it out loud? Almost always, the answer is no. Saying it out loud is what the inner voice would have been, before we learned to turn the volume off.

This brings us to what we owe Sellars and a long tradition after him. Sellars’s Jonesean myth in “Empiricism and the Philosophy of Mind” posits inner episodes — what we call thoughts — as theoretically modelled on overt verbal utterances rather than directly observed.^[4] Tim Crane and Katalin Farkas, working in this lineage, gloss the picture vividly: thoughts are “inner episodes, called ‘thoughts,’ which are conceived on the model of overt verbal utterances, but happen silently in the head.”^[5] They treat the inner stream less as a private screening and more as something we model on the basis of our public commerce with each other. The order of explanation runs from public to private, not the other way. We learn what thinking is by learning to talk; then we learn to call the silent version of that activity “thinking.”

Two further consequences fall out.

The first concerns AI. If the inner voice is internalized public speech, then producing fluent inner-voice-like outputs — sentences that sound like thinking — is not, by itself, evidence of an inner life behind the sentences. A large language model can produce streams of fluent prose without any of the social, embodied history that gave human speech its meanings in the first place. The model has the surface of inner speech and none of its provenance.^[6] The temptation to see understanding behind its outputs comes precisely from the inverted picture I started with: we assume the words must be coming from an interior, because that’s where our own words seem to come from. They aren’t. Our own words come from a long history of public language that ours simply continues.

The second consequence concerns ourselves. If the inner voice is public speech gone silent, then the most private-feeling activity we have is, at its root, a social inheritance. You think with words you did not invent, in a language you did not design, using meanings calibrated by a community you mostly never met. The Cartesian sense that thinking is yours alone survives only because we forget where the equipment came from. Strip away the borrowed vocabulary and grammar and there would be very little left in the inner room. There would barely be an inner room.

Now an obvious objection. Surely, the objector says, there is more to thinking than silent speech. Mathematicians solve problems without verbalizing them. Musicians compose without inner narration. Animals without language clearly think. The claim that thinking just is internalized public speech overreaches.

Take this seriously, because it is mostly right. The claim worth defending is not that every act of cognition consists in inner monologue. Pre-linguistic infants think; non-human animals think; expert performance often runs faster than any inner narrator could keep up with. The claim is narrower and survives: the experience of thinking-in-words — the inner voice, the one that feels like the inmost private chamber — is best understood as internalized public speech. There is non-verbal cognition, certainly, but it is not what feels private. It is what runs below the level of phenomenology. The phenomenologically vivid voice in your head, the one this essay is mostly about, is the silent residue of conversations you have had and conversations you could have. That narrower claim is what the picture I am offering needs, and it is what survives the objection.^[7]

So the inner voice is not nothing. It runs as a real, structured activity, with phenomenal presence. It also does not constitute a private chamber, does not consist of a separate language, and does not anchor the place where meaning gets started. It consists of public speech, well-rehearsed and turned inward — quiet enough that no one else hears, audible enough that you do. The room you thought was private always had the door open. You just never noticed the draft.

Footnotes

[1] The persistence of late-deafened speakers’ inner voice in their pre-deafness accent is reported anecdotally and in clinical literature on inner speech across hearing loss; for a careful treatment of inner-speech phenomenology more generally see Hurlburt, Heavey & Kelsey (2013), “Toward a Phenomenology of Inner Speaking,” Consciousness and Cognition 22(4): 1477–1494, whose Descriptive Experience Sampling reveals large individual differences in the frequency and texture of inner speaking. The picture I draw on these data — that inner speech inherits its phenomenal character from prior outer speech rather than the other way round — runs slightly ahead of Hurlburt et al.’s own conclusions and is the present author’s reading, consistent with their findings rather than directly argued by them.

[2] Philosophical Investigations §§243–315, especially §258. The thrust is not the often-misread “no one could secretly invent a private code,” but rather that the very notion of following a rule requires the possibility of public correction. Without that, there is no fact of the matter about whether the next application of a term agrees with the prior ones. Inner-voice meaning, if it floated free of any such corrective practice, would not be meaning at all. See McDowell (1996, Lecture VI) for an interpretation of Wittgenstein’s target that frames the private-language argument as part of a wider attack on the very idea of a self-sufficient inner standpoint.

[3] This phenomenological observation has been pressed hardest in the cognitive phenomenology literature, where authors like Strawson, Pitt, and Siewert argue that occurrent thought has a proprietary phenomenal character distinct from sensory imagery. The position I am defending here is compatible with that claim about what the phenomenology is like, while differing on its explanatory direction: the linguistic character of inner thought episodes is not evidence of a private language but of the public language doing its work silently.

[4] Sellars, “Empiricism and the Philosophy of Mind” (1956), §§48ff. In the Jonesean myth, the genius Jones develops a theory according to which overt utterances are the culmination of a process beginning with inner episodes — theoretical posits modelled on the antecedent practice of public discourse, not items discovered through inward inspection. The relevant moral for the present essay: thought-talk is conceptually downstream from speech-talk, even when its referents are silent.

[5] Crane, T. and K. Farkas (2022), “Mental Fact and Mental Fiction,” in T. Demeter, T. Parent and A. Toon (eds.), Mental Fictionalism: Philosophical Explorations (Routledge), p. 14 (ms.) — the quoted formulation is theirs, not Sellars’s; they gloss the Jonesean picture in this sentence. Crane and Farkas’s own positive thesis concerns standing mental states (beliefs, desires) as modelled via public ascription rather than directly inspected; the parallel I draw in the body extends this picture from standing states to occurrent inner speech episodes, and the extension belongs to the present essay, not to them.

[6] The point that fluent linguistic surface is consistent with the absence of grounded meaning is developed at length in Jung, K. (2025), “Augustine, AI, and the Two Models of Language,” Journal of Religious Ethics 53(2): 217–238 — particularly Jung’s deployment of Wittgenstein’s meaning is use to argue that large language models succeed at the rule-governed dimensions of language game-play while failing to instantiate the non-linguistic, world-engaged dimensions that make use the right kind of use. Cf. also semantic externalism more broadly: Putnam (1975), Burge (1979).

[7] The objection — that there is non-verbal thought — is sometimes pressed as if it refuted the broader anti-private-language line. It does not. The point about the private language argument is about the constitution of meaning, not the medium of cognition. Non-verbal animals and pre-linguistic infants can have intentional states whose contents are externalist in exactly the relevant sense: fixed by causal-historical relations to the environment, not by inner verbal labelling. The story I am telling about inner speech is a story specifically about the phenomenology of linguistic thinking — the inner voice as such — not a reduction of all cognition to inner monologue. For background on the relation between cognitive phenomenology and conceptual content, see the chapters collected in Bayne and Montague (eds.) (2011), Cognitive Phenomenology, OUP.

References

Bayne, T. and M. Montague (eds.) (2011). Cognitive Phenomenology. Oxford University Press.

Burge, T. (1979). “Individualism and the Mental.” Midwest Studies in Philosophy 4: 73–121.

Crane, T. and K. Farkas (2022). “Mental Fact and Mental Fiction.” In T. Demeter, T. Parent and A. Toon (eds.), Mental Fictionalism: Philosophical Explorations. Routledge.

Hurlburt, R. T., C. L. Heavey and J. M. Kelsey (2013). “Toward a Phenomenology of Inner Speaking.” Consciousness and Cognition 22(4): 1477–1494.

Jung, K. (2025). “Augustine, AI, and the Two Models of Language.” Journal of Religious Ethics 53(2): 217–238.

Mathiesen, K. (2005). “Collective Consciousness, Collective Intentionality, and Phenomenology.” In D. W. Smith and A. L. Thomasson (eds.), Phenomenology and Philosophy of Mind. Oxford University Press.

McDowell, J. (1996). Mind and World. Harvard University Press.

Putnam, H. (1975). “The Meaning of ‘Meaning.’” In Mind, Language and Reality: Philosophical Papers, Volume 2. Cambridge University Press.

Sellars, W. (1956). “Empiricism and the Philosophy of Mind.” In H. Feigl and M. Scriven (eds.), Minnesota Studies in the Philosophy of Science, Vol. 1. University of Minnesota Press.

Wittgenstein, L. (1953). Philosophical Investigations. Trans. G. E. M. Anscombe. Blackwell.

May 25, 2026
What a Machine Would Have to Earn
MIND · MATTER · MEANING No. 29 · May 2026

What a Machine Would Have to Earn

Understanding is earned in a world, not performed on a screen.

An essay mindmatterandmeaning.com

A friend sent me a transcript last spring. He had asked a chatbot what a sunburn feels like the morning after — that specific tight, hot, can’t-find-a-way-to-lie-down misery — and the machine answered better than he could have. It named the flinch when a shirt seam drags across the shoulders. It knew the small betrayal of forgetting for a second and leaning back into a hot car seat. He found it uncanny, a little moving, and he wanted to know: does it understand what a sunburn is?

Good question, asked at the right moment. The honest answer takes a while to earn, so let me start with the answer most of us reach for first — because it’s reasonable, and because it’s wrong.

The reasonable view goes like this. Understanding shows up in what you can do. A student who can answer any question about the French Revolution, field the follow-ups, catch the trick ones, and explain the whole thing to a ten-year-old — that student understands the French Revolution, and we would be cranks to deny it on the grounds that we can’t peer inside her skull. Understanding is as understanding does. So if a machine handles every question about sunburns as well as a sunburned person could, the difference between the machine and the person starts to look like a difference we invented to feel special about ourselves. The picture has a respectable pedigree: it descends from behaviorism, and it has a famous instrument in Alan Turing’s imitation game, where the test for thinking just is indistinguishable performance.

Notice the quiet assumption, though. The picture takes understanding a word to be a matter of using it correctly, and takes “correctly” to be settled by looking only at the outputs. Pull on that thread and the whole thing comes apart in your hands.

Stevan Harnad, a cognitive scientist with a gift for naming traps, named this one in 1990: the symbol grounding problem.¹ Imagine trying to learn Chinese from a Chinese-only dictionary. Every definition sends you to other entries, which send you to others, and you ride that merry-go-round forever without once touching the ground. A system whose symbols are defined only by more symbols never means anything by them. Meaning gets in only when some of the symbols connect to the things they are about by some route other than further symbols — when “red” hooks to red, not merely to “crimson,” “scarlet,” and “the color of a stop sign.”

What supplies the hook is not anything inside the system. Hilary Putnam made the case unforgettable with a thought experiment about Twin Earth — a planet just like ours except that the stuff they call “water” there is some other compound with all of water’s surface features.² A person here and their molecular duplicate there can be internally identical, down to the atom, and still mean different things by “water,” because the word answers to the stuff in the world, not to the state of the head. “Meanings,” Putnam wrote, “just ain’t in the head.” Tyler Burge pushed the same point from the social side: what your word “arthritis” picks out depends on the practice of the community you defer to, not on a private definition you carry around.³ Content lives in a relation — between a system, a world, and the company it keeps.

There is even a natural story about how the relation gets built. On teleosemantic accounts — Ruth Millikan’s and Fred Dretske’s, chiefly — a state comes to be about something by acquiring the function of tracking it, the way a frog’s strike comes to be about flies through a long history in which catching flies is what kept frogs going.⁴ The clinching detail is misrepresentation: to get something wrong, a system has to have been in the business of getting it right. A state can mean fly and fire at a passing pellet only because its job, fixed by history, was flies. No history, no job; no job, nothing to be mistaken about; nothing to be mistaken about, no content.

So understanding a word turns out to be an achievement, not a knack: it consists in having states that are genuinely about the world — not states that merely accompany the right answers, but states directed at the very things the words name — and aboutness is something a system earns over time. Your “red” means red because red things have been pushing on you, through eyes and skin and the small stakes of an actual life, since before you could pronounce the word. This is what people are gesturing at, usually too vaguely, when they say minds are embodied. The word invites mysticism, so let me drain it of any. Embodiment names three sober requirements: the system takes in the world through senses and acts back on it; its inner states have been shaped by real traffic with the features they represent; and those states are there to track a world the system inhabits, not merely to emit the right strings. Michael Tye — who spent three decades building the most careful theory we have of how experience could be nothing more than representational content, and then argued that even his own theory needs history — makes the sharpest version of the point. Two creatures could be intrinsically identical at an instant, he argues, and still differ in what they experience, because one has a past of tracking the world and the other was assembled, atom for atom, five minutes ago.⁵ History is not decoration on content. It is part of what fixes it.

Which lets me say, at last, what a machine would actually need. Not the right stuff — I don’t think the barrier is silicon, and here I part company with John Searle, who ties understanding to the specific causal powers of biological brains.⁶ The barrier isn’t carbon; it’s a world. A system understands when its inner states have been shaped by, and stay answerable to, the things they represent — when it senses and acts, lives under stakes, and can get things wrong and pay for it. Build that, and the door to genuine artificial understanding stands open. I mean open, not slyly closed. The claim here is not the tired one that machines could never understand. It is that understanding is earned through engagement, and there is no coupon for skipping the engagement.

Skipping the engagement is precisely what today’s text-only language models do. A large model learns the statistics of how we talk — the staggeringly intricate shape of which words follow which — from a corpus of descriptions of the world, never from the world.⁷ It has read everything ever written about sunburns and has never once had skin. Its “red” is a position in an immense map of words, anchored to other words, anchored to nothing outside the map. The fluency is real and the achievement is genuine; it is simply not the achievement of understanding.

Here the strongest objection arrives, and it deserves a real hearing rather than a brush-off. If the machine’s answers became indistinguishable — in principle, not merely in today’s practice — from an understander’s, then insisting it still lacks understanding looks like clinging to a ghost. A difference that makes no detectable difference, the objection runs, is no difference at all. That is the whole moral of the imitation game, and it is not a silly one.

But “makes no difference you can detect in the output” is the definition of a good simulation, not the absence of a difference. Simulate a hurricane to any precision you please: the equations are flawless and your desk stays bone dry. Modeling a process is not running it.⁸ Two systems can produce the very same words while one means them and the other reports the statistics of how the word gets used — because meaning was never a property of the output. It lives in the history behind the output, and that history is exactly what an output test cannot see. The objection mistakes the instrument for the quarry. It notices that the meter reads the same and concludes there is nothing the meter is missing.

So: does the machine understand what a sunburn is? It has never had skin. It has never flinched, never dreaded an evening because of how the sheets would feel. It holds the words and not the world the words are about. Ask the question again in some later decade, of some later system that has spent years bumping into things and paying for its errors, and the answer could come back different — that is the part the doom-mongers and the hype-merchants both manage to miss. Understanding is not a performance a system delivers. It is a debt a system pays, to the world, in the one currency the world accepts: contact. Until the bill comes due, fluency is only fluency. It was always going to be the easy part.

References

Burge, Tyler. 1979. “Individualism and the Mental.” Midwest Studies in Philosophy 4: 73–121.

Dretske, Fred. 1988. Explaining Behavior: Reasons in a World of Causes. Cambridge, MA: MIT Press.

Harnad, Stevan. 1990. “The Symbol Grounding Problem.” Physica D 42: 335–346.

Harnad, Stevan. 2002. “Symbol Grounding and the Origin of Language.” In Computationalism: New Directions, edited by Matthias Scheutz. Cambridge, MA: MIT Press.

Havlík, Vladimír. 2025. “Meaning and Understanding in Large Language Models.” Synthese 205: 9.

Millikan, Ruth Garrett. 1989. “Biosemantics.” Journal of Philosophy 86 (6): 281–297.

Putnam, Hilary. 1975. “The Meaning of ‘Meaning.’” In Mind, Language and Reality: Philosophical Papers, Volume 2, 215–271. Cambridge: Cambridge University Press.

Searle, John R. 1980. “Minds, Brains, and Programs.” Behavioral and Brain Sciences 3 (3): 417–457.

Searle, John R. 1990. “Is the Brain a Digital Computer?” Proceedings and Addresses of the American Philosophical Association 64 (3): 21–37.

Tye, Michael. 2019. “Homunculi Heads and Silicon Chips: The Importance of History to Phenomenology.” In Blockheads! Essays on Ned Block’s Philosophy of Mind and Consciousness, edited by Adam Pautz and Daniel Stoljar. Cambridge, MA: MIT Press.

Notes
1. Harnad (1990) coined “the symbol grounding problem” and framed it with the Chinese-dictionary regress; he later tied it to the origin of language (Harnad 2002). The problem is older than the label — it is the computational heir of the externalist worry about how any representation latches onto its object — but Harnad’s formulation is the one the AI literature inherited, and it is sharper than the Chinese Room for present purposes because it isolates grounding from Searle’s further claims about consciousness. ↩
2. Putnam (1975). The conclusion is specifically about reference and extension: the content that fixes what “water” is true of does not supervene on the speaker’s intrinsic states. Note that Putnam later qualified his own semantic externalism in several directions; nothing here turns on the most contested versions of the thesis, only on the minimal claim that reference depends on causal-environmental relations the head alone does not settle. ↩
3. Burge (1979) extends externalism from natural-kind reference (Putnam) to social content: holding a thinker’s physical history fixed while varying the surrounding linguistic community varies which concept the thinker exercises. The two cases are independent routes to the same structural conclusion — internal organization underdetermines content — which is why the essay leans on both rather than treating Burge as a footnote to Putnam. ↩
4. The teleosemantic tradition, principally Millikan (1989) and Dretske (1988), grounds content in proper function: a state represents what it has the function of tracking, where functions are fixed by selection or learning history. Misrepresentation is the standard adequacy test for any naturalistic theory of content, since a theory on which states cannot be false has not yet described representation. Rival tracking theories handle reliable misrepresentation differently, but the historical structure — content fixed by what a state was for — is common ground and is what the embodiment argument borrows. ↩
5. Tye (2019). The thesis is that two beings intrinsically alike at a time can differ in phenomenal character because they differ in history — a representationalist’s concession that current intrinsic structure does not suffice. Ned Block replies in the same volume (“Fading Qualia: A Response to Michael Tye”) that a subject could be radically wrong about their own phenomenology; the disagreement is real and unresolved, and the essay sides with Tye while granting that Block has located the genuine pressure point. That Tye, of all people, reaches for history is the relevant fact: the most developed representationalism on offer does not think structure alone fixes content. ↩
6. Searle (1990) argues that computation is observer-relative — a physical system “computes” only under an interpretation we assign — so computational description cannot, by itself, explain intrinsic intentionality. The essay takes this negative point and leaves Searle’s positive doctrine behind. Searle’s biological naturalism holds that only the specific causal powers of brains can produce understanding; the view defended here replaces “the right biology” with “the right causal-environmental engagement,” which a non-biological system could in principle possess. The negative argument survives the amputation of the positive one. ↩
7. Not everyone takes the contact gap to be fatal, and the most direct contrary voice deserves naming. Vladimír Havlík (2025) argues the reverse of this essay’s conclusion — that large language models do ground the meanings of their expressions, by way of what he calls semantic fragmentism, so that grounding in worldly reference is not a precondition of understanding. I think this mislocates the gap rather than closing it. Semantic fragmentism can explain how a model’s tokens come to bear stable relations to one another; the externalist and teleosemantic considerations above concern what fixes the relation between a token and the world, which is precisely what a text-only training signal never touches. The architectural premise is not what divides us — a text-only model is trained to predict the next token over a corpus of text, full stop — what divides us is whether that suffices for content, and Havlík’s affirmative answer is the live position this essay rejects. ↩
8. The simulation/realization distinction is Searle’s reply to the Brain Simulator objection in “Minds, Brains, and Programs” (1980), generalized: a model of a process is not an instance of it, and whatever a process owes to its physical realization is not delivered by a description of that realization, however exact. The hurricane example makes the point without the contested premises about consciousness — no one is tempted to say the simulated storm is wet — which is why it does cleaner work here than the Chinese Room. ↩
May 25, 2026
Multimodality and the Symbol-Grounding Problem
MIND · MATTER · MEANING No. 31 · May 2026

Multimodality and the Symbol-Grounding Problem

Adding eyes to a language model gives it more pictures, not a world.

An essay mindmatterandmeaning.com

Hold a bruised avocado up to the newest chatbot and it will tell you, with a confidence you have never once earned at a produce counter, that the fruit has about a day left and you should make the guacamole tonight. It can see the avocado. That is the pitch, anyway, and it lands. After years of watching these systems shuffle words around — predicting the next token the way a very well-read parrot predicts the next syllable — here at last is one that looks at your kitchen and answers.

The demos impress, and the feeling they produce is specific: the machine has finally made contact. The symbols have touched down. Whatever was missing in the text-only models — the thing that made us suspect the parrot didn’t know what it was saying¹ — surely closes the moment you give the thing eyes.

Here is the story almost everyone now tells, and I told a version of it myself for longer than I’d like to admit. The old language models lived sealed in a room of words. “Apple” meant nothing to them beyond its statistical company — the other words it tends to travel with. No wonder they made things up; they had never met an apple. But bolt on a camera and a microphone, and “apple” stops being a token rubbing shoulders with other tokens and becomes the round red thing on the counter. Multimodality, on this telling, just is grounding. It is the rope that finally ties the words to the world.

It is a natural thought, and something in its neighborhood is even correct. But the conclusion doesn’t follow, and seeing why it doesn’t pays better than any demo.

Start with what a multimodal model actually eats. It does not eat avocados. It eats images of avocados — arrays of numbers, paired during training with text that humans wrote about them. A photograph has not smuggled a piece of the world into the machine. A photograph is a representation: a flat, frozen, human-made encoding, every bit as much a symbol as the word “avocado,” only written in a richer alphabet. Feed a model a billion captioned pictures and you have fed it a billion more descriptions of the world. You have handed it more symbols, in a new code. You have not handed it more world.

This is the trap Stevan Harnad named in 1990, and Harnad — a cognitive scientist who has spent the better part of his career worrying about how a symbol ever comes to be about anything — gave it a form worth keeping.² Imagine trying to learn Chinese from a Chinese-Chinese dictionary. Every word gets defined in terms of other words, which lead to still other words, around and around, and you never once step outside the circle of symbols to the things they name. No amount of definition conjures meaning out of more symbols; the chain has to touch ground somewhere. Somewhere a symbol has to connect to the thing — not to another symbol — through the system’s own capacity to pick that thing out, sort it, act on it.

Harnad had a sharp way of pricing this. Language, he wrote, lets us “steal” categories quickly and cheaply, through hearsay — I can tell you what a zebra is and spare you the safari. But theft works only because somebody, somewhere, earned the category the hard way, through what he called sensorimotor “toil”: the trial and error of dealing with actual zebras, guided by the cost of getting it wrong. It cannot be theft all the way down.³

And theft all the way down is exactly what multimodality quietly proposes. It tries to buy grounding with a bigger pile of borrowed representations. But a photograph of a zebra is more hearsay, not the safari. The richer alphabet is still an alphabet, and an alphabet, however many characters you add to it, is the kind of thing that needs grounding — never the kind of thing that supplies it.

There’s a deeper reason the input’s richness can’t do the job, and it arrives from the least mystical corner of philosophy. Hilary Putnam — who revised his own positions so often, and so cheerfully, that the restlessness became part of his reputation — argued in 1975 that meanings “just ain’t in the head.”⁴ What a thought is about depends on how the thinker stands to the world, not only on what is happening inside. Two systems can be alike down to the last detail and still mean different things, because they have different histories of contact with different surroundings. Michael Tye, who built one of the most careful versions of the view that an experience just is a way of representing the world, pressed the same point about minds: what a state represents depends partly on the causal history through which the system came to have it.⁵ A system that has tracked ripeness — reached for fruit, been right, been wrong, paid the difference in a bad lunch — has states that are about ripeness. A system assembled from a frozen archive of ripe-labeled photographs has states that are about how humans tended to label photographs. Which is not nothing. It is just not ripeness.

So here is the distinction the grounding story walks straight past. Multimodality adds modalities of representation — more kinds of symbol the system can take in. It does not add modalities of engagement — sensors wired to actuators in a world the system inhabits, a history of tracking real features, and some stake in getting it right.⁶ The first is a matter of feeding the model new file formats. The second is a matter of putting the model on the line. They are not the same project, and no quantity of the first sums to the second. The avocado demo feels like seeing. But seeing is something a creature does in a world it can be wrong about and suffer for being wrong about. What the model does is map an array of numbers onto a likely sentence.⁷ It has never been hungry. It has never been fooled. It has never cut into one and found mush.

The strongest reply grants most of this and turns it around. Fine, the objector says — you’ve already admitted an embodied system could mean things. And multimodal models are precisely the perception stack going into embodied systems: the same vision encoders that caption your avocado get bolted onto robots that pick things up. So you’re knocking down a strawman. Nobody serious claims a static image model is grounded; the claim is that multimodality is step one toward a system that is. The trajectory is the point.

This objection is right about nearly everything, and I want to be careful, because where it’s right is exactly what matters. Yes — a robot that acts in a world, tracks what it touches, and pays for its mistakes could come to mean something by “avocado.” I have no objection in principle; the door stands open. But notice what does the work in that story. The grounding gets accomplished by the acting-in-a-world — the closed loop, the tracking, the stakes — and not by the number of input channels feeding the network. A simple creature with one sense and a body on the line stands nearer to meaning than a thousand-modality oracle trained on a frozen scrape of the internet. So the honest version of the trajectory claim is not “multimodality grounds language.” It is “embodiment might, and multimodality is some of the plumbing.” Those two sentences advertise very different products. The first hands you grounding you have not paid for. The second admits the bill is still outstanding.

The avocado on your counter is ripe or it isn’t, and you settle the question the only way anyone ever has: you cut it open — a small risky act in a world that pushes back and now and then embarrasses you. The model has never once been embarrassed, because it has never been anywhere it could be wrong. Giving it a camera changed what it can be shown. It did not change what it can be answerable to — and answerability to the world, not access to more pictures of it, was the whole of what we were missing. We did not open the model’s eyes. We widened the window of the room it was always in, and hung a sharper picture in the glass.

References

Burge, Tyler. 1979. “Individualism and the Mental.” Midwest Studies in Philosophy 4: 73–121.

Dretske, Fred. 1988. Explaining Behavior: Reasons in a World of Causes. Cambridge, MA: MIT Press.

Dretske, Fred. 1995. Naturalizing the Mind. Cambridge, MA: MIT Press.

Harnad, Stevan. 1990. “The Symbol Grounding Problem.” Physica D 42: 335–346.

Harnad, Stevan. 2002. “Symbol Grounding and the Origin of Language.” In Computationalism: New Directions, edited by Matthias Scheutz, 143–158. Cambridge, MA: MIT Press.

Havlík, Vladimír. 2024. “Meaning and Understanding in Large Language Models.” Synthese 204: 71.

Putnam, Hilary. 1975. “The Meaning of ‘Meaning.’” Minnesota Studies in the Philosophy of Science 7: 131–193.

Searle, John R. 1980. “Minds, Brains, and Programs.” Behavioral and Brain Sciences 3 (3): 417–457.

Tye, Michael. 2019. “Homunculi Heads and Silicon Chips: The Importance of History to Phenomenology.” In Blockheads! Essays on Ned Block’s Philosophy of Mind and Consciousness, edited by Adam Pautz and Daniel Stoljar. Cambridge, MA: MIT Press.

Notes
1. The suspicion is not universal, and honesty requires flagging the dissent. Vladimír Havlík argues that Searle’s assumption of an unbridgeable gap between syntax and semantics is unjustified, and that meaning of a kind can emerge from the distributional and inferential structure a large model internalizes (Havlík 2024). I take the disagreement seriously but read it as a quarrel over what “meaning” must answer to. If content is individuated by world-involving causal relations (see notes 4–6), then distributional structure recovers how a linguistic community uses a term without recovering what anchors the term to the world. On that reading the parrot worry is relocated, not dissolved — which is why this essay presses on grounding rather than on usage. ↩
2. Harnad, “The Symbol Grounding Problem” (1990), poses the problem through the image of trying to learn a first language from a Chinese-Chinese dictionary: an endless circuit of symbol-to-symbol definition that never reaches the world. The claim is not that symbols can never refer, but that reference cannot be conferred by further symbols alone — the regress must terminate in a non-symbolic capacity to identify a category’s members. Note that Harnad’s diagnosis is considerably friendlier to connectionism than Searle’s: the grounding he demands is sensorimotor categorization, a task he takes neural networks to be well suited to learn, given the right embodiment. The argument here is therefore not anti-connectionist; it is anti–disembodied-connectionist. ↩
3. Harnad, “Symbol Grounding and the Origin of Language” (2002): “What language allows us to do is to ‘steal’ categories quickly and effortlessly through hearsay instead of having to earn them the hard way, through risky and time-consuming sensorimotor ‘toil.’” The theft/toil contrast is his. The application is mine: a model trained exclusively on representations attempts the theft with no underwriting toil anywhere in its causal history — not its own, and not, in any content-fixing way, the photographers’. The captioned-image corpus is a vast ledger of other people’s earnings that the model never made. ↩
4. Putnam, “The Meaning of ‘Meaning’” (1975). Twin Earth fixes the individuation of content by external relations: my molecular twin and I, internally identical, mean different substances by “water” because our environments differ (H₂O here, the look-alike “XYZ” there). Burge (“Individualism and the Mental,” 1979) extends the externalism to the social environment. I lean only on the modest thesis — that internal richness underdetermines content — and not on any stronger claim about whether phenomenal character itself is wide. The modest thesis is enough to sink “more pixels equals more meaning.” ↩
5. Tye, “Homunculi Heads and Silicon Chips: The Importance of History to Phenomenology” (2019). Tye accepts Block’s verdict that a “China-body system” duplicating our functional organization at a moment would have no experiences, but argues the reason is historical rather than organizational: the system lacks the causal history through which its states would come to track — and therefore represent — worldly features. Because Tye holds that phenomenal character just is representational content of the right kind, a historical condition on content becomes a condition on experience. (The library’s copy carries a “2011” preprint stamp; the published version appears in the Pautz and Stoljar Blockheads! volume, MIT Press 2019.) For the record, Tye announced a move toward panpsychism in 2024; nothing here depends on that later turn — the historical thesis stands on its own. ↩
6. This is the teleosemantic ingredient, and it is doing quiet but essential work. On Dretske’s account (Explaining Behavior, 1988; Naturalizing the Mind, 1995), a state represents what it has the function of indicating, and functions are acquired through a learning or selectional history in which getting it right and getting it wrong carried consequences. “Stakes” is shorthand for that history: a system for which misrepresentation costs nothing is, on this view, not yet in the business of representation at all. A frozen training corpus supplies correlations in abundance but no such history — which is why scaling the corpus, in any modality, changes the quantity of correlation without manufacturing the one thing teleosemantics says content requires. ↩
7. I bring in Searle’s syntax/semantics argument (“Minds, Brains, and Programs,” 1980) only here, and deliberately not at the front: the educated reader has largely filed the Chinese Room under “answered,” by way of the Systems and Robot replies. But notice that the Robot Reply — the proposal that grounding the symbols in sensors and effectors would supply understanding — concedes precisely this essay’s point. It locates the missing ingredient in embodiment, not in more or richer symbols. Searle himself resists even that, on the ground that bolting transducers onto the room changes nothing happening inside it; whether he is right about that further step is a dispute this essay can leave open, because its target — the claim that multimodal input alone grounds meaning — is one the Robot Reply and Searle both reject. ↩
May 25, 2026
Twin Earth and Semantic Externalism
MIND · MATTER · MEANING May 2026

Twin Earth and Semantic Externalism

Meaning isn’t in the head. The word reaches past the skull to the world.

An essay mindmatterandmeaning.com

Stand in the kitchen and point at the kettle and say water. Nothing about that performance feels mysterious. You meant water; the kettle holds water; the word landed where you sent it. The whole transaction belongs to ordinary life — the kind of thing a competent five-year-old manages a hundred times a day without philosophical assistance.

Now suppose, while you weren’t looking, the kettle had been swapped for one filled with a clear, tasteless liquid whose molecular structure happens to differ from water in every respect that matters at the bench. You can’t tell. The five-year-old can’t tell. The kettle whistles. You pour. You drink. Did you mean water when you said the word, even though no water was anywhere in the room?

This is the question Hilary Putnam asked in 1975, and the answer he gave reshaped the philosophy of mind. The short version: no. You meant water, in the way you ordinarily do, only because real water exists out in the world and your linguistic community has been pointing at it for generations. The pointing happens partly outside your skull. The meaning, accordingly, lives partly outside it too. As Putnam put it, in the line that has been quoted ever since: “meanings just ain’t in the head” (Putnam, 1975).¹

The slogan has the feel of something half-rhetorical. It isn’t. What follows explains why it turns out to be true, what bad picture it replaces, and what difference it makes — including for one of the more aggressive claims you’ll hear at the confident end of contemporary AI discourse.

The picture we usually carry

The bad picture is so familiar it barely registers. When you think of meaning, you probably think of something happening inside a head. A word floats up; an image, or a definition, or a feeling of recognition attaches itself to it; out the word goes, freighted with its little inner cargo. Whatever it is that makes water mean water, the picture says, is some inner state of yours — a concept, a representation, a private mental something — that the word is hooked up to.

The picture has a long pedigree. Descartes built half his metaphysics around it. The British empiricists stocked the mind with ideas the way one stocks a pantry, and spent a great deal of energy reassuring themselves that the pantry tracked the outside world accurately. The cognitive scientists of the 1960s gave the pantry a computational paint job and called the contents internal representations. Same picture throughout: meaning sits inside the head, the head has a private inventory, the word inherits its meaning from the inner item it labels.

What Putnam’s 1975 paper does, with one of the cleanest thought experiments in philosophy, is show that the picture cannot be right.

Twin Earth, and why it bites

Imagine a planet exactly like ours, down to the molecule, except that in every place our world has water, Twin Earth has a different stuff. Putnam called it XYZ. It looks, feels, tastes, and behaves indistinguishably from H₂O at the rough scale of human life. People on Twin Earth wash with it, drink it, complain about its hardness; they call it water. The chemistry beneath the surface differs, but no Twin Earther in 1750 has any way of detecting that difference.

Now consider Oscar on Earth and his molecule-for-molecule duplicate Twin Oscar on Twin Earth, both in 1750, before chemistry exists. Stand them side by side, look inside their skulls, take inventory of every inner item the bad picture would care about: the same brain states, the same images, the same feelings, the same dispositions, the same everything. By the bad picture’s lights, when each says water, each means the same thing.

But each does not mean the same thing. When Oscar says water, his word reaches for H₂O and lands on it, because that is the stuff the linguistic practice he inherited has been about. When Twin Oscar says water, his word reaches for XYZ. They cannot mean the same thing, because their words have different references — different things in the world that they pick out, different conditions under which what they say is true. The meaning differs even though everything inside the head is identical (Putnam, 1975).²

Notice what the thought experiment is not claiming. Oscar and Twin Oscar do not have different inner lives; they have qualitatively identical ones, by stipulation. The point is that those inner lives, however rich, do not by themselves fix what their words are about. Something else does — namely, the actual stuff the community’s linguistic practice has been latching onto across time.

This is what philosophers call semantic externalism — the view that the meaning of a word, and the content of a thought, depends constitutively on factors outside the speaker or thinker. Outside the skull, outside the inner inventory, outside whatever the bad picture wanted to keep tucked away in private mental space.

Why the result generalizes

The water case is the showpiece, but Putnam’s argument doesn’t depend on natural kinds with hidden chemistry. Tyler Burge spent a career defending a more sweeping version — the view he calls anti-individualism — arguing that the same lesson runs through perception, concept possession, and the categories that structure ordinary cognition (Burge, 2010). The reason is structural. A representation succeeds or fails at hitting its target, and what counts as the target gets settled by relations the representation bears to a world. Burge’s signature example: a patient tells his doctor he has arthritis in his thigh. He is simply wrong — arthritis by definition is a disease of the joints — and crucially he is wrong about what his own word means, not because his inner state is defective but because his community’s medical practice has settled the term against him. What a word reaches for is fixed by the practice the word participates in, not by what the speaker pictures when uttering it.³

Alex Byrne and Michael Tye have argued that on the strongest version of representationalism, even the felt character of experience — the qualia earlier philosophy treated as the last private redoubt — depends on the world the experience represents (Byrne & Tye, 2006). If they are right, even the most intimate-seeming features of mental life have an outside leg — a claim Ned Block has pressed hard against, and one the externalist has to earn rather than assume.⁴

The lesson, told plainly: minds reach into a world to do their work. A mental state has the content it has partly in virtue of what, out there, it latches onto. That latching runs through causal, historical, social, and environmental relations all at once. The inside contributes half the mechanism. The outside contributes the other half.

What the LLM defender wants to say

Once you see why meanings can’t be in the head, an objection arrives almost immediately, and these days it usually concerns language models. A defender of the strong-AI line will say something like this: very well, meaning is not in any individual head — but it doesn’t need to be. Modern large language models are trained on the entire textual output of a civilization. The hookings, the practices, the patterns of use, are all there, distributed across the corpus. Whatever fixes meaning for human speakers should fix it for a model that has internalized the practice at scale. The model’s words reach into the same world ours do, through the same network of usage. Why call this anything less than understanding?

Vladimír Havlík defends a sophisticated version of this view. He argues that the meanings of linguistic expressions in LLMs are grounded — in his words — “neither in the world, nor in an internal idea of the world,” but within the linguistic corpus as a whole, and that this turns out to be sufficient for what he calls referential grounding (Havlík, 2024). The picture deserves a fair hearing. If meaning lives in patterns of public use, and a model has absorbed those patterns at civilization scale, then the model — the argument runs — has whatever it takes.⁵

I think the argument fails. And where it fails reveals what Twin Earth really showed.

What Twin Earth really showed

Putnam’s thought experiment said something stronger than meaning lies outside the speaker. It said that meaning depends on the world the practice latches onto. Oscar and Twin Oscar both participate in fluent verbal practice; both communities use water the same way; the difference is only what their respective practices are anchored to. The anchoring fixes which stuff the word reaches for. Talk that is not anchored does not reach.

A language model has the corpus. It does not have the anchoring. It has the residue of the anchoring, frozen in token statistics, with no living relation to the stuff the tokens came from. When the model produces water, no path runs from the word back to any water — not in training, not in deployment, not even, in any straightforward sense, in the data. The data records human anchoring in compressed form; the model inherits a derivative shadow of that anchoring; the shadow does informative work, sometimes spectacularly so — but a shadow of an anchor does not anchor anything. John Searle, the Berkeley philosopher whose Chinese Room argument we will meet again, made an adjacent point four decades ago — syntax, however elaborate, does not constitute semantics (Searle, 1980) — and the externalist diagnosis converges with his from the other side: both isolate the same missing thing, a relation between symbols and the world they purport to describe.⁶ The model does not lack complexity. It lacks that relation.⁷

This isn’t a denial of the model’s achievement, which is real and genuinely impressive in ways I don’t want to minimize. Fluent next-token prediction over a record of meaning counts for something — but it does not count as the same accomplishment as meaning. Meaning is what the record records. The record itself, separated from the activity it records, has no pointing power of its own. Twin Earth tells us so: without the right anchoring, even an inner life qualitatively identical to ours fails to mean what we mean. A model without any anchoring at all does no better.

Where this leaves the reader

The point isn’t to settle the AI question in one essay — that takes a longer argument — but to remove a misleading picture that gets in the way of seeing the question clearly. The picture says meaning is a stuff inside heads, and minds reach into the world by carrying that stuff outward. The picture is wrong. Meaning is what minds do when they reach — a relation, not an inner cargo. The kettle whistles, the word lands, the kettle holds water, and the linguistic community has been latching onto water for a long time. That whole arrangement is what makes your word work. None of it lives between your ears alone.

The mind doesn’t make meaning by storing it. It makes meaning by reaching — and a reach with nothing at the far end is not a reach. It’s a gesture.

Notes
1. Putnam’s argument runs through “The Meaning of ‘Meaning’” (in K. Gunderson, ed., Minnesota Studies in the Philosophy of Science, vol. 7 [Minneapolis: University of Minnesota Press, 1975], 131–193), with the slogan at p. 144. The slogan is often quoted as if it were the conclusion; in Putnam’s text it sits midway through the development, after the Twin Earth case has done its work and before the apparatus of stereotype, normal form description, and the division of linguistic labor is introduced. The full position is more structural than the slogan suggests: meaning, on Putnam’s account, supervenes on a four-element vector (syntactic markers, semantic markers, stereotype, extension), and only the first three live “in the head.” The extension — the actual stuff the word picks out — lies outside, and the extension is constitutive of meaning. The division of linguistic labor does additional load-bearing work the slogan hides: a lay speaker can mean gold without being able to tell gold from pyrite, because the community houses experts whose discriminations the lay speaker defers to. Reference is thus a collective achievement distributed across a community and its history, not a private hookup renewed in each head — a point that matters directly when the question turns to a system that has the corpus but stands in no deferential relation to any expert in it. ↩
2. The Twin Earth argument depends on two further commitments Putnam developed in parallel with Saul Kripke’s work in Naming and Necessity (Cambridge, MA: Harvard University Press, 1980; the lectures were delivered in 1970). Natural-kind terms like water are rigid designators: they pick out the same kind in every possible world in which that kind exists. And the identity water = H₂O is a necessary truth discovered a posteriori: not derivable from the concept of water alone, but, once established, holding of metaphysical necessity. These two commitments together explain why Oscar’s water and Twin Oscar’s water cannot have the same reference even when their inner states are identical. It bears emphasis against a standard misreading: the indexicality of water (Putnam’s “this liquid, the same liquid as that“) does not relocate the difference back inside the head as a difference in narrow content. The demonstrative reaches its referent only through a causal-historical relation to a sample, and it is that relation — not any inner accompaniment of the demonstrating — that differs between the twins. ↩
3. Burge’s case is developed first in “Individualism and the Mental,” Midwest Studies in Philosophy 4 (1979): 73–121, and expanded across decades into the systematic anti-individualism of Origins of Objectivity (Oxford: Oxford University Press, 2010), esp. chaps. 2–3. Burge’s claim is stronger than Putnam’s along two axes. First, it needs no hidden microstructure: the arthritis case turns on a purely social fact — that “arthritis” is, in the speaker’s community, a disease of the joints — so the externalist conclusion extends to artifact and institutional terms (sofa, contract) that have no chemical essence at all. Second, and more carefully than the main text’s compression allows: Burge’s point is not that the patient is ignorant of a dictionary entry, but that the content of his belief is fixed by his community’s practice despite his incomplete grasp of the term. The patient genuinely believes that his arthritis has spread to his thigh — a false belief about arthritis, deferring to a practice that determines what “arthritis” picks out — rather than a true belief about some idiosyncratic private concept. Strip away the community and the very identity of the concept he is deploying goes indeterminate. Content, that is, is constitutively dependent on relations the thinker bears to a wider linguistic and physical environment, not merely causally downstream of them. ↩
4. Byrne and Tye, “Qualia Ain’t in the Head,” Noûs 40, no. 2 (2006): 241–255. The argument runs as an externalist extension of Tye’s strong representationalism (Ten Problems of Consciousness [Cambridge, MA: MIT Press, 1995], esp. chaps. 4–5): if phenomenal character is identical to representational content of the right kind, and if representational content is itself externally determined (per Putnam, Burge, and the wider tradition), then phenomenal character cannot be wholly internal to the perceiving system. The strongest objection is Ned Block’s “mental paint” line (Block, “Mental Paint,” in M. Hahn and B. Ramberg, eds., Reflections and Replies: Essays on the Philosophy of Tyler Burge [Cambridge, MA: MIT Press, 2003], 165–200): there are, Block argues, intrinsic phenomenal features — the “paint” on the inner canvas — that vary independently of any represented worldly property, as in cases of phenomenal inversion or in afterimages, so that two experiences could represent the same scene yet differ in felt character. Met at full strength, the objection is answered, not conceded, by holding the line on the identity claim: the cases Block adduces are redescribed as differences in what is represented (a difference in represented hue, an afterimage represented as a colored region of the visual field) rather than as residue left over once representational content is fixed. “Mental paint” names exactly the inner cargo the externalist denies; to grant it as an independent variable would be to smuggle the bad picture back in under a new label. The present essay endorses the strong representationalist line and takes the burden Block identifies — to redescribe every putative case of paint without remainder — to be one the view can carry. ↩
5. Havlík, “Meaning and Understanding in Large Language Models,” Synthese 204, article 71 (2024). Havlík distinguishes three candidate locations for the grounding of LLM meanings — the world, an internal world-model, and the linguistic corpus itself — and argues that the first two cannot be required of an LLM without begging questions about what counts as grounding; his positive proposal is that meaning in LLMs is grounded intra-linguistically, within the corpus, so that referential success becomes a property of distributional structure rather than of any speaker-world relation. The objection pressed in the main text grants Havlík his negative point — referential grounding is indeed not the only way to fix meaning for a symbolic system — while denying the positive one: intra-linguistic structure can do real semantic work (disambiguation, inference, paraphrase) precisely because it inherits the compressed trace of relations the original speakers bore to the world, but it cannot do the constitutive work the externalist tradition has identified, because the relation that did that work was severed at training time and the trace is not the relation. Compare the converging diagnosis of Emily M. Bender and Alexander Koller, “Climbing towards NLU: On Meaning, Form, and Understanding in the Age of Data,” Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (2020): 5185–5198, who argue from the side of form what externalism argues from the side of reference: a system exposed to form alone, however much of it, has no route to communicative intent or to the world that intent is about. ↩
6. Searle, “Minds, Brains, and Programs,” Behavioral and Brain Sciences 3, no. 3 (1980): 417–424; followed up in “Is the Brain a Digital Computer?” Proceedings and Addresses of the American Philosophical Association 64, no. 3 (1990): 21–37. For the present essay’s externalist diagnosis, Searle’s two arguments converge from different directions: the 1980 paper isolates the syntax/semantics gap from the side of what symbol manipulation alone delivers (a syntactic engine, however fast, never crosses into semantics by running faster); the 1990 paper isolates it from the side of what counts as a symbol in the first place (syntax is not intrinsic to physics — it is assigned by an interpreter, which threatens any account that hopes to read off semantics from a system’s formal structure). The externalist arrives at the same gap from a third direction: even granting determinate symbols, their reference is fixed by relations to a world, and those relations are not among the system’s formal properties. Three independent roads, one missing ingredient. ↩
7. The relevant technical distinction: training statistics encode the distributional facts about how speakers in a corpus deploy tokens relative to one another, but do not encode the referential facts about which extensions those deployments succeeded in picking out. The two are correlated — because the human speakers were anchored — but the correlation does not survive the move from the speakers to the trained model: the model has access to the shadow the anchoring cast on usage, not to the anchoring. This is why the natural reply — “but a model can be grounded, via multimodal training, robotic embodiment, or retrieval against a live environment” — is not a counterexample but a concession in disguise. Each such proposal works precisely by restoring some causal-historical relation between the system’s symbols and the world, which is the externalist’s point: reference is purchased by anchoring, and where genuine anchoring is added the verdict can change (cf. Michael Tye, “How Can We Tell if a Machine is Conscious?” Inquiry [advance online publication, 2024], https://doi.org/10.1080/0020174X.2024.2434856, on the embodied conditions under which machine reference could succeed). The claim of this essay is narrow and exact: a text-only model trained on a static corpus has no such relation, and so its fluency, however vast, is fluency over a record of meaning rather than an instance of it. ↩
May 12, 2026