Mind, Matter, and Meaning

Tag: large language models

Borrowed Meaning
MIND · MATTER · MEANING No. 38 · May 2026

Borrowed Meaning

A model’s words mean something — for us, not for it.

An essay mindmatterandmeaning.com

A well-trained language model produces a sentence about a maple tree. The sentence parses, predicts, and pleases. Did the model say something about a maple tree?

A growing line of argument answers yes — not by claiming the model has thoughts, perceives leaves, or means anything in the rich, mental sense, but by claiming something cleverer. The model’s outputs, the argument goes, inherit their meaning the way a counterfeit twenty inherits the design of a real twenty: by lineage. The tokens belong to a public linguistic practice with its own teleology. They have been selected, refined, and stabilized through generations of human speakers using “maple” to talk about maples. When an LLM emits the word, it produces a token of a type whose proper function — in Millikan’s sense — already exists. The model does not need to mean anything mentally. The word does the meaning for it. Call this the borrowed-meaning move.

It is a serious argument. Jumbly Grindrod’s “Large Language Models and Linguistic Intentionality” works from Gareth Evans on naming practices and Ruth Millikan on teleosemantics, and one cannot wave the view off by pointing out, again, that the model lacks consciousness.¹ Grindrod has agreed the model lacks consciousness. He has carved out a space — linguistic intentionality — that allegedly does not require any mind on the speaker’s end at all. The thought is bracing: meaning, in the public sense, lives in the practice, and any device that successfully participates in the practice is meaning-bearing whether or not anyone is home.

The view deserves the strongest version one can give it. Words really do have public lives. “Water” refers to H₂O whether the speaker can recite the formula or not; a sneeze that sounds like “achoo” does not refer, but a careful utterance of “achoo” in a Burns Night recitation might; lineage matters. Millikan built her account precisely to capture this: a token gets its content from the cooperative history that selected it.² Producers and consumers, refined over generations, settle what a sign is for. So far, so good.

The trouble starts when the LLM walks into the cooperative history and asks for a seat at the table.

Millikan’s mechanism has two halves, and the borrowed-meaning move quietly drops one. Producers make signs; consumers take them up; selection happens because consumers’ uptake feeds back into which producer-tokens persist.³ A bee that waggles in the wrong direction starves the hive; a hive that ignores good waggles starves itself. Selection requires that the loop close — that getting it right about the world makes the difference between thriving and not. Strip out the loop and you have not teleosemantics; you have stenography with extra steps.

Now the LLM. What does it select for? It selects for tokens that look, statistically, like the tokens that came before them. The objective function rewards plausibility under a distribution, not correctness about maples. The model’s “consumers” — its loss function, its trainers, its users — do not punish it when its outputs misrepresent maples; they punish it when its outputs read poorly. There is a feedback loop, but the loop runs through human readability, not through the world. When humans read fluently, the model is reinforced; when humans wince, it is corrected. The maple has no vote.

This matters because the proper function Grindrod wants to inherit was forged in a different kind of loop. The word “maple” stabilized in human speech because, over generations, calling maples maples helped people find sap, identify wood, build sugar shacks, and not eat the wrong leaves. The lineage runs through successful engagement with maples. When a contemporary speaker uses the word and gets things right or wrong, she is the latest carrier of that lineage because her uses, too, are accountable to maples. The line of descent is not just morphological; it is causal-ecological.⁴

The LLM joins the morphology and skips the ecology. Its tokens are the right shape, but the path by which they arrived bypasses the maples entirely. To insist that they nonetheless inherit maple-content is to confuse the costume with the role. A child wearing a postman’s uniform delivers nothing.

One can almost hear the rebuttal forming: surely the model’s training data was itself produced by speakers whose words were maple-accountable, and so the lineage runs through the model’s outputs after all. This is the move that makes the argument feel airtight, and it deserves to be taken seriously. But notice what it requires. It requires that being trained on tokens produced by maple-accountable speakers counts as participating in the maple-accountable lineage. By that standard, my photocopier participates too, and so does the optical scanner that produced the JPEG of someone else’s botany textbook. If the criterion for inheritance is “your outputs causally trace back to outputs produced by accountable speakers,” it sweeps in every device that traffics in linguistic shapes. We have not extended teleosemantics; we have diluted it into ink-on-paper.

A sharper friend of the view will press here, because the diagnosis so far leans on the producer side and Millikan’s deeper innovation was the consumer side. So grant it: maybe the model is no producer worth the name, but its human users surely are consumers — they take up its tokens, act on them, and feed their satisfaction or dismay back into the next round of training. Doesn’t that close the loop after all, with us standing in as the consumers Millikan requires? It is the best version of the objection, and it almost works. What it leaves out is what the consumers are selecting for. A Millikanian consumer closes a loop only when its uptake tracks whether the sign got the world right; the hoverfly’s visual system is a consumer of the bee-dance only because hoverflies that misread the dance leave fewer offspring. The human reading an LLM’s paragraph is selecting for whether the paragraph reads well, coheres, flatters the prompt — and a fluent falsehood passes that filter as smoothly as a fluent truth. The consumer is real; the kind of selection is wrong. We are consumers of the model’s prose the way we are consumers of a pleasant melody, not the way the hive is a consumer of the waggle. The loop closes on us, and stops there, well short of the maple.⁵

The serious version of the argument has to add something: that the LLM does not merely repeat tokens but produces new tokens whose proper function is settled by the practice. Fine. But the proper function of “maple” — the thing the lineage selected for — was the function of being deployed in maple-accountable ways. A producer that emits “maple” without any sensitivity to whether maples are present is, in Millikan’s own terms, not performing the function. It is doing something function-shaped. Grindrod knows this; he concedes that Millikan herself would resist his application.⁶ That concession is the ballgame. The framework was built around a kind of accountability the LLM does not have, and once you remove the accountability, the inheritance has nothing to inherit.

There is a softer version of the borrowed-meaning move that survives all this and is worth keeping. LLM outputs are not meaningless noises; they are parasitic on meaning. They function as inscriptions — like the words in a book lying open on a desk. The book is full of meaning in the sense that meaning runs through it: the author meant things, and a competent reader will recover them. We do not therefore conclude that the book is a meaner. The book is a vehicle. So is the model.

Calling the model a meaner because its outputs traffic in meaningful tokens is the same kind of category mistake as calling a perfectly transcribed prayer a worshipper. The transcription preserves the words; the worshipper supplies the world-directedness; the difference, in our tradition, is the whole point.⁷ Strip it away and you have not democratized intentionality. You have just mislaid it.

What is left, then, of the model’s apparent eloquence? Quite a lot, actually — and this is where the gentlemanly part of the knife fight ends in a handshake. I use these tools. I find them genuinely useful. LLMs are extraordinary engines of linguistic regularity. They surface patterns in our practices that we ourselves had not articulated. They are useful in the way a very good concordance is useful, except they generate as well as retrieve. Treating them as oracles dishonors them; treating them as colleagues misclassifies them; treating them as new instruments of inquiry treats them right.

Borrowed meaning is still borrowed. The maple still does the work. And anyone who writes the word — child, philosopher, model, footnote-machine — only joins the practice by doing what the practice was always for: getting things right about the world, or being corrected when one fails. The model does not fail in that way, because it cannot. It does not succeed in that way either. It produces the shape of success, which is a real and beautiful thing, and which our language has a perfectly good word for.

We call it style.

Notes
1. Jumbly Grindrod, “Large Language Models and Linguistic Intentionality,” Synthese 204:71 (2024). Grindrod’s strategy is to grant the cognitive emptiness of LLMs and recover meaning at a different level: the linguistic practice itself. Drawing on Gareth Evans’s distinction between producers and consumers of a naming practice (Evans, The Varieties of Reference, ed. John McDowell [Oxford: Clarendon, 1982], ch. 11), and on Millikan’s teleosemantic account of conventional signs, Grindrod argues that an LLM can stand in the consumer role even though it lacks the demonstrative-recognitional capacities Evans required of producers. The argument is the strongest version of the LLM-meaning move currently in print; the response developed here grants the structure and contests the inheritance step. ↩
2. Ruth Garrett Millikan, Language, Thought, and Other Biological Categories: New Foundations for Realism (Cambridge, MA: MIT Press, 1984), chs. 1–2; and “Biosemantics,” Journal of Philosophy 86 (1989): 281–297. Millikan’s “proper function” is the function a trait has in virtue of the evolutionary or learning history that selected for its predecessors. Applied to signs, the proper function of a token-type is what its ancestors did that made it persist. The water-as-H₂O case is most directly developed in Hilary Putnam, “The Meaning of ‘Meaning,’” in Mind, Language and Reality: Philosophical Papers, Volume 2 (Cambridge: Cambridge University Press, 1975), 215–271; the convergence between Putnam’s causal-historical externalism and Millikan’s teleosemantics is one of the more underappreciated agreements in twentieth-century semantics. ↩
3. The producer-consumer asymmetry is central to Millikan’s account and is what distinguishes teleosemantics from mere informational semantics à la Dretske (Knowledge and the Flow of Information [Cambridge, MA: MIT Press, 1981]). Information by itself does not generate the normative dimension — the difference between succeeding and failing at representation — because mere correlation between sign and signified does not yet involve the kind of cooperative history that lets a sign be wrong. The bee-dance example is Millikan’s own (Language, Thought, and Other Biological Categories, 96–98); the philosophical point is that proper function lives downstream of consumer uptake, not in the producer’s intrinsic states. Peter Godfrey-Smith, “Mental Representation, Naturalism, and Teleosemantics,” in Teleosemantics: New Philosophical Essays, ed. Graham MacDonald and David Papineau (Oxford: Oxford University Press, 2006), 42–68, provides the cleanest critical overview. ↩
4. The morphology/ecology distinction sharpens an old worry. Stevan Harnad’s symbol-grounding problem (Harnad, “The Symbol Grounding Problem,” Physica D 42 [1990]: 335–346; and “Symbol Grounding and the Origin of Language,” in Computationalism: New Directions, ed. Matthias Scheutz [Cambridge, MA: MIT Press, 2002], 143–158) holds that the meaning of a formal symbol system cannot be fixed by relations among symbols alone, on pain of regress — the symbols must be grounded in a non-symbolic capacity to sort, label, and interact with what they denote. Harnad’s vivid gloss is that language lets us “steal” categories by hearsay rather than “earn” them through sensorimotor “toil,” but the theft presupposes that some categories were earned the hard way: “it cannot be linguistic theft all the way down” (2002, abstract). The borrowed-meaning move is precisely an attempt to make it theft all the way down — to let the LLM inherit grounded content without any grounding of its own. Teleosemantics is supposed to be the framework that explains how grounding gets transmitted across a lineage; the present objection is that transmission, in Millikan’s sense, requires the consumer’s uptake to remain world-accountable, which is exactly the link the LLM’s training loop severs. The ecology is the grounding; the morphology is the theft. ↩
5. The consumer-side reply is the strongest objection to the argument, since it grants the producer-side point and relocates the loop in human users. The reply fails because Millikanian consumption is not mere uptake but selection-relevant uptake: the consumer’s responses must covary with the sign’s worldly accuracy in a way that differentially preserves accurate producer-tokens. This is the feature that, on Godfrey-Smith’s reading, distinguishes a genuine teleosemantic feedback process from a merely causal one — see Peter Godfrey-Smith, “Mental Representation, Naturalism, and Teleosemantics,” in Teleosemantics: New Philosophical Essays, ed. Graham MacDonald and David Papineau (Oxford: Oxford University Press, 2006), 42–68, esp. the discussion of which feedback loops Millikan’s biological cases license. Human satisfaction with an LLM paragraph covaries with fluency, coherence, and prompt-fit, not with the paragraph’s accuracy about its ostensible subject — fluent falsehood and fluent truth pass the same filter. The point parallels the standard charge against pure informational semantics (Dretske, Knowledge and the Flow of Information [Cambridge, MA: MIT Press, 1981]): correlation alone yields no norm of correctness, because a sign that merely tracks what its consumers reward cannot thereby be wrong about the world. The LLM’s consumers reward readability; readability is not a world-tracking norm; so the loop, though real, is the wrong kind of loop. ↩
6. Grindrod, “Large Language Models and Linguistic Intentionality,” §4. The concession that Millikan would resist his application is doing more work than Grindrod treats it as doing. Millikan’s framework is not merely a name-tag for “tokens have public meanings”; it is a specific account of what makes a token-type have a meaning, and the answer is that consumers’ uptake of the token, in selection-relevant feedback loops, shapes which producer-tokens persist. Strip out the consumer side of the loop — which is exactly what the LLM case does — and the framework no longer applies. The response in the main text follows the line developed in Millikan, “What Has Natural Information to Do with Intentional Representation?” Royal Institute of Philosophy Supplement 49 (2001): 105–125. For a sympathetic but firm rebuttal of LLM teleosemantic inheritance from a different angle, see Marek Havlík, “Meaning and Understanding in Large Language Models,” Synthese 203:113 (2024). ↩
7. The vehicle/content distinction at work here echoes Tim Crane’s deployment of it in “Is There a Perceptual Relation?”, in Perceptual Experience, ed. Tamar Szabó Gendler and John Hawthorne (Oxford: Oxford University Press, 2006), 126–146, and connects to the Chinese Room’s underlying point in Searle, “Minds, Brains, and Programs,” Behavioral and Brain Sciences 3 (1980): 417–424 — that formal manipulation of meaningful tokens is not itself a meaning-bearing activity. The prayer-transcription analogy is mine; the structural point — that a vehicle which carries meaning does not thereby produce meaning — is widely shared across the realist tradition the book sits within. Ch. 9 develops the Searlean version of the diagnosis at length. ↩
May 25, 2026
The Word “Hallucination” Was Already Taken
MIND · MATTER · MEANING No. 36 · May 2026

The Word “Hallucination” Was Already Taken

A system with no inside can’t hallucinate — only drift.

An essay mindmatterandmeaning.com

A chatbot hands you a footnote. It cites a paper that does not exist. The author it names never wrote anything close to the title. The journal volume runs ten issues short, and the page numbers point into empty air. You copy the citation into a search engine, find nothing, and go back to the chat window with the now-standard complaint: it hallucinated again. Everyone in this little drama uses the word as if it carried no philosophical freight at all — as if the engineers had flipped through the dictionary, hunting for something punchy, and simply landed on the right term.

They did not land on the right term. They landed on a word that already had a job, and the job mattered.

In philosophy, hallucination names something quite specific. A person has an experience that seems, from the inside, to present a real object — a pink rat on the kitchen counter, a friend at the foot of the bed — when no such object is there. The experience happens. The object does not. And the whole difficulty lives in that word seems. The hallucinator looks out at what feels like a world and meets no resistance in it; nothing on the inside of the experience whispers that it has failed.¹ Philosophers disagree, sometimes fiercely, about what that shared appearance comes to.² But every account worth having agrees on one thing: a hallucination happens to someone. It needs a subject who seems to see.

Now look at what the engineer means. A model trained on text produces a string that mentions a paper it has never encountered. No seeming is involved. Nothing inside the system scans a row of citations and concludes, falsely, that one of them is real. The model has no vantage point from which the false output looks like a world. It does not have a world. It has a distribution over tokens, shaped by your prompt, and a sampling step that picked one path through that distribution rather than another. The output strays from the truth because nothing in its training rewarded tracking the truth this finely. The phenomenon is real, and it deserves a name. Hallucination simply names the wrong shape.

Why did the word stick? Partly because it sounds clinical and forgiving at the same time. It pathologizes the model gently, as though it had caught a passing fever. The alternative is to say plainly what every text-only system does: it strings together plausible continuations whether or not those continuations track anything. And partly the word stuck because it smuggles in a familiar picture — a mind, turned inward, deceived. The old Cartesian theater reopens for one more show, this time staged inside a server rack. By sheer suggestion, the model becomes a tiny subject, now and then misled. Once that picture takes hold, the question how do we stop the model from hallucinating? sounds answerable, the way medicating a patient sounds answerable. The harder question, the one the picture hides, never even gets asked.

Here is that harder question. To misrepresent anything, a system has to be tied to the world tightly enough that something fixes when it succeeds and when it fails. The teleosemantic tradition locates that tie in a system’s biological or designed function: a state misrepresents when it fires outside the conditions it was built to track.³ Searle gets to a kindred conclusion by another road — genuine meaning shows up only where formal symbol-shuffling connects to real causal, embodied dealings with the world.⁴ Either way, misrepresentation is an achievement. A thing has to first be the kind of thing that can represent. Only then can it, on a given occasion, get something wrong.

A text-only language model has no such footing to lose. Its outputs ride on statistical patterns combed out of a corpus, and the corpus stands in for the world only in the thin sense that the humans who wrote it were writing about the world. Nothing in the model’s loop checks its outputs against any state of affairs out there. The very idea of the model getting it wrong imports a yardstick the model cannot hold. We hold it for it. We are the ones who notice the missing journal issue. The model notices nothing.

Some philosophers push back on this hard line, and they deserve a hearing. Marek Havlík argues for what he calls semantic fragmentism: the claim that language models do achieve real meaning, not everywhere, but within bounded patches of language where their training is dense and coherent.⁵ The view is trying to honor something obvious — the gulf between Eliza shuffling canned phrases and a modern model translating, summarizing, and holding a dozen constraints in the air at once. Fair enough. But fragmentism still owes us an account of what fixes meaning inside those patches. If the answer is use within a corpus, it has only moved the form/meaning gap somewhere harder to see. If the answer is grounding in the world, it has conceded the whole point.⁶

None of this makes the engineering problem vanish once we take the philosophical word back. The problem stays. It just gets more honest. What the model does, when it fabricates a citation, is closer to confabulation — a word we already use for fluent narration produced without access to the facts the narration claims to report. Or, more plainly still: drift from a standard the system cannot detect. Neither phrase will ever move a product launch. Both have the modest merit of being true.

The cost of the borrowed word shows up in the questions the field lets itself ask. Ask whether a model can be made to stop hallucinating, and you have quietly assumed it has a grip on the world that slipped — and that the right tweak will tighten it. Ask instead what a system would need before its outputs counted as representations at all, and you walk straight into the harder country: sensors, a body, a causal history, the long apprenticeship through which a creature comes to mean cat by the word “cat.” Better questions tend to make better engineering. They also, as it happens, make better metaphysics.

No one is giving the word back. The AI industry does not borrow vocabulary and then return it, and there is something almost endearing about the theft — a field moving so fast it will cheerfully lift a clinical term from the discipline next door and call the lifting naming. But look at what the borrowing does. It plants, at the dead center of the most consequential technology story of the decade, a word that describes an inner theater inside a system that has no inside. The pretense earns its keep. It quietly props up the very confusion this whole project has been working, patiently, to take apart. A model that hallucinates sounds like a mind on the mend. A model that drifts from facts it cannot detect sounds like exactly what it is. And once we hear it as what it is, we can finally ask the real question: what would have to be added before a system could be capable of getting anything wrong at all?

Notes
1. Tim Crane, “Is There a Perceptual Relation?”, in Perceptual Experience, ed. Tamar Szabó Gendler and John Hawthorne (Oxford: Oxford University Press, 2006), 126–146; “Introspection, Intentionality, and the Transparency of Experience,” Philosophical Topics 28 (2000): 49–67; and “The Problem of Perception,” Stanford Encyclopedia of Philosophy (rev. 2021, with Craig French). On Crane’s intentionalist treatment, hallucination is a representational state whose content fails to match the world; the phenomenal character of the state arises from how it represents, not from any inner object the subject is alleged to inspect. This sits within strong representationalism and explains why a hallucination seems to present a worldly object — it represents one, just inaccurately. The treatment is congenial to the present essay’s claim that hallucination is a content-failure of a world-directed state, and it is the account the main text of Chapter 4 develops and Ch04.2 (“The Pain in the Toe That Isn’t There”) applies to phantom limb. The point of citing it here is to fix the philosophical meaning of hallucination before the engineering metaphor co-opts the word. ↩
2. M.G.F. Martin, “The Transparency of Experience,” Mind and Language 17 (2002): 376–425, and “On Being Alienated,” in Perceptual Experience, ed. Tamar Szabó Gendler and John Hawthorne (Oxford: Oxford University Press, 2006), 354–410. Martin’s disjunctivism denies the common factor assumption — that veridical perception and a subjectively indistinguishable hallucination share a metaphysically substantive mental state. On his view a hallucination is characterized only negatively, as a state indistinguishable through reflection from a veridical perception of a particular kind. The book sits closer to Crane’s intentionalist account than to Martin’s negative epistemic disjunctivism (see Ch. 3 on the argument from illusion), which is exactly the disagreement the main text gestures at with “sometimes fiercely.” Both camps nonetheless converge on the point that does the work here: the philosophical use of hallucination requires a subject who seems to encounter a world. ↩
3. Ruth Garrett Millikan, Language, Thought, and Other Biological Categories (Cambridge, MA: MIT Press, 1984), chs. 1–2; and Karen Neander, A Mark of the Mental: In Defense of Informational Teleosemantics (Cambridge, MA: MIT Press, 2017). Neander’s “informational teleosemantics” extends Millikan’s framework by tying representational content to the conditions a system is functionally adapted to detect — its informational functions, which carry what she calls “normative aboutness” — rather than to the conditions it merely happens to correlate with. The misrepresentation case then comes out clean: a state misrepresents when it occurs outside the conditions its function was selected to track. The frog’s bug-detector firing at a passing pellet (Chapter 6’s example) is the canonical illustration. Crucially for the present argument, neither Millikan nor Neander offers any route by which a text-only LLM could misrepresent, since on neither account can a representational achievement be inherited without the selection history that grounds it (see Ch06.1 for the developed argument). ↩
4. John Searle, “Minds, Brains, and Programs,” Behavioral and Brain Sciences 3 (1980): 417–424; and “Is the Brain a Digital Computer?”, Proceedings and Addresses of the American Philosophical Association 64 (1990): 21–37. Searle’s two-step argument — first that syntax does not yield semantics (the Chinese Room), then that computation is itself observer-relative — is the spine of Chapter 9’s case against treating LLMs as understanders. (A guarded note for the careful reader: the Chinese Room alone does not establish the strong, fully general claim that no syntactic process could ever yield semantics; the book leans on the observer-relativity argument, not the thought experiment in isolation, to carry that weight.) The point relevant here is narrower. Searle’s distinction between systems with genuine, world-grounded semantic content and systems that merely emit semantic-shaped tokens makes the engineering term hallucination a category mistake: a system without content to begin with has no content to misrepresent. What it has is output drift relative to an external standard. ↩
5. Marek Havlík, “Meaning and Understanding in Large Language Models,” Synthese 204 (2024). Havlík’s “semantic fragmentism” (developed in his §3.7) is the more sympathetic edge of the contemporary LLM-meaning debate: rather than denying LLMs any relation to content, he argues that they achieve bounded semantic competence within domains where their training distribution is dense and coherent. The book grants the empirical observation — modern models really do show competence gradations across domains — but resists the inference that domain-bounded statistical coherence amounts to genuine semantic content. The fragmentist position trades on the same conflation Bender and Koller diagnose (next note): the slide from “the form is right” to “the meaning is there.” A useful contrast piece is Jumbly Grindrod, “Large Language Models and Linguistic Intentionality,” Synthese 204:71 (2024), discussed at length in Ch06.1. ↩
6. Emily M. Bender and Alexander Koller, “Climbing towards NLU: On Meaning, Form, and Understanding in the Age of Data,” in Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (2020), 5185–5198. Their “octopus test” is the cleanest engineering-side statement of the syntax/semantics gap for large language models: a hyperintelligent octopus that taps an undersea cable carrying two islanders’ chatter could learn every statistical regularity in their messages without ever encountering a coconut or a predator, and would still produce fluent replies it does not understand — until the day real help is needed and the fluency fails. The argument recasts Searle’s Chinese Room in distributional-semantics terms and has been much discussed in the NLP literature without being much heeded. The form/meaning gap they diagnose is the same gap this essay turns on: textual coherence is not world-grounded representation, and a system that lacks the latter cannot, strictly, misrepresent in the way hallucinate implies. ↩
May 25, 2026
What a Machine Would Have to Earn
MIND · MATTER · MEANING No. 29 · May 2026

What a Machine Would Have to Earn

Understanding is earned in a world, not performed on a screen.

An essay mindmatterandmeaning.com

A friend sent me a transcript last spring. He had asked a chatbot what a sunburn feels like the morning after — that specific tight, hot, can’t-find-a-way-to-lie-down misery — and the machine answered better than he could have. It named the flinch when a shirt seam drags across the shoulders. It knew the small betrayal of forgetting for a second and leaning back into a hot car seat. He found it uncanny, a little moving, and he wanted to know: does it understand what a sunburn is?

Good question, asked at the right moment. The honest answer takes a while to earn, so let me start with the answer most of us reach for first — because it’s reasonable, and because it’s wrong.

The reasonable view goes like this. Understanding shows up in what you can do. A student who can answer any question about the French Revolution, field the follow-ups, catch the trick ones, and explain the whole thing to a ten-year-old — that student understands the French Revolution, and we would be cranks to deny it on the grounds that we can’t peer inside her skull. Understanding is as understanding does. So if a machine handles every question about sunburns as well as a sunburned person could, the difference between the machine and the person starts to look like a difference we invented to feel special about ourselves. The picture has a respectable pedigree: it descends from behaviorism, and it has a famous instrument in Alan Turing’s imitation game, where the test for thinking just is indistinguishable performance.

Notice the quiet assumption, though. The picture takes understanding a word to be a matter of using it correctly, and takes “correctly” to be settled by looking only at the outputs. Pull on that thread and the whole thing comes apart in your hands.

Stevan Harnad, a cognitive scientist with a gift for naming traps, named this one in 1990: the symbol grounding problem.¹ Imagine trying to learn Chinese from a Chinese-only dictionary. Every definition sends you to other entries, which send you to others, and you ride that merry-go-round forever without once touching the ground. A system whose symbols are defined only by more symbols never means anything by them. Meaning gets in only when some of the symbols connect to the things they are about by some route other than further symbols — when “red” hooks to red, not merely to “crimson,” “scarlet,” and “the color of a stop sign.”

What supplies the hook is not anything inside the system. Hilary Putnam made the case unforgettable with a thought experiment about Twin Earth — a planet just like ours except that the stuff they call “water” there is some other compound with all of water’s surface features.² A person here and their molecular duplicate there can be internally identical, down to the atom, and still mean different things by “water,” because the word answers to the stuff in the world, not to the state of the head. “Meanings,” Putnam wrote, “just ain’t in the head.” Tyler Burge pushed the same point from the social side: what your word “arthritis” picks out depends on the practice of the community you defer to, not on a private definition you carry around.³ Content lives in a relation — between a system, a world, and the company it keeps.

There is even a natural story about how the relation gets built. On teleosemantic accounts — Ruth Millikan’s and Fred Dretske’s, chiefly — a state comes to be about something by acquiring the function of tracking it, the way a frog’s strike comes to be about flies through a long history in which catching flies is what kept frogs going.⁴ The clinching detail is misrepresentation: to get something wrong, a system has to have been in the business of getting it right. A state can mean fly and fire at a passing pellet only because its job, fixed by history, was flies. No history, no job; no job, nothing to be mistaken about; nothing to be mistaken about, no content.

So understanding a word turns out to be an achievement, not a knack: it consists in having states that are genuinely about the world — not states that merely accompany the right answers, but states directed at the very things the words name — and aboutness is something a system earns over time. Your “red” means red because red things have been pushing on you, through eyes and skin and the small stakes of an actual life, since before you could pronounce the word. This is what people are gesturing at, usually too vaguely, when they say minds are embodied. The word invites mysticism, so let me drain it of any. Embodiment names three sober requirements: the system takes in the world through senses and acts back on it; its inner states have been shaped by real traffic with the features they represent; and those states are there to track a world the system inhabits, not merely to emit the right strings. Michael Tye — who spent three decades building the most careful theory we have of how experience could be nothing more than representational content, and then argued that even his own theory needs history — makes the sharpest version of the point. Two creatures could be intrinsically identical at an instant, he argues, and still differ in what they experience, because one has a past of tracking the world and the other was assembled, atom for atom, five minutes ago.⁵ History is not decoration on content. It is part of what fixes it.

Which lets me say, at last, what a machine would actually need. Not the right stuff — I don’t think the barrier is silicon, and here I part company with John Searle, who ties understanding to the specific causal powers of biological brains.⁶ The barrier isn’t carbon; it’s a world. A system understands when its inner states have been shaped by, and stay answerable to, the things they represent — when it senses and acts, lives under stakes, and can get things wrong and pay for it. Build that, and the door to genuine artificial understanding stands open. I mean open, not slyly closed. The claim here is not the tired one that machines could never understand. It is that understanding is earned through engagement, and there is no coupon for skipping the engagement.

Skipping the engagement is precisely what today’s text-only language models do. A large model learns the statistics of how we talk — the staggeringly intricate shape of which words follow which — from a corpus of descriptions of the world, never from the world.⁷ It has read everything ever written about sunburns and has never once had skin. Its “red” is a position in an immense map of words, anchored to other words, anchored to nothing outside the map. The fluency is real and the achievement is genuine; it is simply not the achievement of understanding.

Here the strongest objection arrives, and it deserves a real hearing rather than a brush-off. If the machine’s answers became indistinguishable — in principle, not merely in today’s practice — from an understander’s, then insisting it still lacks understanding looks like clinging to a ghost. A difference that makes no detectable difference, the objection runs, is no difference at all. That is the whole moral of the imitation game, and it is not a silly one.

But “makes no difference you can detect in the output” is the definition of a good simulation, not the absence of a difference. Simulate a hurricane to any precision you please: the equations are flawless and your desk stays bone dry. Modeling a process is not running it.⁸ Two systems can produce the very same words while one means them and the other reports the statistics of how the word gets used — because meaning was never a property of the output. It lives in the history behind the output, and that history is exactly what an output test cannot see. The objection mistakes the instrument for the quarry. It notices that the meter reads the same and concludes there is nothing the meter is missing.

So: does the machine understand what a sunburn is? It has never had skin. It has never flinched, never dreaded an evening because of how the sheets would feel. It holds the words and not the world the words are about. Ask the question again in some later decade, of some later system that has spent years bumping into things and paying for its errors, and the answer could come back different — that is the part the doom-mongers and the hype-merchants both manage to miss. Understanding is not a performance a system delivers. It is a debt a system pays, to the world, in the one currency the world accepts: contact. Until the bill comes due, fluency is only fluency. It was always going to be the easy part.

References

Burge, Tyler. 1979. “Individualism and the Mental.” Midwest Studies in Philosophy 4: 73–121.

Dretske, Fred. 1988. Explaining Behavior: Reasons in a World of Causes. Cambridge, MA: MIT Press.

Harnad, Stevan. 1990. “The Symbol Grounding Problem.” Physica D 42: 335–346.

Harnad, Stevan. 2002. “Symbol Grounding and the Origin of Language.” In Computationalism: New Directions, edited by Matthias Scheutz. Cambridge, MA: MIT Press.

Havlík, Vladimír. 2025. “Meaning and Understanding in Large Language Models.” Synthese 205: 9.

Millikan, Ruth Garrett. 1989. “Biosemantics.” Journal of Philosophy 86 (6): 281–297.

Putnam, Hilary. 1975. “The Meaning of ‘Meaning.’” In Mind, Language and Reality: Philosophical Papers, Volume 2, 215–271. Cambridge: Cambridge University Press.

Searle, John R. 1980. “Minds, Brains, and Programs.” Behavioral and Brain Sciences 3 (3): 417–457.

Searle, John R. 1990. “Is the Brain a Digital Computer?” Proceedings and Addresses of the American Philosophical Association 64 (3): 21–37.

Tye, Michael. 2019. “Homunculi Heads and Silicon Chips: The Importance of History to Phenomenology.” In Blockheads! Essays on Ned Block’s Philosophy of Mind and Consciousness, edited by Adam Pautz and Daniel Stoljar. Cambridge, MA: MIT Press.

Notes
1. Harnad (1990) coined “the symbol grounding problem” and framed it with the Chinese-dictionary regress; he later tied it to the origin of language (Harnad 2002). The problem is older than the label — it is the computational heir of the externalist worry about how any representation latches onto its object — but Harnad’s formulation is the one the AI literature inherited, and it is sharper than the Chinese Room for present purposes because it isolates grounding from Searle’s further claims about consciousness. ↩
2. Putnam (1975). The conclusion is specifically about reference and extension: the content that fixes what “water” is true of does not supervene on the speaker’s intrinsic states. Note that Putnam later qualified his own semantic externalism in several directions; nothing here turns on the most contested versions of the thesis, only on the minimal claim that reference depends on causal-environmental relations the head alone does not settle. ↩
3. Burge (1979) extends externalism from natural-kind reference (Putnam) to social content: holding a thinker’s physical history fixed while varying the surrounding linguistic community varies which concept the thinker exercises. The two cases are independent routes to the same structural conclusion — internal organization underdetermines content — which is why the essay leans on both rather than treating Burge as a footnote to Putnam. ↩
4. The teleosemantic tradition, principally Millikan (1989) and Dretske (1988), grounds content in proper function: a state represents what it has the function of tracking, where functions are fixed by selection or learning history. Misrepresentation is the standard adequacy test for any naturalistic theory of content, since a theory on which states cannot be false has not yet described representation. Rival tracking theories handle reliable misrepresentation differently, but the historical structure — content fixed by what a state was for — is common ground and is what the embodiment argument borrows. ↩
5. Tye (2019). The thesis is that two beings intrinsically alike at a time can differ in phenomenal character because they differ in history — a representationalist’s concession that current intrinsic structure does not suffice. Ned Block replies in the same volume (“Fading Qualia: A Response to Michael Tye”) that a subject could be radically wrong about their own phenomenology; the disagreement is real and unresolved, and the essay sides with Tye while granting that Block has located the genuine pressure point. That Tye, of all people, reaches for history is the relevant fact: the most developed representationalism on offer does not think structure alone fixes content. ↩
6. Searle (1990) argues that computation is observer-relative — a physical system “computes” only under an interpretation we assign — so computational description cannot, by itself, explain intrinsic intentionality. The essay takes this negative point and leaves Searle’s positive doctrine behind. Searle’s biological naturalism holds that only the specific causal powers of brains can produce understanding; the view defended here replaces “the right biology” with “the right causal-environmental engagement,” which a non-biological system could in principle possess. The negative argument survives the amputation of the positive one. ↩
7. Not everyone takes the contact gap to be fatal, and the most direct contrary voice deserves naming. Vladimír Havlík (2025) argues the reverse of this essay’s conclusion — that large language models do ground the meanings of their expressions, by way of what he calls semantic fragmentism, so that grounding in worldly reference is not a precondition of understanding. I think this mislocates the gap rather than closing it. Semantic fragmentism can explain how a model’s tokens come to bear stable relations to one another; the externalist and teleosemantic considerations above concern what fixes the relation between a token and the world, which is precisely what a text-only training signal never touches. The architectural premise is not what divides us — a text-only model is trained to predict the next token over a corpus of text, full stop — what divides us is whether that suffices for content, and Havlík’s affirmative answer is the live position this essay rejects. ↩
8. The simulation/realization distinction is Searle’s reply to the Brain Simulator objection in “Minds, Brains, and Programs” (1980), generalized: a model of a process is not an instance of it, and whatever a process owes to its physical realization is not delivered by a description of that realization, however exact. The hurricane example makes the point without the contested premises about consciousness — no one is tempted to say the simulated storm is wet — which is why it does cleaner work here than the Chinese Room. ↩
May 25, 2026
Multimodality and the Symbol-Grounding Problem
MIND · MATTER · MEANING No. 31 · May 2026

Multimodality and the Symbol-Grounding Problem

Adding eyes to a language model gives it more pictures, not a world.

An essay mindmatterandmeaning.com

Hold a bruised avocado up to the newest chatbot and it will tell you, with a confidence you have never once earned at a produce counter, that the fruit has about a day left and you should make the guacamole tonight. It can see the avocado. That is the pitch, anyway, and it lands. After years of watching these systems shuffle words around — predicting the next token the way a very well-read parrot predicts the next syllable — here at last is one that looks at your kitchen and answers.

The demos impress, and the feeling they produce is specific: the machine has finally made contact. The symbols have touched down. Whatever was missing in the text-only models — the thing that made us suspect the parrot didn’t know what it was saying¹ — surely closes the moment you give the thing eyes.

Here is the story almost everyone now tells, and I told a version of it myself for longer than I’d like to admit. The old language models lived sealed in a room of words. “Apple” meant nothing to them beyond its statistical company — the other words it tends to travel with. No wonder they made things up; they had never met an apple. But bolt on a camera and a microphone, and “apple” stops being a token rubbing shoulders with other tokens and becomes the round red thing on the counter. Multimodality, on this telling, just is grounding. It is the rope that finally ties the words to the world.

It is a natural thought, and something in its neighborhood is even correct. But the conclusion doesn’t follow, and seeing why it doesn’t pays better than any demo.

Start with what a multimodal model actually eats. It does not eat avocados. It eats images of avocados — arrays of numbers, paired during training with text that humans wrote about them. A photograph has not smuggled a piece of the world into the machine. A photograph is a representation: a flat, frozen, human-made encoding, every bit as much a symbol as the word “avocado,” only written in a richer alphabet. Feed a model a billion captioned pictures and you have fed it a billion more descriptions of the world. You have handed it more symbols, in a new code. You have not handed it more world.

This is the trap Stevan Harnad named in 1990, and Harnad — a cognitive scientist who has spent the better part of his career worrying about how a symbol ever comes to be about anything — gave it a form worth keeping.² Imagine trying to learn Chinese from a Chinese-Chinese dictionary. Every word gets defined in terms of other words, which lead to still other words, around and around, and you never once step outside the circle of symbols to the things they name. No amount of definition conjures meaning out of more symbols; the chain has to touch ground somewhere. Somewhere a symbol has to connect to the thing — not to another symbol — through the system’s own capacity to pick that thing out, sort it, act on it.

Harnad had a sharp way of pricing this. Language, he wrote, lets us “steal” categories quickly and cheaply, through hearsay — I can tell you what a zebra is and spare you the safari. But theft works only because somebody, somewhere, earned the category the hard way, through what he called sensorimotor “toil”: the trial and error of dealing with actual zebras, guided by the cost of getting it wrong. It cannot be theft all the way down.³

And theft all the way down is exactly what multimodality quietly proposes. It tries to buy grounding with a bigger pile of borrowed representations. But a photograph of a zebra is more hearsay, not the safari. The richer alphabet is still an alphabet, and an alphabet, however many characters you add to it, is the kind of thing that needs grounding — never the kind of thing that supplies it.

There’s a deeper reason the input’s richness can’t do the job, and it arrives from the least mystical corner of philosophy. Hilary Putnam — who revised his own positions so often, and so cheerfully, that the restlessness became part of his reputation — argued in 1975 that meanings “just ain’t in the head.”⁴ What a thought is about depends on how the thinker stands to the world, not only on what is happening inside. Two systems can be alike down to the last detail and still mean different things, because they have different histories of contact with different surroundings. Michael Tye, who built one of the most careful versions of the view that an experience just is a way of representing the world, pressed the same point about minds: what a state represents depends partly on the causal history through which the system came to have it.⁵ A system that has tracked ripeness — reached for fruit, been right, been wrong, paid the difference in a bad lunch — has states that are about ripeness. A system assembled from a frozen archive of ripe-labeled photographs has states that are about how humans tended to label photographs. Which is not nothing. It is just not ripeness.

So here is the distinction the grounding story walks straight past. Multimodality adds modalities of representation — more kinds of symbol the system can take in. It does not add modalities of engagement — sensors wired to actuators in a world the system inhabits, a history of tracking real features, and some stake in getting it right.⁶ The first is a matter of feeding the model new file formats. The second is a matter of putting the model on the line. They are not the same project, and no quantity of the first sums to the second. The avocado demo feels like seeing. But seeing is something a creature does in a world it can be wrong about and suffer for being wrong about. What the model does is map an array of numbers onto a likely sentence.⁷ It has never been hungry. It has never been fooled. It has never cut into one and found mush.

The strongest reply grants most of this and turns it around. Fine, the objector says — you’ve already admitted an embodied system could mean things. And multimodal models are precisely the perception stack going into embodied systems: the same vision encoders that caption your avocado get bolted onto robots that pick things up. So you’re knocking down a strawman. Nobody serious claims a static image model is grounded; the claim is that multimodality is step one toward a system that is. The trajectory is the point.

This objection is right about nearly everything, and I want to be careful, because where it’s right is exactly what matters. Yes — a robot that acts in a world, tracks what it touches, and pays for its mistakes could come to mean something by “avocado.” I have no objection in principle; the door stands open. But notice what does the work in that story. The grounding gets accomplished by the acting-in-a-world — the closed loop, the tracking, the stakes — and not by the number of input channels feeding the network. A simple creature with one sense and a body on the line stands nearer to meaning than a thousand-modality oracle trained on a frozen scrape of the internet. So the honest version of the trajectory claim is not “multimodality grounds language.” It is “embodiment might, and multimodality is some of the plumbing.” Those two sentences advertise very different products. The first hands you grounding you have not paid for. The second admits the bill is still outstanding.

The avocado on your counter is ripe or it isn’t, and you settle the question the only way anyone ever has: you cut it open — a small risky act in a world that pushes back and now and then embarrasses you. The model has never once been embarrassed, because it has never been anywhere it could be wrong. Giving it a camera changed what it can be shown. It did not change what it can be answerable to — and answerability to the world, not access to more pictures of it, was the whole of what we were missing. We did not open the model’s eyes. We widened the window of the room it was always in, and hung a sharper picture in the glass.

References

Burge, Tyler. 1979. “Individualism and the Mental.” Midwest Studies in Philosophy 4: 73–121.

Dretske, Fred. 1988. Explaining Behavior: Reasons in a World of Causes. Cambridge, MA: MIT Press.

Dretske, Fred. 1995. Naturalizing the Mind. Cambridge, MA: MIT Press.

Harnad, Stevan. 1990. “The Symbol Grounding Problem.” Physica D 42: 335–346.

Harnad, Stevan. 2002. “Symbol Grounding and the Origin of Language.” In Computationalism: New Directions, edited by Matthias Scheutz, 143–158. Cambridge, MA: MIT Press.

Havlík, Vladimír. 2024. “Meaning and Understanding in Large Language Models.” Synthese 204: 71.

Putnam, Hilary. 1975. “The Meaning of ‘Meaning.’” Minnesota Studies in the Philosophy of Science 7: 131–193.

Searle, John R. 1980. “Minds, Brains, and Programs.” Behavioral and Brain Sciences 3 (3): 417–457.

Tye, Michael. 2019. “Homunculi Heads and Silicon Chips: The Importance of History to Phenomenology.” In Blockheads! Essays on Ned Block’s Philosophy of Mind and Consciousness, edited by Adam Pautz and Daniel Stoljar. Cambridge, MA: MIT Press.

Notes
1. The suspicion is not universal, and honesty requires flagging the dissent. Vladimír Havlík argues that Searle’s assumption of an unbridgeable gap between syntax and semantics is unjustified, and that meaning of a kind can emerge from the distributional and inferential structure a large model internalizes (Havlík 2024). I take the disagreement seriously but read it as a quarrel over what “meaning” must answer to. If content is individuated by world-involving causal relations (see notes 4–6), then distributional structure recovers how a linguistic community uses a term without recovering what anchors the term to the world. On that reading the parrot worry is relocated, not dissolved — which is why this essay presses on grounding rather than on usage. ↩
2. Harnad, “The Symbol Grounding Problem” (1990), poses the problem through the image of trying to learn a first language from a Chinese-Chinese dictionary: an endless circuit of symbol-to-symbol definition that never reaches the world. The claim is not that symbols can never refer, but that reference cannot be conferred by further symbols alone — the regress must terminate in a non-symbolic capacity to identify a category’s members. Note that Harnad’s diagnosis is considerably friendlier to connectionism than Searle’s: the grounding he demands is sensorimotor categorization, a task he takes neural networks to be well suited to learn, given the right embodiment. The argument here is therefore not anti-connectionist; it is anti–disembodied-connectionist. ↩
3. Harnad, “Symbol Grounding and the Origin of Language” (2002): “What language allows us to do is to ‘steal’ categories quickly and effortlessly through hearsay instead of having to earn them the hard way, through risky and time-consuming sensorimotor ‘toil.’” The theft/toil contrast is his. The application is mine: a model trained exclusively on representations attempts the theft with no underwriting toil anywhere in its causal history — not its own, and not, in any content-fixing way, the photographers’. The captioned-image corpus is a vast ledger of other people’s earnings that the model never made. ↩
4. Putnam, “The Meaning of ‘Meaning’” (1975). Twin Earth fixes the individuation of content by external relations: my molecular twin and I, internally identical, mean different substances by “water” because our environments differ (H₂O here, the look-alike “XYZ” there). Burge (“Individualism and the Mental,” 1979) extends the externalism to the social environment. I lean only on the modest thesis — that internal richness underdetermines content — and not on any stronger claim about whether phenomenal character itself is wide. The modest thesis is enough to sink “more pixels equals more meaning.” ↩
5. Tye, “Homunculi Heads and Silicon Chips: The Importance of History to Phenomenology” (2019). Tye accepts Block’s verdict that a “China-body system” duplicating our functional organization at a moment would have no experiences, but argues the reason is historical rather than organizational: the system lacks the causal history through which its states would come to track — and therefore represent — worldly features. Because Tye holds that phenomenal character just is representational content of the right kind, a historical condition on content becomes a condition on experience. (The library’s copy carries a “2011” preprint stamp; the published version appears in the Pautz and Stoljar Blockheads! volume, MIT Press 2019.) For the record, Tye announced a move toward panpsychism in 2024; nothing here depends on that later turn — the historical thesis stands on its own. ↩
6. This is the teleosemantic ingredient, and it is doing quiet but essential work. On Dretske’s account (Explaining Behavior, 1988; Naturalizing the Mind, 1995), a state represents what it has the function of indicating, and functions are acquired through a learning or selectional history in which getting it right and getting it wrong carried consequences. “Stakes” is shorthand for that history: a system for which misrepresentation costs nothing is, on this view, not yet in the business of representation at all. A frozen training corpus supplies correlations in abundance but no such history — which is why scaling the corpus, in any modality, changes the quantity of correlation without manufacturing the one thing teleosemantics says content requires. ↩
7. I bring in Searle’s syntax/semantics argument (“Minds, Brains, and Programs,” 1980) only here, and deliberately not at the front: the educated reader has largely filed the Chinese Room under “answered,” by way of the Systems and Robot replies. But notice that the Robot Reply — the proposal that grounding the symbols in sensors and effectors would supply understanding — concedes precisely this essay’s point. It locates the missing ingredient in embodiment, not in more or richer symbols. Searle himself resists even that, on the ground that bolting transducers onto the room changes nothing happening inside it; whether he is right about that further step is a dispute this essay can leave open, because its target — the claim that multimodal input alone grounds meaning — is one the Robot Reply and Searle both reject. ↩
May 25, 2026
Twin Earth and Semantic Externalism
MIND · MATTER · MEANING May 2026

Twin Earth and Semantic Externalism

Meaning isn’t in the head. The word reaches past the skull to the world.

An essay mindmatterandmeaning.com

Stand in the kitchen and point at the kettle and say water. Nothing about that performance feels mysterious. You meant water; the kettle holds water; the word landed where you sent it. The whole transaction belongs to ordinary life — the kind of thing a competent five-year-old manages a hundred times a day without philosophical assistance.

Now suppose, while you weren’t looking, the kettle had been swapped for one filled with a clear, tasteless liquid whose molecular structure happens to differ from water in every respect that matters at the bench. You can’t tell. The five-year-old can’t tell. The kettle whistles. You pour. You drink. Did you mean water when you said the word, even though no water was anywhere in the room?

This is the question Hilary Putnam asked in 1975, and the answer he gave reshaped the philosophy of mind. The short version: no. You meant water, in the way you ordinarily do, only because real water exists out in the world and your linguistic community has been pointing at it for generations. The pointing happens partly outside your skull. The meaning, accordingly, lives partly outside it too. As Putnam put it, in the line that has been quoted ever since: “meanings just ain’t in the head” (Putnam, 1975).¹

The slogan has the feel of something half-rhetorical. It isn’t. What follows explains why it turns out to be true, what bad picture it replaces, and what difference it makes — including for one of the more aggressive claims you’ll hear at the confident end of contemporary AI discourse.

The picture we usually carry

The bad picture is so familiar it barely registers. When you think of meaning, you probably think of something happening inside a head. A word floats up; an image, or a definition, or a feeling of recognition attaches itself to it; out the word goes, freighted with its little inner cargo. Whatever it is that makes water mean water, the picture says, is some inner state of yours — a concept, a representation, a private mental something — that the word is hooked up to.

The picture has a long pedigree. Descartes built half his metaphysics around it. The British empiricists stocked the mind with ideas the way one stocks a pantry, and spent a great deal of energy reassuring themselves that the pantry tracked the outside world accurately. The cognitive scientists of the 1960s gave the pantry a computational paint job and called the contents internal representations. Same picture throughout: meaning sits inside the head, the head has a private inventory, the word inherits its meaning from the inner item it labels.

What Putnam’s 1975 paper does, with one of the cleanest thought experiments in philosophy, is show that the picture cannot be right.

Twin Earth, and why it bites

Imagine a planet exactly like ours, down to the molecule, except that in every place our world has water, Twin Earth has a different stuff. Putnam called it XYZ. It looks, feels, tastes, and behaves indistinguishably from H₂O at the rough scale of human life. People on Twin Earth wash with it, drink it, complain about its hardness; they call it water. The chemistry beneath the surface differs, but no Twin Earther in 1750 has any way of detecting that difference.

Now consider Oscar on Earth and his molecule-for-molecule duplicate Twin Oscar on Twin Earth, both in 1750, before chemistry exists. Stand them side by side, look inside their skulls, take inventory of every inner item the bad picture would care about: the same brain states, the same images, the same feelings, the same dispositions, the same everything. By the bad picture’s lights, when each says water, each means the same thing.

But each does not mean the same thing. When Oscar says water, his word reaches for H₂O and lands on it, because that is the stuff the linguistic practice he inherited has been about. When Twin Oscar says water, his word reaches for XYZ. They cannot mean the same thing, because their words have different references — different things in the world that they pick out, different conditions under which what they say is true. The meaning differs even though everything inside the head is identical (Putnam, 1975).²

Notice what the thought experiment is not claiming. Oscar and Twin Oscar do not have different inner lives; they have qualitatively identical ones, by stipulation. The point is that those inner lives, however rich, do not by themselves fix what their words are about. Something else does — namely, the actual stuff the community’s linguistic practice has been latching onto across time.

This is what philosophers call semantic externalism — the view that the meaning of a word, and the content of a thought, depends constitutively on factors outside the speaker or thinker. Outside the skull, outside the inner inventory, outside whatever the bad picture wanted to keep tucked away in private mental space.

Why the result generalizes

The water case is the showpiece, but Putnam’s argument doesn’t depend on natural kinds with hidden chemistry. Tyler Burge spent a career defending a more sweeping version — the view he calls anti-individualism — arguing that the same lesson runs through perception, concept possession, and the categories that structure ordinary cognition (Burge, 2010). The reason is structural. A representation succeeds or fails at hitting its target, and what counts as the target gets settled by relations the representation bears to a world. Burge’s signature example: a patient tells his doctor he has arthritis in his thigh. He is simply wrong — arthritis by definition is a disease of the joints — and crucially he is wrong about what his own word means, not because his inner state is defective but because his community’s medical practice has settled the term against him. What a word reaches for is fixed by the practice the word participates in, not by what the speaker pictures when uttering it.³

Alex Byrne and Michael Tye have argued that on the strongest version of representationalism, even the felt character of experience — the qualia earlier philosophy treated as the last private redoubt — depends on the world the experience represents (Byrne & Tye, 2006). If they are right, even the most intimate-seeming features of mental life have an outside leg — a claim Ned Block has pressed hard against, and one the externalist has to earn rather than assume.⁴

The lesson, told plainly: minds reach into a world to do their work. A mental state has the content it has partly in virtue of what, out there, it latches onto. That latching runs through causal, historical, social, and environmental relations all at once. The inside contributes half the mechanism. The outside contributes the other half.

What the LLM defender wants to say

Once you see why meanings can’t be in the head, an objection arrives almost immediately, and these days it usually concerns language models. A defender of the strong-AI line will say something like this: very well, meaning is not in any individual head — but it doesn’t need to be. Modern large language models are trained on the entire textual output of a civilization. The hookings, the practices, the patterns of use, are all there, distributed across the corpus. Whatever fixes meaning for human speakers should fix it for a model that has internalized the practice at scale. The model’s words reach into the same world ours do, through the same network of usage. Why call this anything less than understanding?

Vladimír Havlík defends a sophisticated version of this view. He argues that the meanings of linguistic expressions in LLMs are grounded — in his words — “neither in the world, nor in an internal idea of the world,” but within the linguistic corpus as a whole, and that this turns out to be sufficient for what he calls referential grounding (Havlík, 2024). The picture deserves a fair hearing. If meaning lives in patterns of public use, and a model has absorbed those patterns at civilization scale, then the model — the argument runs — has whatever it takes.⁵

I think the argument fails. And where it fails reveals what Twin Earth really showed.

What Twin Earth really showed

Putnam’s thought experiment said something stronger than meaning lies outside the speaker. It said that meaning depends on the world the practice latches onto. Oscar and Twin Oscar both participate in fluent verbal practice; both communities use water the same way; the difference is only what their respective practices are anchored to. The anchoring fixes which stuff the word reaches for. Talk that is not anchored does not reach.

A language model has the corpus. It does not have the anchoring. It has the residue of the anchoring, frozen in token statistics, with no living relation to the stuff the tokens came from. When the model produces water, no path runs from the word back to any water — not in training, not in deployment, not even, in any straightforward sense, in the data. The data records human anchoring in compressed form; the model inherits a derivative shadow of that anchoring; the shadow does informative work, sometimes spectacularly so — but a shadow of an anchor does not anchor anything. John Searle, the Berkeley philosopher whose Chinese Room argument we will meet again, made an adjacent point four decades ago — syntax, however elaborate, does not constitute semantics (Searle, 1980) — and the externalist diagnosis converges with his from the other side: both isolate the same missing thing, a relation between symbols and the world they purport to describe.⁶ The model does not lack complexity. It lacks that relation.⁷

This isn’t a denial of the model’s achievement, which is real and genuinely impressive in ways I don’t want to minimize. Fluent next-token prediction over a record of meaning counts for something — but it does not count as the same accomplishment as meaning. Meaning is what the record records. The record itself, separated from the activity it records, has no pointing power of its own. Twin Earth tells us so: without the right anchoring, even an inner life qualitatively identical to ours fails to mean what we mean. A model without any anchoring at all does no better.

Where this leaves the reader

The point isn’t to settle the AI question in one essay — that takes a longer argument — but to remove a misleading picture that gets in the way of seeing the question clearly. The picture says meaning is a stuff inside heads, and minds reach into the world by carrying that stuff outward. The picture is wrong. Meaning is what minds do when they reach — a relation, not an inner cargo. The kettle whistles, the word lands, the kettle holds water, and the linguistic community has been latching onto water for a long time. That whole arrangement is what makes your word work. None of it lives between your ears alone.

The mind doesn’t make meaning by storing it. It makes meaning by reaching — and a reach with nothing at the far end is not a reach. It’s a gesture.

Notes
1. Putnam’s argument runs through “The Meaning of ‘Meaning’” (in K. Gunderson, ed., Minnesota Studies in the Philosophy of Science, vol. 7 [Minneapolis: University of Minnesota Press, 1975], 131–193), with the slogan at p. 144. The slogan is often quoted as if it were the conclusion; in Putnam’s text it sits midway through the development, after the Twin Earth case has done its work and before the apparatus of stereotype, normal form description, and the division of linguistic labor is introduced. The full position is more structural than the slogan suggests: meaning, on Putnam’s account, supervenes on a four-element vector (syntactic markers, semantic markers, stereotype, extension), and only the first three live “in the head.” The extension — the actual stuff the word picks out — lies outside, and the extension is constitutive of meaning. The division of linguistic labor does additional load-bearing work the slogan hides: a lay speaker can mean gold without being able to tell gold from pyrite, because the community houses experts whose discriminations the lay speaker defers to. Reference is thus a collective achievement distributed across a community and its history, not a private hookup renewed in each head — a point that matters directly when the question turns to a system that has the corpus but stands in no deferential relation to any expert in it. ↩
2. The Twin Earth argument depends on two further commitments Putnam developed in parallel with Saul Kripke’s work in Naming and Necessity (Cambridge, MA: Harvard University Press, 1980; the lectures were delivered in 1970). Natural-kind terms like water are rigid designators: they pick out the same kind in every possible world in which that kind exists. And the identity water = H₂O is a necessary truth discovered a posteriori: not derivable from the concept of water alone, but, once established, holding of metaphysical necessity. These two commitments together explain why Oscar’s water and Twin Oscar’s water cannot have the same reference even when their inner states are identical. It bears emphasis against a standard misreading: the indexicality of water (Putnam’s “this liquid, the same liquid as that“) does not relocate the difference back inside the head as a difference in narrow content. The demonstrative reaches its referent only through a causal-historical relation to a sample, and it is that relation — not any inner accompaniment of the demonstrating — that differs between the twins. ↩
3. Burge’s case is developed first in “Individualism and the Mental,” Midwest Studies in Philosophy 4 (1979): 73–121, and expanded across decades into the systematic anti-individualism of Origins of Objectivity (Oxford: Oxford University Press, 2010), esp. chaps. 2–3. Burge’s claim is stronger than Putnam’s along two axes. First, it needs no hidden microstructure: the arthritis case turns on a purely social fact — that “arthritis” is, in the speaker’s community, a disease of the joints — so the externalist conclusion extends to artifact and institutional terms (sofa, contract) that have no chemical essence at all. Second, and more carefully than the main text’s compression allows: Burge’s point is not that the patient is ignorant of a dictionary entry, but that the content of his belief is fixed by his community’s practice despite his incomplete grasp of the term. The patient genuinely believes that his arthritis has spread to his thigh — a false belief about arthritis, deferring to a practice that determines what “arthritis” picks out — rather than a true belief about some idiosyncratic private concept. Strip away the community and the very identity of the concept he is deploying goes indeterminate. Content, that is, is constitutively dependent on relations the thinker bears to a wider linguistic and physical environment, not merely causally downstream of them. ↩
4. Byrne and Tye, “Qualia Ain’t in the Head,” Noûs 40, no. 2 (2006): 241–255. The argument runs as an externalist extension of Tye’s strong representationalism (Ten Problems of Consciousness [Cambridge, MA: MIT Press, 1995], esp. chaps. 4–5): if phenomenal character is identical to representational content of the right kind, and if representational content is itself externally determined (per Putnam, Burge, and the wider tradition), then phenomenal character cannot be wholly internal to the perceiving system. The strongest objection is Ned Block’s “mental paint” line (Block, “Mental Paint,” in M. Hahn and B. Ramberg, eds., Reflections and Replies: Essays on the Philosophy of Tyler Burge [Cambridge, MA: MIT Press, 2003], 165–200): there are, Block argues, intrinsic phenomenal features — the “paint” on the inner canvas — that vary independently of any represented worldly property, as in cases of phenomenal inversion or in afterimages, so that two experiences could represent the same scene yet differ in felt character. Met at full strength, the objection is answered, not conceded, by holding the line on the identity claim: the cases Block adduces are redescribed as differences in what is represented (a difference in represented hue, an afterimage represented as a colored region of the visual field) rather than as residue left over once representational content is fixed. “Mental paint” names exactly the inner cargo the externalist denies; to grant it as an independent variable would be to smuggle the bad picture back in under a new label. The present essay endorses the strong representationalist line and takes the burden Block identifies — to redescribe every putative case of paint without remainder — to be one the view can carry. ↩
5. Havlík, “Meaning and Understanding in Large Language Models,” Synthese 204, article 71 (2024). Havlík distinguishes three candidate locations for the grounding of LLM meanings — the world, an internal world-model, and the linguistic corpus itself — and argues that the first two cannot be required of an LLM without begging questions about what counts as grounding; his positive proposal is that meaning in LLMs is grounded intra-linguistically, within the corpus, so that referential success becomes a property of distributional structure rather than of any speaker-world relation. The objection pressed in the main text grants Havlík his negative point — referential grounding is indeed not the only way to fix meaning for a symbolic system — while denying the positive one: intra-linguistic structure can do real semantic work (disambiguation, inference, paraphrase) precisely because it inherits the compressed trace of relations the original speakers bore to the world, but it cannot do the constitutive work the externalist tradition has identified, because the relation that did that work was severed at training time and the trace is not the relation. Compare the converging diagnosis of Emily M. Bender and Alexander Koller, “Climbing towards NLU: On Meaning, Form, and Understanding in the Age of Data,” Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (2020): 5185–5198, who argue from the side of form what externalism argues from the side of reference: a system exposed to form alone, however much of it, has no route to communicative intent or to the world that intent is about. ↩
6. Searle, “Minds, Brains, and Programs,” Behavioral and Brain Sciences 3, no. 3 (1980): 417–424; followed up in “Is the Brain a Digital Computer?” Proceedings and Addresses of the American Philosophical Association 64, no. 3 (1990): 21–37. For the present essay’s externalist diagnosis, Searle’s two arguments converge from different directions: the 1980 paper isolates the syntax/semantics gap from the side of what symbol manipulation alone delivers (a syntactic engine, however fast, never crosses into semantics by running faster); the 1990 paper isolates it from the side of what counts as a symbol in the first place (syntax is not intrinsic to physics — it is assigned by an interpreter, which threatens any account that hopes to read off semantics from a system’s formal structure). The externalist arrives at the same gap from a third direction: even granting determinate symbols, their reference is fixed by relations to a world, and those relations are not among the system’s formal properties. Three independent roads, one missing ingredient. ↩
7. The relevant technical distinction: training statistics encode the distributional facts about how speakers in a corpus deploy tokens relative to one another, but do not encode the referential facts about which extensions those deployments succeeded in picking out. The two are correlated — because the human speakers were anchored — but the correlation does not survive the move from the speakers to the trained model: the model has access to the shadow the anchoring cast on usage, not to the anchoring. This is why the natural reply — “but a model can be grounded, via multimodal training, robotic embodiment, or retrieval against a live environment” — is not a counterexample but a concession in disguise. Each such proposal works precisely by restoring some causal-historical relation between the system’s symbols and the world, which is the externalist’s point: reference is purchased by anchoring, and where genuine anchoring is added the verdict can change (cf. Michael Tye, “How Can We Tell if a Machine is Conscious?” Inquiry [advance online publication, 2024], https://doi.org/10.1080/0020174X.2024.2434856, on the embodied conditions under which machine reference could succeed). The claim of this essay is narrow and exact: a text-only model trained on a static corpus has no such relation, and so its fluency, however vast, is fluency over a record of meaning rather than an instance of it. ↩
May 12, 2026