Can an AI truly understand meaning without grounding?

Can an AI truly understand meaning without grounding?

von HUF03 Nguyễn Phú Cường -
For decades, Computational Lexicographers and linguists built structured databases like WordNet to manually map how words relate to the real world (e.g., a "chair" is a ...

mehr...

For decades, Computational Lexicographers and linguists built structured databases like WordNet to manually map how words relate to the real world (e.g., a "chair" is a physical object used for sitting). Modern Large Language Models (LLMs), however, learn meaning strictly through distributional semantics—calculating the statistical probability of words appearing near each other. They have never seen a chair, felt its texture, or sat in one.

Re: Can an AI truly understand meaning without grounding?

von HUF03 Nguyễn Phú Cường -
a machine understands a word if it can navigate its complex relationships within a linguistic system, using lexical databases and statistical patterns to distinguish ...

mehr...

a machine understands a word if it can navigate its complex relationships within a linguistic system, using lexical databases and statistical patterns to distinguish between various senses of a word—a process known as Word Sense Disambiguation. If a model can perfectly predict, translate, and manipulate language to achieve a specific goal, many argue that it possesses a practical, operational form of meaning. In this view, meaning is a "web" of connections where every word is defined by its position relative to every other word in the lexicon. However, proponents of the "Symbol Grounding" view argue that true semantic analysis requires a connection to the non-linguistic, physical world. They contend that while an AI can perform sophisticated pattern matching, it remains a "stochastic parrot" because it lacks the sensory-motor experience that humans use to ground concepts in reality. To an AI, the word "hot" is simply a token that frequently appears near "fire" or "sun," whereas to a human, "hot" is an embodied sensation of heat. The future of computational linguistics likely lies in multimodal systems that combine text with visual and physical data, attempting to move beyond the "black box" of pure statistics toward a more human-like, grounded understanding of the world.