scripod.com

#426 – Edward Gibson: Human Language, Psycholinguistics, Syntax, Grammar & LLMs

In this episode, Edward Gibson, a professor of psycholinguistics at MIT, dives into the intricate structures and evolution of human language. The discussion spans a wide range of topics, from the mathematical foundations of syntax to the cognitive mechanisms that underlie language acquisition and processing. Drawing from both theoretical and experimental perspectives, Gibson explores how languages are shaped by human cognition, culture, and communication needs.
Gibson discusses the universal patterns in word order, such as verb-initial and verb-final systems, and how dependency grammar models sentence structure. He contrasts the strengths of large language models in capturing form with their limitations in understanding meaning. The conversation also covers the cognitive costs of complex syntax, such as center embedding, and how dependency length affects comprehension. Topics like language evolution, the influence of culture on language development, and the role of language in cognition—especially in numerical reasoning—are explored. The episode concludes with reflections on the challenges of machine translation, the potential for animal communication, and the enduring impact of language on identity and society.
13:44
13:44
LLMs are good at handling form but struggle with deeper meaning
18:12
18:12
Languages minimize dependencies between words for structural efficiency.
30:23
30:23
High-frequency words tend to be irregular, acting like sticky slang that breaks grammatical rules.
36:25
36:25
Tsimané color terms reflect environmental and communicative needs
42:08
42:08
English shifted from verb-final to verb-medial due to contact with French.
1:15:10
1:15:10
Real languages consistently have shorter dependency lengths than randomized controls
1:36:55
1:36:55
Global aphasics can perform non-language tasks like math, chess, and driving
1:51:19
1:51:19
LLMs may implement dependency grammar but lack reasoning capabilities.
2:03:16
2:03:16
Center embedding significantly impairs comprehension and recall in legal texts.
2:19:42
2:19:42
All human languages solve communication problems but are not necessarily optimized for learning.
2:29:00
2:29:00
fMRI studies show language networks are distinct from other cognitive systems
2:33:30
2:33:30
The Pirahã language uses 'few', 'some', and 'many' in a context-determined way rather than exact numbers.
2:47:17
2:47:17
Language serves as a tool for identity and resistance.
2:49:01
2:49:01
Translating stylistic form, such as rhythm and musicality, remains a major challenge for machine translation.
2:52:23
2:52:23
Earth Species Project aims to decode animal communication