Hostname: page-component-745bb68f8f-5r2nc Total loading time: 0 Render date: 2025-02-11T21:17:12.506Z Has data issue: false hasContentIssue false

Linguistic structure emerges through the interaction of memory constraints and communicative pressures

Published online by Cambridge University Press:  02 June 2016

Molly L. Lewis
Affiliation:
Department of Psychology, Stanford University, Stanford, CA 94305. mll@stanford.edumcfrank@stanford.eduhttp://web.stanford.edu/~mll/http://web.stanford.edu/~mcfrank/
Michael C. Frank
Affiliation:
Department of Psychology, Stanford University, Stanford, CA 94305. mll@stanford.edumcfrank@stanford.eduhttp://web.stanford.edu/~mll/http://web.stanford.edu/~mcfrank/

Abstract

If memory constraints were the only limitation on language processing, the best possible language would be one with only one word. But to explain the rich structure of language, we need to posit a second constraint: the pressure to communicate informatively. Many aspects of linguistic structure can be accounted for by appealing to equilibria that result from these two pressures.

Type
Open Peer Commentary
Copyright
Copyright © Cambridge University Press 2016 

Christiansen & Chater (C&C) claim that memory limitations force the cognitive system to process the transient linguistic signal by compressing it. They suggest that this processing pressure influences the ultimate structure of language over the course of language evolution. Taken at face value, this proposal would lead to a degenerate linguistic structure, however. If memory constraints were the only pressure on language, languages would evolve to compress meaning into the simplest possible form – a single word (Horn Reference Horn and Schiffrin1984). But, as the authors point out, natural languages are not of this sort; they are richly structured into lexical and phrasal units of varying length. To account for this variability, we highlight the need to consider the communicative function of language. Communication serves as an important counter-pressure against compression in language processing, not just as a caveat.

Interlocutors use language with the goal of communicating information, but they also aim to minimize energetic cost (Zipf Reference Zipf1949). For the speaker, this goal implies minimizing production cost, and for the listener it implies minimizing comprehension cost. Importantly, these processing constraints have opposing cost functions (Horn Reference Horn and Schiffrin1984; Zipf Reference Zipf1949). For a producer, processing is minimized when a form is easy to say, and thus highly compressible. For the comprehender, however, processing is minimized when a form is minimally ambiguous and thus verbose. Compressing information is a useful strategy for a speaker who faces memory constraints, but it is useful only to the extent that the listener can still recover the intended meaning. This view of language use as rational action – minimizing costs while maximizing information transfer – is supported by a rich body of theoretical and empirical work (Clark Reference Clark1996; Frank & Goodman Reference Frank and Goodman2012; Goodman & Stuhlmüller Reference Goodman and Stuhlmüller2013; Grice Reference Grice, Cole and Morgan1975).

Although C&C argue that compression is the key factor in the emergence of structure, evidence at both the acquisition and evolution timescales suggests language is the product of the interaction between both compression and informativity. At the timescale of acquisition, experimental work suggests the resolution of reference in word learning is the product of communicative inferences (e.g., Baldwin Reference Baldwin1991; Reference Baldwin1993; Frank et al. Reference Frank, Goodman and Tenenbaum2009; Frank & Goodman Reference Frank and Goodman2014). And at the timescale of language evolution, a growing body of work suggests that the forms of words are also equilibria between these two pressures (Lewis & Frank Reference Lewis, Sugarman and Frank2014; Mahowald et al. Reference Mahowald, Fedorenko, Piantadosi and Gibson2012; Piantadosi et al. Reference Piantadosi, Tily and Gibson2011; Zipf Reference Zipf1936). For example, Piantadosi et al. (Reference Piantadosi, Tily and Gibson2011) found that words that are less predictable in their linguistic context are longer, suggesting that speakers may lengthen words that are surprising in order to increase time for the listener to process.

In addition to linguistic form, these pressures influence the mapping between form and meaning. An equilibrium in the structure of form-meaning mappings is one in which the listener is able to recover the intended meaning, but the speaker does not exert additional effort over-describing. A range of semantic domains reflect this equilibrium (Baddeley & Attewell Reference Baddeley and Attewell2009; Kemp & Regier Reference Kemp and Regier2012; Regier et al. Reference Regier, Kay and Khetarpal2007), and ambiguity, more generally, has been argued to reflect this communicative tradeoff (Piantadosi et al. 2012). Ambiguity is an equilibrium in cases where the listener can recover the intended meaning from the communicative context. One example is the word “some,” which has a literal meaning of “at least one and possibly all” but can be strengthened pragmatically to mean “at least one but not all” (Horn Reference Horn1972). Because its meaning is determined through communicative context, its literal semantics can overlap those of its competitor, “all.”

The key challenge associated with this broader proposal – that processing pressures influence linguistic structure – is providing direct evidence for a causal link between these two timescales. This problem is difficult to study in the laboratory because the proposed mechanism takes place over a long timescale and over multiple individual speakers. Furthermore, the presence of a causal link does not entail that phenomena in processing are directly reflected in linguistic structure – rather, entirely new properties may emerge at higher levels of abstraction from the interactions of more fundamental phenomena (Anderson Reference Anderson1972). It may, therefore, not be possible to directly extrapolate from brief communicative interactions observed in the laboratory to properties of linguistic structure.

Several recent pieces of experimental data begin to address this challenge, however. In one study, Fedzechkina et al. (Reference Fedzechkina, Jaeger and Newport2012) asked speakers to learn an artificial language that arbitrarily distinguished nouns through case-marking. Over learning sessions, speakers developed a system for marking in contexts where meanings were least predictable – a pattern reflected in the case-marking systems of natural language. Other work has used a similar paradigm to reveal the emergence of typologically prevalent patterns in the domains of word order (Culbertson et al. Reference Culbertson, Smolensky and Legendre2012; Culbertson & Newport Reference Culbertson and Newport2015) and phonology (Wilson 2008).

A particularly promising approach for exploring this causal link is through transmission chains (Kirby et al. Reference Kirby, Cornish and Smith2008; Reali & Griffiths Reference Reali and Griffiths2009). In a transmission chain, a participant learns and recalls a language, and then the recalled language becomes the learning input for a new learner. By iterating over learners, we can observe how languages change across transmission of learners over the course of language evolution. Kirby et al. (Reference Kirby, Tamariz, Cornish and Smith2015) have compared the emergence of linguistic structure in a regime that iterates over different partners of learners versus a regime where the same two partners repeatedly interact with each other. They find that linguistic structure emerges only by iterating over different partners, demonstrating the unique contribution of cross-generational learning to the emergence of structure. Others have begun to use this paradigm to link the interaction of processing pressures to the emergence of communicative regularities in semantic structure (Carstensen et al. Reference Carstensen, Xu, Smith, Regier, Noelle, Dale, Warlaumont, Yoshimi, Matlock, Jennings and Maglio2015; Lewis & Frank Reference Lewis, Frank, Noelle, Dale, Warlaumont, Yoshimi, Matlock, Jennings and Magli2015).

In sum, the consequences of memory constraints are likely a critical factor in shaping language structure. But an additional important constraint is the pressure to communicate informatively, and this constraint should not be overlooked in accounting for linguistic structure.

References

Anderson, P. W. (1972) More is different. Science 177:393–96.Google Scholar
Baddeley, R. & Attewell, D. (2009) The relationship between language and the environment: Information theory shows why we have only three lightness terms. Psychological Science 20:1100–107.Google Scholar
Baldwin, D. (1991) Infants' contribution to the achievement of joint reference. Child Development 62:874–90.Google Scholar
Baldwin, D. A. (1993) Infants' ability to consult the speaker for clues to word reference. Journal of Child Language 20:395–18.Google Scholar
Carstensen, A., Xu, J., Smith, C. & Regier, T. (2015) Language evolution in the lab tends toward informative communication. In: Proceedings of the 37th Annual Meeting of the Cognitive Science Society, Austin, TX, July 2015, ed. Noelle, D. C., Dale, R., Warlaumont, A. S., Yoshimi, J., Matlock, T., Jennings, C. D. & Maglio, P. P., pp. 303–308. Cognitive Science Society.Google Scholar
Clark, H. H. (1996) Using language. Cambridge University Press.Google Scholar
Culbertson, J. & Newport, E. L. (2015) Harmonic biases in child learners: In support of language universals. Cognition 139:7182.Google Scholar
Culbertson, J., Smolensky, P. & Legendre, G. (2012) Learning biases predict a word order universal. Cognition 122(3):306–29.Google Scholar
Fedzechkina, M., Jaeger, T. F. & Newport, E. L. (2012) Language learners restructure their input to facilitate efficient communication. Proceedings of the National Academy of Sciences 109:1789717902.Google Scholar
Frank, M. C. & Goodman, N. (2012) Predicting pragmatic reasoning in language games. Science 336:998–98.Google Scholar
Frank, M. C. & Goodman, N. D. (2014) Inferring word meanings by assuming that speakers are informative. Cognitive Psychology 75:9096.CrossRefGoogle ScholarPubMed
Frank, M. C., Goodman, N. D. & Tenenbaum, J. B. (2009) Using speakers' referential intentions to model early cross-situational word learning. Psychological Science 20:579–85.Google Scholar
Goodman, N. D. & Stuhlmüller, A. (2013) Knowledge and implicature: Modeling language understanding as social cognition. Topics in Cognitive Science 5:173–84.Google Scholar
Grice, H. P. (1975) Logic and conversation. In: Syntax and Semantics, ed. Cole, P. & Morgan, J., pp. 4158. Academic Press.Google Scholar
Horn, L. (1984) Toward a new taxonomy for pragmatic inference: Q-based and R-based implicature. In: Meaning, form, and use in context, ed. Schiffrin, D., pp. 1142. Georgetown University Press.Google Scholar
Horn, L. R. (1972) On the semantic properties of logical operators in English. Unpublished doctoral dissertation, UCLA.Google Scholar
Kemp, C. & Regier, T. (2012) Kinship categories across languages reflect general communicative principles. Science 336:1049–54.Google Scholar
Kirby, S., Cornish, H. & Smith, K. (2008) Cumulative cultural evolution in the laboratory: An experimental approach to the origins of structure in human language. Proceedings of the National Academy of Sciences 105:10681–85.Google Scholar
Kirby, S., Tamariz, M., Cornish, H. & Smith, K. (2015) Compression and communication drive the evolution of language. Cognition 141:87102.Google Scholar
Lewis, M. & Frank, M. C. (2015) Conceptual complexity and the evolution of the lexicon. In: Proceedings of the 37th Annual Meeting of the Cognitive Science Society, Austin, TX, July 2015, ed. Noelle, D. C., Dale, R., Warlaumont, A. S., Yoshimi, J., Matlock, T., Jennings, C. D. & Magli, P. P., pp. 1138–343. Cognitive Science Society.Google Scholar
Lewis, M., Sugarman, E. & Frank, M. C. (2014) The structure of the lexicon reflects principles of communication. In: Proceedings of the 36th Annual Meeting of the Cognitive Science Society, Austin, TX, July 2014, pp. 845–50, Cognitive Science Society.Google Scholar
Mahowald, K., Fedorenko, E., Piantadosi, S. & Gibson, E. (2012) Info/information theory: Speakers actively choose shorter words in predictable contexts. Cognition 126:313–18.Google Scholar
Piantadosi, S., Tily, H. & Gibson, E. (2011) Word lengths are optimized for efficient communication. Proceedings of the National Academy of Sciences 108:3526–29.Google Scholar
Reali, F. & Griffiths, T. (2009) The evolution of frequency distributions: Relating regularization to inductive biases through iterated learning. Cognition 111:317–28.Google Scholar
Regier, T., Kay, P. & Khetarpal, N. (2007) Color naming reflects optimal partitions of color space. Proceedings of the National Academy of Sciences 104:1436–41.Google Scholar
Wilson, C. (2006) Learning phonology with substantive bias: An experimental and computational study of velar palatalization. Cognitive Science 30:945–82.Google Scholar
Zipf, G. (1936) The psychobiology of language. Routledge.Google Scholar
Zipf, G. K. (1949) Human behavior and the principle of least effort. Addison-Wesley.Google Scholar