Hostname: page-component-745bb68f8f-v2bm5 Total loading time: 0 Render date: 2025-02-11T06:29:19.392Z Has data issue: false hasContentIssue false

What can frequency effects tell us about the building blocks and mechanisms of language learning?

Published online by Cambridge University Press:  03 February 2015

INBAL ARNON*
Affiliation:
Hebrew University
Rights & Permissions [Opens in a new window]

Abstract

Type
Commentaries
Copyright
Copyright © Cambridge University Press 2015 

Over the past decades, numerous studies have documented the way input frequency affects children's language learning on all levels: from the learning of sounds, through words, multiword sequences and more abstract constructions. This paper provides a timely and thorough review of the pervasiveness of frequency effects in first language acquisition, showing how frequency impacts not only vocabulary acquisition, but also children's learning of inflectional morphology, and more abstract syntactic constructions. The review shows that learning is sensitive to multiple frequency measures: from that of specific exemplars (e.g., the frequency of the word cake), through morphological types (e.g., the frequency of -ed as a past tense marker), to more abstract form–function mappings (e.g., that object relative clauses tend to have inanimate heads). Drawing on findings from both experimental and corpus-based studies, the authors argue that higher-frequency forms tend to be acquired earlier, and that both correct productions and error patterns can be traced back to input frequencies. The review brings together empirical findings from several distinct domains and argues convincingly that any model of language acquisition has to be able to account for them. However, it does not address the underlying causes or consequences of frequency effects. Here, I focus on the implications frequency effects have for how we understand the process and product of language learning.

Frequency effects are not interesting in and of themselves. They are interesting because they reveal something about the learning mechanisms and units used in language learning. Research over the past twenty years has shown that infants (and adults) are very adept at extracting distributional regularities from their environment, and can use this information to learn about linguistic structure (see Romberg & Saffran, Reference Romberg and Saffran2010, for a review). These statistical learning abilities can help infants discover word boundaries (e.g., Saffran, Aslin, & Newport, Reference Saffran, Aslin and Newport1996), phonetic categories (Maye, Werker, & Gerken, Reference Maye, Werker and Gerken2002), and even grammatical categories (e.g., Gomez & Lakusta, Reference Gomez and Lakusta2004). There are many parallels between these early statistical learning abilities and the frequency effects reported in this paper. In both cases, children attend to distributional information on multiple levels (between sounds, words, word classes) and use it to make generalizations. In fact, children's sensitivity to frequency in their linguistic input can be conceptualized as an extension of these early statistical learning abilities. The two bodies of literature – which are usually studied separately – also raise similar challenges.

The main challenge is a mechanistic one: why do frequency effects emerge? One possible answer lies in the role of prediction in language learning. Predictability plays an important role in language processing: speakers form expectations about upcoming topics, words, and constructions and use that to guide on-line processing (e.g., Hale, Reference Hale2006; Levy, Reference Levy2008; Jaeger, Reference Jaeger2010). Despite the importance of prediction in language processing, its role in first language learning has been relatively less explored. In looking at how input patterns influence learning, researchers have highlighted the role of frequency but not of predictability (a pattern that holds in this paper). Recent years have seen growing interest in the role of prediction in learning, with the successful application of discriminative learning theory to language learning (Arnon & Ramscar, Reference Arnon and Ramscar2012; Ramscar, Yarlett, Dye, Denny, & Thorpe Reference Ramscar, Yarlett, Dye, Denny and Thorpe2010; Ramscar, Dye, & McCauley, Reference Ramscar, Dye and McCauley2013). In such models, learning happens when there is prediction error: when there is a discrepancy between what is expected and what is encountered in the environment. A major component of learning involves forming predictions about how language unfolds over time. While both frequency and predictability influence child language use (as is the case for adults), predictability offers a more functional explanation for why distributional information plays a crucial role in language learning. Children are not just ‘counting up’ how many times forms appear. Instead, they are trying to make sense of the world around them by developing their ability to predict what will happen next. Viewing the child's task as one of prediction (e.g., Chater & Christiansen, Reference Chater and Christiansen2010; Elman, Reference Elman1990; Ramscar et al., Reference Ramscar, Yarlett, Dye, Denny and Thorpe2010) opens up new ways of thinking about the relation between what children hear (i.e., their input) and what they say (i.e., their output).

Frequency effects provide insight not only into the mechanisms used in learning (e.g., statistical learning, prediction), but also into the building blocks used in learning. Finding that children are sensitive to the frequency of multiword strings challenges the traditional notion of words as the basic building blocks for language learning and use (e.g., Pinker, Reference Pinker1999). The authors review many findings showing that children's correct and incorrect uses are affected by multiword frequency. For instance, children are better at repeating higher-frequency phrases (Bannard & Matthews, Reference Bannard and Matthews2008) and make more errors when the incorrect string appears often in another construction (e.g., more errors like me do it when children are exposed often to correct preverbal uses like let me do it; Kirjavainen, Theakston, & Lieven, Reference Kirjavainen, Theakston and Lieven2009). Such findings suggest that children use multiword units as building blocks for learning, as predicted by usage-based models (e.g., Abbot-Smith & Tomasello, Reference Abbot-Smith and Tomasello2006; Lieven & Tomasello, Reference Lieven, Tomasello, Robinson and Ellis2008). This sensitivity to multiword information is not limited to young learners. There is growing evidence that adults are also sensitive to the distributional properties of multiword sequences and draw on such information in production and comprehension (e.g., Arnon & Snider, Reference Arnon and Snider2010; Arnon & Cohen Priva, Reference Arnon and Cohen Priva2013; Reali & Christiansen, Reference Reali and Christiansen2007; Tremblay, Derwing, Libben, & Westbury, Reference Tremblay, Derwing, Libben and Westbury2011). Taken together, the developmental and psycholinguistic findings highlight the parallels in processing words and larger sequences; and point to the importance of multiword units in language learning and use.

The past decades have seen a significant shift in the study of language acquisition from models that prioritized innate mechanisms and abstract knowledge (e.g., Pinker, Reference Pinker1999) to ones that emphasize children's input and learning mechanisms (e.g., Tomasello, Reference Tomasello2003; Lieven & Tomasello, Reference Lieven, Tomasello, Robinson and Ellis2008). Frequency effects have played an important role in providing evidence for key usage-based predictions, in particular (a) the role of children's input in learning, and (b) the gradual move from lexically specific knowledge to more abstract knowledge. Today, there is extensive evidence documenting frequency effects in many languages and in many linguistic domains. We can now proceed to use these effects as a way to ask fundamental questions about the mechanisms of language learning and the resulting linguistic representations.

References

REFERENCES

Abbot-Smith, K., & Tomasello, M. (2006). Exemplar-learning and schematization in a usage based account of syntactic acquisition. Linguistic Review, 23, 275290.Google Scholar
Arnon, I,. & Cohen Priva, U. (2013). More than words: the effect of multi-word frequency and constituency on phonetic duration. Language and Speech, 56, 349373.CrossRefGoogle ScholarPubMed
Arnon, I., & Ramscar, M. (2012). Granularity and the acquisition of grammatical gender: how order-of- acquisition affects what gets learned. Cognition, 122, 292305.Google Scholar
Arnon, I., & Snider, N. (2010). More than words: frequency effects for multi-word phrases. Journal of Memory and Language, 62, 6782.CrossRefGoogle Scholar
Bannard, C., & Matthews, D. (2008). Stored word sequences in language learning. Psychological Science, 19, 241248.Google Scholar
Chater, N., & Christiansen, M. H. (2010). Language acquisition meets language evolution. Cognitive Science, 34, 11311157.Google Scholar
Elman, J. L. (1990). Finding structure in time. Cognitive Science, 14, 179211.Google Scholar
Gomez, R. L., & Lakusta, L. A. (2004). A first step in form-based category abstraction by 12-month-old infants. Developmental Science, 7, 567580.Google Scholar
Hale, J. (2006). Uncertainty about the rest of the sentence. Cognitive Science, 30, 609642.Google Scholar
Jaeger, T. F. (2010). Redundancy and reduction: speakers manage syntactic information density. Cognitive Psychology, 61(1), 2362.Google Scholar
Kirjavainen, M. M. M., Theakston, A. L., & Lieven, E. V. (2009). Can input explain children's me-for-I errors? Journal of Child Language, 36, 10911114.Google Scholar
Levy, R. (2008). Expectation-based syntactic comprehension. Cognition, 106, 11261177.Google Scholar
Lieven, E., & Tomasello, M. (2008). Children's first language acquisition from a usage-based perspective. In Robinson, P. & Ellis, N. (eds), Handbook of cognitive linguistics and second language acquisition (pp. 168196). New York and London: Routledge.Google Scholar
Maye, J., Werker, J. F., & Gerken, L. (2002). Infant sensitivity to distributional information can affect phonetic discrimination. Cognition, 82, B101B111.CrossRefGoogle ScholarPubMed
Pinker, S. (1999). Words and rules. New York: Harper Perennial.Google Scholar
Ramscar, M., Dye, M., & McCauley, S. M. (2013). Error and expectation in language learning: the curious absence of ‘mouses’ in adult speech. Language, 89, 760793.CrossRefGoogle Scholar
Ramscar, M., Yarlett, D., Dye, M., Denny, K., & Thorpe, K. (2010). The effects of feature-label order and their implications for symbolic learning. Cognitive Science, 34, 909957.Google Scholar
Reali, F., & Christiansen, M. H. (2007). Processing of relative clauses is made easier by frequency of occurrence. Journal of Memory and Language, 57, 123.Google Scholar
Romberg, A. R., & Saffran, J. R. (2010). Statistical learning and language acquisition, Wiley Interdisciplinary Review Cognitive Science, 1, 906914.Google Scholar
Saffran, J. R., Aslin, R. N., & Newport, E. L. (1996). Statistical learning by 8-month-old infants. Science, 274, 19261928.Google Scholar
Tomasello, M. (2003). Constructing a language: a usage-based theory of language acquisition. Cambridge, MA: Harvard University Press.Google Scholar
Tremblay, A., Derwing, B., Libben, G., & Westbury, G. (2011). Processing advantages of lexical bundles: evidence from self-paced reading and sentence recall tasks. Language Learning, 61, 569613.Google Scholar