Introduction
One of the fundamental assumptions of language is that the linguistic sign is arbitrary (Hockett, Reference Hockett and Greenberg1963). Until recently it was largely believed that the relationship between a linguistic form and its meaning was based on convention. However, non-arbitrary mappings between form and meaning are far more widespread than previously believed (Aryani, Conrad, Schmidtke & Jacobs, Reference Aryani, Conrad, Schmidtke and Jacobs2018; Blasi, Wichmann, Hammarstrom, Stadler & Christiansen, Reference Blasi, Wichmann, Hammarstrom, Stadler and Christiansen2016; Monaghan, Shillcock, Christiansen & Kirby, Reference Monaghan, Shillcock, Christiansen and Kirby2014; Sidhu, Westbury, Hollis, & Pexman, Reference Sidhu, Westbury, Hollis and Pexmanin press; see also Perniss, Thompson & Vigliocco, Reference Perniss, Thompson and Vigliocco2010). One example of non-arbitrariness is iconicity: the resemblance between form and meaning in language (for reviews see Dingemanse, Blasi, Lupyan, Christiansen & Monaghan, Reference Dingemanse, Blasi, Lupyan, Christiansen and Monaghan2015; Dingemanse, Perlman & Perniss, Reference Dingemanse, Perlman and Perniss2020). The most common instance of iconicity in spoken language is onomatopoeia, in which the form of a word directly resembles the sound to which it refers (e.g., woof, mumble). Spoken language can also be iconic by way of sound symbolism: associations between certain phonemes and particular perceptual and/or semantic features (e.g., between /b/ or /u/ and roundness; for reviews see Lockwood & Dingemanse, Reference Lockwood and Dingemanse2015; Sidhu & Pexman, Reference Sidhu and Pexman2018a). That is, when a word evokes associations that resemble its meaning, it could also be considered iconic (e.g., balloon and roundness). In this paper, we use iconicity to refer to any resemblance-based mapping between a word's form and its meaning (Dingemanse et al., Reference Dingemanse, Blasi, Lupyan, Christiansen and Monaghan2015).
There is considerable evidence for the existence of iconicity in languages (Blasi et al., Reference Blasi, Wichmann, Hammarstrom, Stadler and Christiansen2016; Dingemanse et al., Reference Dingemanse, Blasi, Lupyan, Christiansen and Monaghan2015; Perniss et al., Reference Perniss, Thompson and Vigliocco2010). There is also evidence that this iconicity has consequences for language processing and for language acquisition. For example, experiments have demonstrated that both adults and children respond differently to iconic vs. arbitrary words (e.g., Kanero, Imai, Okuda, Okada & Matsuda, Reference Kanero, Imai, Okuda, Okada and Matsuda2014; Sidhu, Vigliocco, & Pexman, Reference Sidhu, Vigliocco and Pexman2020). Other experiments have found that adults, children, and pre-linguistic infants are sensitive to sound symbolism (Asano, Imai, Kita, Kitajo, Okada & Thierry, Reference Asano, Imai, Kita, Kitajo, Okada and Thierry2015; Imai, Kita, Nagumo & Okada, Reference Imai, Kita, Nagumo and Okada2008; Imai, Miyazaki, Yeung, Hidaka, Kantartzis, Okada & Kita, Reference Imai, Miyazaki, Yeung, Hidaka, Kantartzis, Okada and Kita2015; Kovic, Plunkett & Westermann, Reference Kovic, Plunkett and Westermann2010; Maurer, Pathman & Mondloch, Reference Maurer, Pathman and Mondloch2006; Ozturk, Krehm & Vouloumanos, Reference Ozturk, Krehm and Vouloumanos2013; Yang, Asano, Kanazawa, Yamaguchi & Imai, Reference Yang, Asano, Kanazawa, Yamaguchi and Imai2019), while Perniss, Lu, Morgan and Vigliocco, (Reference Perniss, Lu, Morgan and Vigliocco2018) found that iconicity was elaborated in sign language directed to children. Importantly, there is evidence that iconicity is more prevalent in early-acquired signed and spoken language (Perry et al., Reference Perrt, Perlman and Lupyan2015; Thompson, Vinson, Woll & Vigliocco, Reference Thompson, Vinson, Woll and Vigliocco2012; Vinson, Cormier, Denmark, Schembri & Vigliocco, Reference Vinson, Cormier, Denmark, Schembri and Vigliocco2008). For example, Perry, Perlman and Lupyan (Reference Perrt, Perlman and Lupyan2015) collected iconicity ratings of ~600 English words that are commonly produced in the first three years of life. Participants rated the extent to which the words’ sounds matched their meanings (see Methods for further details). The authors found a negative association between iconicity and each word's production frequency at 30 months, even after excluding onomatopoeia from the analysis and controlling for other factors related to age of acquisition. Thus, the earliest-learned words also tended to be the most iconic.
Several studies have replicated and extended this work (Hinojosa, Haro, Magallares, Duñabeitia & Ferr, Reference Hinojosa, Haro, Magallares, Duñabeitia and Ferr2020; for a review see Laing, Reference Laing2019). For example, Perry, Perlman, Winter, Massaro and Lupyan (Reference Perry, Perlman, Winter, Massaro and Lupyan2018) found that conversations between adults were characterized by the use of low-iconicity, highly arbitrary words, whereas early parent-child conversations were rich in highly iconic words. Further, low-iconicity words increased in frequency as a function of age, suggesting that children gradually shift toward an adult style of speech by using more arbitrary words. Meanwhile, Massaro and Perlman (Reference Massaro and Perlman2017) examined the average iconicity of children's vocabularies during the first four years of life and found that iconicity gradually decreased with age, starting at 12 months. Thus, children's speech is initially characterized by frequent use of high-iconicity words, but arbitrary words start to dominate after 12 months. Finally, Perlman, Fusaroli, Fein and Naigles (Reference Perlman, Fusaroli, Fein, Naigles, Gunzelmann, Howes, Tenbrink and Davelaar2017) performed a longitudinal analysis of parent-child dyads, and found that the average iconicity of both children's and parents’ speech was initially high but gradually decreased between the ages of 12 and 69 months. However, children tended to increase their use of arbitrary words, while parents decreased their use of iconic words. Changes in iconicity were also moderated by individual differences in children's cognitive skills, suggesting that parents tailored their use of iconicity to their child's skill level.
Iconicity and word learning
Taken together, the results from these studies demonstrate that high-iconicity words are both acquired earlier and used more often by infants and young children. Imai and Kita (Reference Imai and Kita2014) suggested that iconicity might help infants map speech sounds onto referents and constrain a word's potential meaning (i.e., Imai & Kita, Reference Imai and Kita2014; Perniss & Vigliocco, Reference Perniss and Vigliocco2014). This is what Nielsen and Dingemanse (Reference Nielsen and Dingemanse2020) termed local learning enhancement (i.e., iconic words are easier to learn), in contrast to general learning enhancement (i.e., learning iconic words makes all words easier to learn). Their characterization of the literature is that there is a good deal of evidence for local enhancement, but no clear evidence for general enhancement.
If iconicity does provide local enhancement of language acquisition, there should be evidence of a learning advantage for iconic words, especially in early vocabulary (Perlman et al., Reference Perlman, Fusaroli, Fein, Naigles, Gunzelmann, Howes, Tenbrink and Davelaar2017). Indeed, word-learning experiments have demonstrated that, in some cases, iconic words have a learning and processing advantage over non-iconic words (Imai et al., Reference Imai, Kita, Nagumo and Okada2008; Kanero et al., Reference Kanero, Imai, Okuda, Okada and Matsuda2014; Kovic et al., Reference Kovic, Plunkett and Westermann2010; Laing, Reference Laing2017; Lockwood, Dingemense & Hagoort, Reference Lockwood, Dingemense and Hagoort2016; Nielsen & Rendall, Reference Nielsen and Rendall2012). Meanwhile, indirect evidence for the role of iconicity in word learning comes from the high proportion of iconic words in early parent-child conversations and the fact that iconic words tend to be acquired earlier (Perlman et al., Reference Perlman, Fusaroli, Fein, Naigles, Gunzelmann, Howes, Tenbrink and Davelaar2017; Perry et al., Reference Perry, Perlman, Winter, Massaro and Lupyan2018).
While there is evidence of a learning advantage for iconic words, there remains a great deal of work to be done in understanding the developmental trajectory of this advantage (see Nielsen & Dingemanse, Reference Nielsen and Dingemanse2020). For instance, a study by Massaro and Perlman (Reference Massaro and Perlman2017) found that the first ~60 words understood by children are not rated as being high in iconicity. In addition, Tzeng, Nygaard and Namy (Reference Tzeng, Nygaard and Namy2017) reported that 3-year-olds were not sensitive to the sound symbolism of existing foreign words. This suggests that effects of iconicity may not be consistent throughout development and also highlights the need for various kinds of data beyond production.
One goal of the present study was to help piece together the role of iconicity across different stages in development. While previous research has examined the frequency of iconic words in language used by and produced for infants and preschoolers, here we also examined the relative frequency of iconic words in vocabulary acquisition during the school years. Thus, we were able to investigate the role of iconicity in vocabulary acquisition beyond infancy and the preschool years. In addition, we examined this using a variety of measures (including production, comprehension and input) to gain a fuller appreciation of the different ways that effects of iconicity manifest throughout development.
Iconicity and part of speech
In addition to the observed relationship with age of acquisition, iconicity has been shown to vary by syntactic class. Perry et al. (Reference Perrt, Perlman and Lupyan2015) found that onomatopoeia and interjections were rated as the most iconic word classes, followed by adjectives, adverbs, and verbs, with nouns and function words rated as least iconic. Work by Winter, Perlman, Perry and Lupyan (Reference Winter, Perlman, Perry and Lupyan2017) has shown that words with a greater amount of associated sensory experience tend to be more iconic. This makes sense because iconicity often involves a resemblance of some perceptual experience. Hence, since adjectives and adverbs depict sensorimotor experiences, their greater iconicity is consistent with what one would expect. Subsequently, Winter et al. (Reference Winter, Perlman, Perry and Lupyan2017) replicated the different iconicity among syntactic classes using ratings for an additional ~ 2,400 items, and also found that words with a greater amount of auditory and tactile information tended to be more iconic (see also Perlman, Little, Thompson & Thompson, Reference Perlman, Little, Thompson and Thompson2018). This finding may help explain the higher iconicity of verbs relative to nouns, given the greater amount of associated auditory experience among verbs than nouns (e.g., Sidhu & Pexman, Reference Sidhu and Pexman2018b). Thus, the meanings of words from certain syntactic classes may be easier to depict through iconicity compared to those from other classes.
Differences in iconicity between syntactic classes may also be affected by semantic neighbourhood density, or the number of words with similar meanings. Dense semantic neighbourhoods may impede iconicity due to the greater risk for confusion among similar forms with similar meanings. Indeed, Sidhu and Pexman (Reference Sidhu and Pexman2018b) found that iconic words tend to have sparser semantic neighbourhoods. Further, Imai and Kita (Reference Imai and Kita2014) speculated that verbs might have sparser semantic neighbourhoods than nounsFootnote 1 – since verbs tend to make less fine-grained semantic distinctions than do nouns. These sparser neighbourhoods may in turn contribute to the greater iconicity of verbs.
Importantly, the relationship between age of acquisition and iconicity could be moderated by syntactic class. Investigating this possibility was another goal of the present study. It is possible that differences in semantic density between syntactic classes are not uniform throughout development. While nouns’ semantic neighbourhoods may be especially dense in adult vocabulary, this may not yet be the case for children.Footnote 2 This would mean that early vocabularies are still able to support iconic nouns, without running into the difficulty of dense semantic neighbourhoods. If this is true, one would predict that the average iconicity of nouns in infants’ vocabulary would decline faster than it does for other syntactic classes. Further, iconicity may aid in the learning of other syntactic classes even after noun space has become too crowded. Indeed, Imai and Kita (Reference Imai and Kita2014) speculated that iconicity could facilitate the learning of verbs past the stage at which it is facilitative for nouns, due to verbs’ sparser neighbourhoods overall.
Present study
In the present study, we investigated iconicity in development by making use of existing language development datasets. In particular, we examined whether there is evidence that the acquisition of iconic words is facilitated across a variety of measures, including children's early production, comprehension, and input frequency. In addition, we explored whether this facilitation exists across various stages of development beyond infancy. Finally, in a largely exploratory set of analyses, we examined whether the developmental trajectory of iconic words differs based on syntactic classes.
Method
Iconicity was quantified using subjective ratings collected by Perry et al. (Reference Perrt, Perlman and Lupyan2015; Experiment 1) and later supplemented by Winter et al. (Reference Winter, Perlman, Perry and Lupyan2017). In total, iconicity ratings were available for 3,001 English words. Raters in these studies were shown words one at a time and asked to rate the extent to which their sounds matched their meanings. Ratings were made on a scale ranging from -5 (“words that sound like the opposite of what they mean”) to +5 (“words that sound like what they mean”), with 0 indicating words that “do not sound like what they mean or the opposite” (Perry et al., Reference Perrt, Perlman and Lupyan2015; p. 12). We used each word's average rating as the measure of its iconicity.
This study involved analysis of data from four datasets: the CHILDES corpus (MacWhinney, Reference MacWhinney2000), the Wordbank child production data (Frank, Braginsky, Yurovsky & Marchman, Reference Frank, Braginsky, Yurovsky and Marchman2017), the Dale and O'Rourke (Reference Dale and O'Rourke1981) vocabulary comprehension data, and the Zeno word frequency norms (Zeno, Ivens, Millard & Duvvuri, Reference Zeno, Ivens, Millard and Duvvuri1995). Note that in all cases we only analyzed nouns, verbs and adjectives. Part of speech was based on Brysbaert, New and Keuleers (Reference Brysbaert, New and Keuleers2012).
CHILDES corpus
The CHILDES database (MacWhinney, Reference MacWhinney2000) includes over 5,000 transcriptions of children's speech. These transcriptions can be used to investigate changes in child speech by examining the average lexical and semantic properties of utterances at different ages.
Transcripts of 620 dialogues, containing a total of 125,353 child sentence-utterances were downloaded from CHILDES together with the provided grammatical annotation. These were from typically developing children aged between 9 and 62 months, from all English language corpora (32 in total) that were available at the time of download in 2015–2016. Using information provided in the database we decomposed children's sentences into words (309,116 word-occurrences were found) and assigned their part of speech (POS). A subset of dialogues (comprising 5% of the word-occurrences in the sample) was checked for accuracy, identifying instances where the CHILDES annotation was problematic for mapping accurately to words in the iconicity norms, including compound nouns, contractions, POS, parsing errors, and typos. This process found an error rate of 19.7%. Half of these errors were contractions which are decomposed in CHILDES annotation. We did not want to include in our analyses the iconicity for the constituent words. Another common issue was the CHILDES annotation of compound words, for which we again did not want to include the iconicity values for the constituent words but only for the compound itself.
After identifying these types of errors in the subset of dialogues, the entire database was cleaned to correct them (e.g., spelled out contractions and compound nouns were restored, such that the iconicity of their constituent words were not included in the analyses; plural forms included only where there was a plural form in the Perrt et al., Reference Perrt, Perlman and Lupyan2015; or Winter et al., Reference Winter, Perlman, Perry and Lupyan2017 iconicity norms). After data cleaning, 75% of the children's words were found to have corresponding iconicity norms. The final sample consisted of 530 separate dialogues from 157 children. We computed the average iconicity of words uttered in each dialogue, using the ratings collected by Perry et al. (Reference Perrt, Perlman and Lupyan2015) and Winter et al. (Reference Winter, Perlman, Perry and Lupyan2017).
Wordbank data
This dataset combines data from the Macarthur-Bates Communicative Development Inventory (CDI) across multiple sites (see Frank et al., Reference Frank, Braginsky, Yurovsky and Marchman2017). In particular, we made use of data regarding the percentage of children producing a given word at ages ranging from 16 to 30 months. Using this, we derived a measure of age of acquisition for each word, defined using the same criterion as Dale and O'Rourke (Reference Dale and O'Rourke1981): the youngest age at which more than 67% of children produced a given word. Values on all predictors were available for 387 words.
Dale and O'Rourke data
The Dale and O'Rourke (Reference Dale and O'Rourke1981) data estimate age of acquisition (AoA) by objective means, testing children's vocabulary knowledge across school grades and including over 31,000 unique words. Each word's AoA is equal to the lowest grade in which it is known to an estimated 50–70% of students, based on three-alternative multiple choice tests and corrected for guessing. Brysbaert and Biemiller (Reference Brysbaert and Biemiller2017) updated and expanded the data so that they offer vocabulary estimates for grades 2, 4, 6, 8, 10, 12, 13, and 16 (the latter two are college levels). Brysbaert and Biemiller reported that the correlation between the Dale and O'Rourke test-based AoA values and the subjective ratings of AoA from Kuperman, Stadthagen-Gonzalez and Brysbaert (Reference Kuperman, Stadthagen-Gonzalez and Brysbaert2012; in which adult participants estimated the age at which they had learned each word) was strong at r = .757.
In the present analyses, we used the same set of 1,860 items in the analyses of these and the Zeno et al. norms. This was the maximum number of items that were present in both sets of items and had values available for all predictors.
Zeno et al. norms
These norms estimate the vocabulary encountered in schooling for American students in grades 1 through college (Zeno et al., Reference Zeno, Ivens, Millard and Duvvuri1995). Using 6,300 text sources and a corpus of over 17,000,000 tokens (154,000 word types), the norms estimate word frequency grade by grade. In our analysis, we examined the frequency of a word in each grade level.
Results
Analyses were carried out using the statistical software R (R Core Team, 2018). Mixed effects models were computed using the “lme4” package (Bates, Maechler, Bolker & Walker, Reference Bates, Maechler, Bolker and Walker2015b). We developed each model's random effects structure using the approach suggested by Bates, Kliegl, Vasishth and Baayen (Reference Bates, Kliegl, Vasishth and Baayen2015a). This involved first fitting a model with all random intercepts and slopes (for fixed effects of interest), then using principal components analysis to determine the number of random effects that the data could support, using the “RePsychLing” package (Baayen, Bates, Kliegle & Vasishth, Reference Baayen, Bates, Kliegle and Vasishth2015). Finally, we tested the inclusion of correlations among random effects and the effects themselves using likelihood ratio tests. Continuous predictors were always standardized. Data and code for all analyses are available here: https://osf.io/nz8e3/.
CHILDES corpus
We began with an analysis including all nouns, verbs and adjectives. This consisted of a linear mixed effects model in which the average iconicity of words uttered in that dialogue was the dependent variable. Our predictor of interest was the child's age in each dialogue.Footnote 3 Here, and in subsequent analyses, we included the following control variables: log transformed frequency in child-directed speech (in CHILDES)Footnote 4, word length, orthographic Levensthein distance (OLD; Yarkoni, Balota & Yap, Reference Yarkoni, Balota and Yap2008), and concreteness (Brysbaert, Warriner & Kuperman, Reference Brysbaert, Warriner and Kuperman2014). In the present analyses these were averaged within each dialogue. We also included a random subject intercept. This revealed a significant effect of age (b = −0.09, p < .001) such that dialogues from younger children tended to contain words with higher average iconicity (see Table 1). It also revealed that more iconic dialogues tended to contain words that were more frequent, shorter, more orthographically distinct, and more concrete.
Table 1 Linear mixed effects regression model predicting average dialogue iconicity in the CHILDES corpus.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220213175048613-0623:S0305000921000040:S0305000921000040_tab1.png?pub-status=live)
Notes. OLD is orthographic Levenshtein distance (Yarkoni et al., Reference Yarkoni, Balota and Yap2008).
* p < .05, ** p < .01, *** p < .001.
We next calculated separate dialogue iconicity averages for nouns (n Tokens = 37,399), verbs (n Tokens = 50,512), and adjectives (n Tokens = 6,041), and included part of speech as a predictor (dummy coded; nouns as reference category) along with its interaction with age. We found that there was no interaction between part of speech and age for verbs (p = .91) or for adjectives (p = .26; see Figure 1). Thus, the relationship between age and iconicity did not differ by syntactic class.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220213175048613-0623:S0305000921000040:S0305000921000040_fig1.png?pub-status=live)
Figure 1. The average iconicity of dialogues at different ages as a function of syntactic class.
Wordbank data
These analyses consisted of linear regressions in which the dependent measure was production-based AoA. The predictor of interest was iconicity. This analysis revealed a significant effect of iconicity (b = −0.32, p = .02) such that words higher in iconicity were produced earlier (see Table 2). The analysis also revealed that words produced at a younger age were more frequent and more concrete. Next, we included part of speech (dummy coded; nouns as a reference group) as a predictor, along with its interaction with iconicity. We found that there was no interaction between part of speech and iconicity for verbs (p = .27), or for adjectives (p = .43; see Figure 2). Given the pattern observable in Figure 2, these nonsignificant interactions may be due to the small number of items in the analysis (n Nouns = 259, n Verbs = 87, n Adjectives = 41). Nevertheless, we conclude that the relationship between iconicity and age did not differ by syntactic class.
Table 2 Linear regression model predicting age at which words are produced, in the Wordbank data (Frank et al., Reference Frank, Braginsky, Yurovsky and Marchman2017).
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220213175048613-0623:S0305000921000040:S0305000921000040_tab2.png?pub-status=live)
Notes. OLD is orthographic Levenshtein distance (Yarkoni et al., Reference Yarkoni, Balota and Yap2008).
* p < .05, ** p < .01, *** p < .001.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220213175048613-0623:S0305000921000040:S0305000921000040_fig2.png?pub-status=live)
Figure 2. The average age at which words of different iconicity are produced, as a function of syntactic class.
Dale and O'Rourke data
These analyses and those of the Zeno et al. norms (reported next) used the same set of items (1,256 nouns, 332 verbs, and 272 adjectives). The present analyses consisted of linear regressions in which the dependent measure was test-based AoA. The predictor of interest was iconicity. This analysis revealed a significant effect of iconicity (b = −0.09, p = .006) such that words higher in iconicity were acquired earlier (see Table 3). As would be expected, the analysis also revealed that words comprehended in an earlier grade are more frequent, shorter, and more concrete. Next, we included part of speech (dummy coded; nouns as a reference group) as a predictor, along with its interaction with iconicity. We found that there was no interaction between part of speech and iconicity for verbs (p = .48) or for adjectives (p = .11; see Figure 3). Thus, the relationship between iconicity and age did not differ by syntactic class.
Table 3 Linear regression model predicting grade at which competency is achieved, in the Dale & O'Rourke data.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220213175048613-0623:S0305000921000040:S0305000921000040_tab3.png?pub-status=live)
Notes. OLD is orthographic Levenshtein distance (Yarkoni et al., Reference Yarkoni, Balota and Yap2008).
* p < .05, ** p < .01, *** p < .001.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220213175048613-0623:S0305000921000040:S0305000921000040_fig3.png?pub-status=live)
Figure 3. The average grade at which words of different iconicity are comprehended, as a function of syntactic class.
We performed an additional analysis that removed the earliest acquired words (n = 462). This was operationalized as those appearing in the Macarthur Bates Developmental Inventory (Fenson, Reference Fenson2007), which was intended to represent the words acquired in the first three years. With these items, the effect of iconicity was marginally significant (p = .051). When an interaction with syntactic class was included the effect of iconicity was significant (p = .004) and did not interact with syntactic class for verbs (p = .25) or adjectives (p = .08).
Zeno et al. norms
The analysis of Zeno et al. word frequency values consisted of a mixed effects linear regression. The dependent variable was the log transformed frequency of each word +1 at each grade. Our predictor of interest was an interaction between iconicity and grade. The model also included a random item intercept, and a random item slope for grade. This revealed a significant interaction between iconicity and grade (b = −0.10, p < .001; see Table 4). The nature of this interaction was that iconic words were more frequent in earlier grades, but less frequent in later grades. The results were unchanged by removing words that are outliers in each grade (+/− 3 SDs from mean grade frequency). In addition, words that were more frequent in educational materials were more frequent in child-directed speech, less orthographically distinct, and less concrete. Next we included a three-way interaction between iconicity, grade, and part of speech (dummy coded; nouns as a reference category). This revealed a marginally significant three-way interaction for verbs (b = −0.06, p = .06) but not for adjectives (p = .30; see Figure 4). We thus conducted separate analyses on the nouns, verbs and adjectives. We found a significant interaction between iconicity and grade for nouns (b = −0.08, p < .001), verbs (b = −0.16, p < .001), and adjectives (b = −0.12, p < .001), though it was largest for verbs.
Table 4. Linear mixed effects regression model predicting word frequency at each grade in Zeno et al. norms.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220213175048613-0623:S0305000921000040:S0305000921000040_tab4.png?pub-status=live)
Notes. OLD is orthographic Levenshtein distance (Yarkoni et al., Reference Yarkoni, Balota and Yap2008).
* p < .05, ** p < .01, *** p < .001.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220213175048613-0623:S0305000921000040:S0305000921000040_fig4.png?pub-status=live)
Figure 4. The log word frequency of words of different iconicity at grades 1, 5, 9 and 13, as a function of syntactic class.
We again performed an additional analysis in which we removed words from the Macarthur Bates Developmental Inventory (Fenson, Reference Fenson2007). The direction and significance of the results did not change, except that the interaction between iconicity and part of speech was now significant for verbs (p = .02).
Discussion
Previous work has shown that there is a negative relationship between age and iconicity, such that earlier acquired words tend to be higher in iconicity (e.g., Perry et al., Reference Perrt, Perlman and Lupyan2015). We examined this relationship across four different kinds of data: examples of child language production between the ages of nine and 63 months (the CHILDES corpus; MacWhinney, Reference MacWhinney2000); parent-reported estimates of children's word production between 16 and 30 months (the Wordbank data; Frank et al., Reference Frank, Braginsky, Yurovsky and Marchman2017); word comprehension data from school age children to young adults between grades 2 and 13 (Dale & O'Rourke, Reference Dale and O'Rourke1981); and word frequency norms from school materials written for school age children to adults between grades 1 and 13 (Zeno et al., Reference Zeno, Ivens, Millard and Duvvuri1995). We found that earlier words tended to be higher in iconicity in each of these databases.
There are several novel contributions of this work. The first is the finding that the relationship between age and iconicity continues beyond infancy. Previous work has shown a relationship between iconicity and production frequency at 30 months (Perry et al., Reference Perrt, Perlman and Lupyan2015), as well as production frequency across the ages of 12 to 69 months (Perry et al., Reference Perry, Perlman, Winter, Massaro and Lupyan2018). Here we found that a relationship could be observed in language datasets for children ranging in age from six to 19 years (i.e., Dale & O'Rourke, Reference Dale and O'Rourke1981; Zeno et al., Reference Zeno, Ivens, Millard and Duvvuri1995). In particular, in the Dale and O'Rourke data, iconic words were comprehended earlier than arbitrary words. It should be noted that the majority of the words in our sample from the Dale and O'Rourke data were learned between grades two and four. Nevertheless, this still explores an age range of seven to ten years, thus going well beyond infancy. Importantly, this relationship remained (p = .051) even after removing the earliest acquired words; thus, it was not simply a carryover from infancy. This suggests that the benefit of iconicity to word learning is not restricted to infancy. Further, while Perlman et al. (Reference Perlman, Little, Thompson and Thompson2018) supposed that iconicity might benefit word leaning by providing referential insight and/or simply making words easier to learn, our results from later childhood suggest that the mechanism cannot be referential insight alone.
While the Dale and O'Rourke data provided a glimpse at a period that was largely focused between grades two and four, the Zeno et al. norms provided insight into a much longer time span. The results suggested that the transition towards arbitrariness plays out over a period extending well beyond infancy. The implication is that the lexicon does not arrive at an arbitrary state after infancy, but that this happens gradually. While there are benefits to iconicity, arbitrariness is also important: for example, arbitrariness has been suggested to improve the discriminability of words, particularly in crowded lexicons (see Monaghan, Christiansen & Fitneva, Reference Monaghan, Christiansen and Fitneva2011; Sidhu & Pexman, Reference Sidhu and Pexman2018b). Arbitrariness also allows language to refer to meanings that cannot be imitated. As a child's lexicon develops it becomes more dense, and must refer to a wider array of more complex meanings. Both of these factors may direct it towards arbitrariness.
Another important aspect of this work is the variety of ways in which lexical acquisition was operationalized. Previous work has largely focused on child language production corpora (e.g., Perry et al., Reference Perrt, Perlman and Lupyan2015; Reference Perry, Perlman, Winter, Massaro and Lupyan2018), as did the present work with the CHILDES and Wordbank datasets. However, we went beyond this by including a measure of comprehension (Dale & O'Rourke, Reference Dale and O'Rourke1981). Thus, not only are iconic words produced more often earlier on, they are also comprehended more often earlier on. This provides direct support for the proposal that iconic words are easier to learn and understand. Additionally, we found that iconic words were included more frequently in educational materials for younger children vs. older children and young adults. Previous work has shown that child- vs. adult-direct speech contains a greater proportion of iconic words (Perlman et al., Reference Perlman, Fusaroli, Fein, Naigles, Gunzelmann, Howes, Tenbrink and Davelaar2017; Perry et al., Reference Perry, Perlman, Winter, Massaro and Lupyan2018; for a review, see Imai & Kita, Reference Imai and Kita2014). Here we extended that work using written input that was specifically designed for children of different ages. The findings support Perry et al.'s (Reference Perry, Perlman, Winter, Massaro and Lupyan2018) proposal that adults adjust their language use for younger individuals to aid comprehension.
It may be that authors of educational material are either implicitly or explicitly aware of the learning benefit for iconic words and make greater use of them for younger readers. Of course, it is also possible that the greater frequency of iconic words in earlier grades is simply a function of authors using words that tend to be acquired earlier, and these are more likely to be iconic. This would still beg the question of why iconic words tend to be acquired earlier, and the easier acquisition of these words is a reasonable suggestion.
The final contribution of this work is exploring how the relationship between iconicity and age plays out differently across syntactic classes. We did not find evidence for the hypothesized sharp decline in iconic noun frequency. It may be that this change happens in production at an age past the highest age tested in CHILDES. We did, however, observe a marginal interaction involving verbs in the Zeno et al. norms. In this analysis we observed a marginal three-way interaction between iconicity, frequency and syntactic class (comparing nouns to verbs). While the relationship between iconicity and frequency changed over time for all words, with iconic words becoming less frequent, this change was perhaps most pronounced for verbs. The fact that this interaction only approached significance, however, means that it should not be overinterpreted.
In addition to exploring the prevalence of iconicity in early language, the corresponding shift towards arbitrariness is also important to understand. It has been proposed that arbitrariness provides benefits to language by enhancing discriminability (see Monaghan et al., Reference Monaghan, Christiansen and Fitneva2011; Sidhu & Pexman, Reference Sidhu and Pexman2018b), and allowing reference to meanings that cannot be imitated (see Dingemanse et al., Reference Dingemanse, Blasi, Lupyan, Christiansen and Monaghan2015). These benefits might affect the shift towards arbitrariness, as language begins to address topics that are vulnerable to confusion or are difficult to imitate.
In conclusion, we replicated and extended previous findings of iconicity providing a benefit to word learning using a variety of child language data sources. We also observed a gradual shift towards arbitrariness over the school years. These data demonstrate that the balance between iconicity and arbitrariness plays a role throughout language development and is important to the understanding of language as a whole.
Acknowledgements
This work was supported by the Natural Sciences and Engineering Research Council (NSERC) of Canada under Discovery Grant # RGPIN/03860-2018 to PMP.