Hostname: page-component-745bb68f8f-grxwn Total loading time: 0 Render date: 2025-02-11T06:24:46.402Z Has data issue: false hasContentIssue false

Extended Statistical Learning as an account for slow vocabulary growth*

Published online by Cambridge University Press:  24 May 2011

STEPHANIE F. STOKES*
Affiliation:
University of Canterbury, New Zealand
SOPHIE KERN
Affiliation:
Institut des Sciences de l'Homme, Lyon
CHRISTOPHE DOS SANTOS
Affiliation:
Unité Imagerie et Cerveau Inserm U930, Université François Rabelais de Tours
*
[*]Address for correspondence: Stephanie F. Stokes, Department of Communication Disorders, University of Canterbury, New Zealand. e-mail: stephanie.stokes@canterbury.ac.nz
Rights & Permissions [Opens in a new window]

Abstract

Stokes (2010) compared the lexicons of English-speaking late talkers (LT) with those of their typically developing (TD) peers on neighborhood density (ND) and word frequency (WF) characteristics and suggested that LTs employed learning strategies that differed from those of their TD peers. This research sought to explore the cross-linguistic validity of this conclusion. The lexicons (production, not recognition) of 208 French-speaking two-year-old children were coded for ND and WF. Regression revealed that ND and WF together predicted 62% of the variance in vocabulary size, with ND and WF uniquely accounting for 53% and 9% of that variance respectively. Epiphenomenal findings were ruled out by comparison of simulated data sets with the actual data. A generalized Mann–Whitney test showed that children with small vocabularies had significantly higher ND values and significantly lower WF values than children with large vocabularies. An extended statistical learning theory is proposed to account for the findings.

Type
Articles
Copyright
Copyright © Cambridge University Press 2011

This research compares the characteristics of the lexicons of children who have been described as ‘late talkers’ (LT) with those of their typically developing (TD) peers. LTs have a slow onset of expressive vocabulary, while having no other indications of developmental disability (see Demarais, Sylvestre, Meyer, Bairati & Rouleau, Reference Desmarais, Sylvestre, Meyer, Bairati and Rouleau2008, for a review). Not only is onset late in comparison with their TD peers, but these children are usually identified by their small expressive vocabularies, whether it be by the metric of less than 50 words or no word combinations at age 2 ; 0–2 ; 6 (e.g. Paul, Reference Paul1996), or by the metric of below the 10th or 15th percentile for age (e.g. Bishop, Price, Dale & Plomin, Reference Bishop, Price, Dale and Plomin2003) on the MacArthur-Bates Communicative Development Inventory (MCDI; Fenson, Dale, Reznick, Thal, Bates, Hartung, et al., Reference Fenson, Dale, Reznick, Thal, Bates and Hartung1993; Fenson, Marchman, Thal, Dale, Reznick & Bates, Reference Fenson, Marchman, Thal, Dale, Reznick and Bates2007). The point is that all current definitions identify a child as being a LT on a quantitative measure. In addition, one of the mysteries surrounding LT status is the fact that about two-thirds of these children go on to have language abilities that fall within the normal range, albeit still significantly lower than children who had never been late talkers (e.g. Rescorla, Reference Rescorla2002). Children who approach TD performance on language tests between two and four years are referred to as ‘late bloomers’ (LB).

Stokes (Reference Stokes2010) asked whether the lexicons of LTs differed qualitatively from those of TD children on variables known to affect word learning, specifically phonological neighborhood density and word frequency, and whether there was any indication that these variables could separate LTs from LBs. The term phonological neighbor refers to words that differ from all other words by the substitution, deletion or addition of a sound in any word position (± one segment; Luce & Pisoni, Reference Luce and Pisoni1998). Words that have many phonological neighbors are said to reside in dense neighborhoods, while those with few phonological neighbors reside in sparse neighborhoods. Word frequency is generally defined as the rate of occurrence of a given word in a spoken corpus, where the corpus varies depending on purpose, for example, child-directed speech (Swingley, Reference Swingley2003) or the CELEX (Baayen, Piepenbrock & Gulikers, Reference Baayen, Piepenbrock and Gulikers1995).

Most studies of early vocabulary development have found that first words tend to come from dense phonological neighborhoods in the ambient language. However, there are individual differences across children (Coady & Aslin, Reference Coady and Aslin2003; Storkel, Reference Storkel2004; Reference Storkel2009). The impact of word frequency on vocabulary learning is less clear-cut. Goodman, Dale and Li (Reference Goodman, Dale and Li2008) noted that although there is a general consensus that words that are frequent in child-directed speech (CDS) are learned the earliest, there had not been any direct test of this hypothesis. These authors assigned words on the MCDI to one of six lexical categories: common nouns, people words, verbs, adjectives, closed class and others. (Common nouns were words that encoded objects and substances, like ball, frog and juice, and nouns that labeled events or locations, like park and lunch were categorized as other.) The relationship between the frequency of each word in CDS and the age of emergence of each word according to the Dale and Fenson (Reference Dale and Fenson1996) database was examined to test the hypothesis. Word frequency was negatively correlated with age of acquisition for the entire word set, indicating that earliest learned words were of low frequency. When each word category was considered in turn, the expected relationship was found – the higher the word frequency the earlier the word was learned, with the variables being negatively correlated. Goodman et al.'s (Reference Goodman, Dale and Li2008) results are particularly important for understanding Stokes's (Reference Stokes2010) results for English (see below).

Findings for British English

Much of the research on early vocabulary development had employed the MCDI (Fenson et al., Reference Fenson, Dale, Reznick, Thal, Bates and Hartung1993) as a measurement of vocabulary size, and normative data on vocabulary development has been used as a basis for describing lexical and sublexical characteristics of children's lexicons, such as neighborhood density, word frequency and phonotactic probability (e.g. Storkel, Reference Storkel2004; Reference Storkel2009). In Stokes (Reference Stokes2010), 222 parents checked off the words that their toddlers (aged 2 ; 0–2 ; 6) were known to use (speak) on the MCDI (British-English version; Klee & Harrison, Reference Klee and Harrison2001). Each word in each child's MCDI list was coded for the neighborhood density (De Cara & Goswami, Reference De Cara and Goswami2002) of the word in the ambient language (British English), and for the frequency of occurrence of the word (word frequency) from the CELEX database (Baayen et al., Reference Baayen, Piepenbrock and Gulikers1995). Mean ND and WF values were generated for each child. MCDI scores had a strong, negative and significant correlation with ND scores, and a moderate, positive and significant correlation with WF scores. Large vocabularies had lower density scores, suggesting more words in their inventories from sparse neighborhoods. Children with small vocabularies (LTs) appeared to be learning words that were of low frequency in the input, and came from dense neighborhoods in the ambient language. A hierarchical regression revealed that the variables together accounted for 61% of the variance in vocabulary scores, with ND scores and WF uniquely accounting for 47% and 14% of that variance respectively. Also, children who scored more than one standard deviation below the mean (for age in months; 16th percentile) on the MCDI scored significantly higher on ND and significantly lower on WF than children who scored above the cut point.

A small group of children (N=27) had very small vocabularies (more than 1·5 SD below the mean for age in months on the MCDI; approximately the 7th percentile). Of these children, nine had mean ND values that resembled the ND values of the TD children. Stokes (Reference Stokes2010) suggested that these children may eventually become LBs, as they may have learning strategies that resembled those of TD children. The remaining eighteen children at this cut point had very high mean ND values, which led Stokes to conclude that these children may continue to have atypical language learning strategies, eventually being classified as having a language impairment.

Findings from Wright (Reference Wright, Local, Ogden and Temple2004), Scarborough (Reference Scarborough2004) and Munson & Solomon (Reference Munson and Solomon2004) were invoked to account for the results. These authors reported that speakers implicitly regulate production of high-density words to expand the vowel space and increase word duration to maximize listener perception of these words. Stokes (Reference Stokes2010) suggested that very young children with relatively poor vocabulary development (LTs) may be tuning into words that are implicitly exaggerated for the listener. This would mean that words from dense neighborhoods were more perceptually salient as formant cues were exaggerated, cues that LTs took advantage of to learn words from dense neighborhoods. Research on younger children provided evidence that infants aged 0 ; 6–0 ; 8 take advantage of prosodic cues to statistical information (exaggerated pitch peaks) in infant-directed speech (Thiessen, Hill & Saffran, Reference Thiessen, Hill and Saffran2005), suggesting that this is a plausible account. The implication is a perceptual deficit in LTs, or at least a preference for some types of vowel and duration cues, rather than a preference for highly recurring lead (CV+ e.g. hat, ham, have), rhyme (+VC e.g. hat, cat, mat), or consonant (C+C e.g. hat, hot, hut) combinations in the input.

An alternative interpretation was also related to children's perceptual abilities, but was more directly focused on learning mechanisms. Stokes (Reference Stokes2010) suggested that the LT or low vocabulary children, having become adept at abstracting familiar word structures (recurring CV+, +VC or C+C structures), from the ambient language, failed to move beyond that point, thereby failing to begin to process words from sparser neighborhoods. Simply put, they became stuck in one learning mechanism. The long-standing tradition of research into infant and toddler perceptual learning appears to support this view. Research has demonstrated that up until about age 0 ; 9 infants are able to discriminate between any two phonetic contrasts in human languages, but that after this age, perception begins to approximate adult performance in that the ability to discriminate non-native contrasts diminishes. For example, Japanese adults are unable to discriminate between ra and la (a non-native contrast), whereas Japanese infants can do so before about age 0 ; 9 (see a summary in Kuhl, Reference Kuhl2004).

This suggests that neurological reorganization (called neural commitment by Kuhl and colleagues, e.g. Kuhl, Conboy, Coffey-Corina, Padden, Rivera-Gaxiola & Nelson, Reference Kuhl, Conboy, Coffey-Corina, Padden, Rivera-Gaxiola and Nelson2008), stimulated by the infant's attention to the statistical and distributional properties of his/her native language, acts as a foundation for subsequent language development. Indeed, Kuhl and colleagues (summarised in Kuhl et al., Reference Kuhl, Conboy, Coffey-Corina, Padden, Rivera-Gaxiola and Nelson2008 ) have convincingly demonstrated that infants at age 0 ; 7·15 who tune into the statistical and distributional properties of their native language have better subsequent language development (vocabulary and early syntactic complexity) than their peers who do not demonstrate such fine-tuning. Using a head turn paradigm, Kuhl et al. measured infants' abilities to discriminate between two syllables in both native and non-native languages. The children with good native discrimination at age 0 ; 7·15 had better language scores at ages 1 ; 6, 2 ; 0 and 2 ; 6 than children with poor native discrimination at 0 ; 7·15 and the children with poor non-native discrimination abilities had better subsequent language scores than those with good non-native discrimination abilities. That is, children who were less tuned to the contrasts of their native language at 0 ; 7·15 had slower language development. This fine-tuning was termed constrained statistical learning by Saffran and colleagues (Aslin & Newport, Reference Aslin, Newport, Colombo, McCardle and Freund2008; Saffran, Reference Saffran2002; Reference Saffran2003) where early constrained statistical learning is a positive influence on later vocabulary growth (Lany & Saffran, Reference Lany and Saffran2010).

Stokes (Reference Stokes2010) showed that statistical learning could also be an important factor impinging on children's abilities to expand their lexicon past the 50-word stage. She suggested that for language to grow at a satisfactory rate, toddlers, not only infants, need to tune into subtleties of the statistical and distributional properties of their native language. The hypothesis is that having developed appropriate and useful constrained statistical learning mechanisms to find a way into the lexicon (albeit late), LTs fail to loosen these constraints to allow vocabulary expansion. In the normal process of development, children are slower to learn words that have fewer neighbors in the ambient input. In order to expand the lexicon, toddlers need to broaden their attunement to the statistical regularities of words from sparser neighborhoods by loosening constrained statistical learning strategies, expanding the ability to perceive, organize and use words that have fewer phonological neighbors. It is assumed that children with larger lexicons are those who have loosened or broadened their learning strategies in order to perceive, organize and use word forms of lower statistical probability from the ambient input stream. Aslin and Newport (Reference Aslin, Newport, Colombo, McCardle and Freund2008) suggested that successful early effective constrained statistical learning could ‘block’ later learning in some children and we suggest that we have found evidence to support this view. We term this phenomenon extended statistical learning (ESL).

While the premise of an ESL mechanism may hold promise for further investigations into the learning mechanisms of LTs, and insights into possible causes for slow vocabulary development, Stokes (Reference Stokes2010) reported findings from only British-English-speaking children. Questions remain about whether ESL could be a phenomenon that occurs in children learning languages other than English, that by definition have different distributional frequencies, different (C)V(C) combinatorial constraints and different prosodic structures. For example, Hohle, Bijeljac-Babic, Herold, Weissenborn and Nazzi (Reference Hohle, Bijeljac-Babic, Herold, Weissenborn and Nazzi2009), among others, report differences between English-, German- and French-speaking children for stress perception. It is possible that different languages present different challenges to toddlers at the onset of using their expressive vocabularies. To explore these possibilities, we interrogated a comparable database from French-speaking toddlers, and these findings are the focus for this report.

The phonological features of English and French with which we are primarily concerned (stress, rhythm, number of consonants and vowels, ratio of consonants/vowels and phonotactics), are shown in Table 1. We have chosen French as a comparison language because the languages are essentially similar on all features except stress and rhythm type. There is one primary difference between the two languages that should be noted and that could have an effect on the results. There is a marked difference between the languages in the use of syllable structure for monosyllabic words. The percentage of open, closed and vowel only structures is 55·71%, 26·09% and 17·22% for French, and 27·48%, 55·71% and 16·81% for English. The potential impact of this difference in open and closed syllable structure on the current analysis is not clear, but it is noted here as an a priori factor because it could have an impact on the findings. Our hypothesis was that the lexicons of French-speaking children would show the same types of distributional properties that were found for English (Stokes, Reference Stokes2010).

Table 1. Selected phonological features of English and French

Notes: aLadefoged (Reference Ladefoged1999).

b Haspelmath, Dryer, Gil & Comrie (Reference Haspelmath, Dryer, Gil and Comrie2008) .

c Fourgeron & Smith (Reference Fourgeron and Smith1999) .

d Inventory size and categorization (small, large, etc.) for consonants and vowels, where vowels includes diphthongs, size notation is from Maddieson (Reference Maddieson, Haspelmath, Dryer, Gil and Comrie2008a) .

e C/V=Consonant/Vowel Ratio, a representation of the phonological complexity of languages (Maddieson, Reference Maddieson, Haspelmath, Dryer, Gil and Comrie2008b) .

f The notation (0–4 ) indicates that the number of final consonants could be 0, 1, 2, 3 or 4. Derived from Lexique3 (New et al., Reference New, Brysbaert, Veronis and Pallier2007).

The aim of the study was to explore the lexical characteristics of the expressive vocabularies of French-speaking two-year-old children. In order to test if Stokes's (Reference Stokes2010) findings for English hold for French, we explored the same two lexical characteristics: neighborhood density (ND) and word frequency (WF). The ND metric used was the ± one phoneme substitution, addition or deletion definition (Ph±1 metric, e.g. Charles-Luce & Luce, Reference Charles-Luce and Luce1990), for example for English, hat neighbors include hot, cat, ham, and for French, bulle neighbors include mule, bel, bus. WF was defined as the number of times a given word appears in more than 50 million words (see ‘Method’ below). Before turning to the study proper, there is one final issue that should be addressed, albeit briefly: that of the concept of frequency-weighted neighborhood density.

Some readers may be inclined to question the validity of the current research given that it does not use frequency-weighted neighborhood density as a measurement variable. The issue is covered at some length in Stokes (Reference Stokes2010), to which the reader is referred. In brief, while historically high-frequency words were thought to have more neighbors than low-frequency words, closer examination of the derivation of these claims reveals that the metric does not hold for phonological neighbors and all word lengths, but rather is germane to orthographic neighbors of words four letters in length.

The research questions were:

  1. 1. How much variance in vocabulary size is accounted for by neighborhood density and word frequency together and independently in French-speaking two-year-old children?

  2. 2. Is there a significant difference between children with small and large vocabularies in neighborhood density and word frequency?

  3. 3. Are the distributions for English and French similar?

METHOD

Participants

The sample consisted of 220 children (110 girls) aged between 2 ; 0 and 2 ; 6 who were a subset of the 663 children (age range 1 ; 4−2 ; 6) studied for the French standardization of the MacArthur-Bates Communicative Development Inventory (L'Inventaire Français du Développement Communicatif, IFDC; Kern, Reference Kern2003). Exclusion criteria were any of the following: being other than 2 ; 0–2 ; 6, having a non-native French-speaking parent, repeated ear infections, diagnosed developmental delays, premature birth or twin status. In addition, of the 220 who met the inclusion criteria, 12 children were dropped from the analysis; four children were reported to use less than 25 words, five children had incomplete datasets, and the parents of three children reported a native language other than French. Table 2 shows the number of boys and girls at each age.

Table 2. Age (months) and sex breakdown for the sample

Procedures

The IFDC (Kern, Reference Kern2003; Kern & Gayraud, Reference Kern and Gayraud2010) was distributed to parents by pediatricians (members of the French Association of Ambulatory Pediatricians) during a home visit. Parents filled in the forms alone and mailed them back directly to the research group. The IFDC is comprised of 690 words arranged in 22 categories, similar to checklists for other languages (animal names, toys, adjectives, quantifiers, articles, verbs, etc). Consistent with Stokes (Reference Stokes2010), only 12 categories (518 words) were retained, those that represented core vocabulary that was likely to be shared across children rather than being context specific. (Examples of categories that cannot be shared across children are pets' names and baby-sitter's name.) Included categories were verbs (N=102), food and drink (73), adjectives (65), small household objects (56), animals (43), furniture (33), clothing (32), outside things (31), body parts (28), places to go (23), toys (18), and vehicles (14). The range of scores (of a possible 518 words) for the 208 children was 28–499.

Data reduction

Of the 518 words, only monosyllabic words were chosen for data coding by ND and WF. This decision was driven by three reasons. First, 76% and 73% of all words in English and French respectively are monosyllabic. Second, although some words larger than one syllable do have neighbors (e.g. converse, converge, convert), many do not (e.g. popcorn), and adding longer words would significantly skew the data. Third, other studies of this type have also included only monosyllabic words (e.g. Storkel, Reference Storkel2004; Zamuner, Reference Zamuner2009). Selection criteria included the presence of a vowel, regardless of the number of consonants (e.g. ‘tree’ arbre). Words that were notionally bisyllabic but included an unmarked schwa in the first syllable were counted as monosyllabic (e.g. ‘little’ petite is realized as /pti/), although words of this structure, but with complex onsets were not included (e.g. ‘frog’ grenouille/grnuj/). Finally, for all verbs with a mono- or bisyllabic lemma, the most frequent or only monosyllabic form was chosen (e.g. the monosyllabic lemma ‘to take’ prendre /pʁdʁ/ has several monosyllabic forms, the more frequent one is /pʁ/; the bisyllabic lemma ‘to walk’ marcher /maʁʃe/ has only one monosyllabic form /maʁʃ/). This resulted in a selected list of 223 words: 134 nouns, 30 adjectives, 3 adverbs and 56 verbs. As for English, the three words that appeared twice on the list were restricted to one occurrence, for example ‘water’ (eau) appears in both ‘food’ and ‘outside objects’. The other duplicated words were ‘park’ (parc) and ‘pot’ (pot), leaving a final list of 220 words.

Neighborhood density

Both ND and WF were generated from the Lexique3 reference database, a corpus of adult language (more than 50 million words; New, Brysbaert, Veronis, & Pallier, Reference New, Brysbaert, Veronis and Pallier2007). This decision may be queried by some readers, however Gierut and Dale (Reference Gierut and Dale2007) make the excellent point that it would be difficult to know how to limit a child-directed-speech (CDS) corpus for use in a study on child vocabulary development. Does one only select the words actually addressed to children aged 2 ; 0 for those children, and others that were definitely spoken to children aged 2 ; 6 for that age group? Where is the cut point? This is an empirical question that may need to be addressed in future work. However, there are now at least three lines of argumentation to support the use of adult corpora. The first is that children are not only exposed to child-directed speech. They are exposed to adult−adult and child−child conversations too. As Jusczyk, Luce and Charles-Luce (Reference Jusczyk, Luce and Charles-Luce1994) noted, children do indeed extract the patterns of adult language, a fact reported many times by Storkel (e.g. Reference Storkel, Rickheit and Strohner2008) in justification of the use of adult corpora. Second, corpora should be very large to generate realistic and reliable WF and ND values. Small corpora such as those usually found in the French CDS database on CHILDES are just too small (eight children). Third, our own comparison of WF values in a CDS corpus with an adult corpus showed that the results were strongly correlated (r(222)=0·90, p<0·001; Stokes, Reference Stokes2010), a finding similar to that reported by Gierut and Dale (Reference Gierut and Dale2007) and Jusczyk et al. (Reference Jusczyk, Luce and Charles-Luce1994) .

Neighborhood density for word type was calculated from the most frequent phonological form in the Lexique3. For example, for the verb ‘to sing’ chanterte/, ND was calculated from the form /ʃt/, and for the verb ‘to take’ prendre /pʁdʁ/, only the form /pʁ/ was taken into account.

Word frequency

The frequency of occurrence of each word was also determined using the Lexique3 reference database (New et al., Reference New, Brysbaert, Veronis and Pallier2007). In this database, the frequency of each word is reported for each category in which it appears. For example, words like ‘big’ (grand) or ‘eat’ (manger) can be part of different word classes depending on the context ‘eat’ (manger) can appear as a verb, ‘eat chocolate’ (manger du chocolat), or a noun, ‘I brought my food’ (j'ai apporté mon manger). As manger is part of the category ‘action words’ in the IFDC, only the frequency of the verb and not the frequency of the noun was included. For nouns, singular and plural homophonic forms are summed into one count, for example, ‘lion’ and ‘lions’ because both are pronounced as /lj/. Each word for each child was coded for ND and WF and a mean value was generated for each child.

RESULTS

In order to exactly replicate Stokes's (Reference Stokes2010) findings for English, the same statistical analyses were conducted, and findings were reported using the same terminology and similar phrasing. First, the data were standardized by age group (see below). Then ND and WF were examined as predictors of vocabulary size in French, followed by between-group comparisons for ND and WF distributions for ‘high’ and ‘low’ vocabulary children. Next, data simulations were generated to examine whether or not the results were authentic or epiphenomenal, as was done for English, in order to rule out artifactual effects. Finally, the British and French distribution patterns were compared.

Data transformation

A multivariate analysis of variance showed that there was a significant effect of Age on ND, WF and IFDC scores (F(6,201)=2·90, p=0·01, partial η2=0·08; F(6,201)=2·34, p=0·03, partial η2=0·07; and F(6, 201)=5·23, p<0·001, partial η2=0·14 respectively), although the effect sizes were small. Age was also significantly correlated with the other variables (r(208) Age by IFDC=0·35, p<0·001; r(208) Age by ND=−0·25, p<0·001; and r(208) Age by WF=0·21, p=0·002), so all variables were converted to Z scores within age groups for subsequent analyses, for example IFDC for age 2 ; 0, 2 ; 1, etc. This effectively controls for Age in all analyses.

Predicting vocabulary size

In preparation for answering the first research question (How much variance in vocabulary size is accounted for by ND and WF together and independently in two-year-old children?), correlations among the variables were examined. IFDC score had a moderate, positive, significant correlation with WF and a strong, negative, significant correlation with ND (r(208)=0·48, p<0·001; r(208)=−0·73, p<0·001 respectively). That is, as vocabulary size increased, WF increased and ND decreased. ND and WF were weakly and negatively correlated (r(208)=−0·25, p<0·001).

Plots of these relationships are shown in Figures 1 and 2. The plot for IFDC by ND reflects the significant negative correlation, with low vocabularies being comprised of high NDs relative to larger vocabularies. The plot for IFDC by WF reflects the significant positive correlation, with low vocabularies being generally comprised of low WF, although it is clear that there is some variability in both WF and ND values for the lowest vocabulary scores. A curve estimation regression revealed that both linear and quadratic relationships fit the data. While both were significant (F(1, 206)=222·60, p<0·001 and F(2, 205)=129·12, p<0·001 respectively), the linear fit was stronger and was chosen for subsequent analyses.

Fig. 1. Scatterplot of neighborhood density by vocabulary size.

Fig. 2. Scatterplot of word frequency by vocabulary size.

A multiple regression was conducted with IFDC as the outcome variable, and ND and WF predictors entered together, using the backward method. Probability plots suggested that the residuals were satisfactorily distributed, so the linear model was used. The model was significant (F(2, 205)=166·53, p<0·001), with WF and ND accounting for 62% of the variance in vocabulary scores. Inspection of the t values in the table of coefficients (Table 3) shows that ND was the strongest predictor, followed by WF. The partial correlation for ND was r=−0·71, and for WF it was r=0·48, suggesting that WF and ND should be considered as separate factors contributing to the variance in vocabulary scores. With these results, a hierarchical multiple regression was run in which ND accounted for 53% of unique variance in IFDC scores (F(1, 206)=227·60, p<0·001), and WF accounted for 9% of additional unique variance (F(1, 205)=50·63, p<0·001).

Table 3. Table of coefficients for the multiple regression predicting IFDC scores

Note: ND=neighborhood density; WF=word frequency.

Overall then, neighborhood density was inversely related to vocabulary size and was a strong predictor of vocabulary size in these two-year-old children. Larger vocabularies contained more words from sparse neighborhoods in the ambient language. Word frequency was directly related to vocabulary size; as vocabulary size increased, so did word frequency.

Group differences on neighborhood density

To answer the second research question (Is there a significant difference between children with small and large vocabularies in ND and WF?), children who scored at or below 1 SD on the IFDC for age were coded as ‘low vocabulary’ yielding 40 children in the low vocabulary group and 168 in the high vocabulary group. Inspection of the distributions of ND scores for the two groups indicated a lack of homogeneity of variance. A non-parametric Generalized Mann–Whitney test for unequal sample sizes (Odeh, Reference Odeh1972) revealed a significant difference between low and high vocabulary children in ND values (U=846·5, p<0·001). Children who scored at or more than 1 SD below the mean for age on the IFDC had higher ND than children scoring above 1SD below the mean (M=1·27, SD=1·13, and M=−0·30, SD=0·65, respectively).

The same analysis was conducted for WF scores. The children with low IFDC scores scored significantly lower on WF than the children with higher vocabulary scores (U=1568·5, p<0·001; M=−0·73, SD=1·02; and M=0·17, SD=0·89 for low and high vocabulary groups). Figure 3 shows the error bar plots (bars indicate confidence interval for the mean) for the two groups. Clearly there is more variability in the low vocabulary group for both ND and WF (discussed below). In summary, the lexicons of children with small vocabularies are comprised of words that are of high ND and low WF in the ambient language.

Fig. 3. Error bar plots for low and high vocabulary groups for neighborhood density and word frequency.

Fact or artifact?

Just as for English (Stokes, Reference Stokes2010), it was necessary to rule out possible artifact effects on the outcomes. The question ‘Are these child vocabulary results authentic or epiphenomenal of the dataset?’ must be asked. Simulations of random distributions of the 220 words in the database were generated using SPSS to explore the possibility that the results for the 208 two-year-old children were simply an artifact of the IFDC dataset. First, using SPSS Macros, syntax commands were written to generate 50 random samples at each of 5%, 10%, 15% … 90% of the data, that is, 50 samples of a lexicon size of 11 words, 50 samples of a lexicon of 22 words, etc. (900 random samples). For each sample, average ND and WF were generated, just as had been done for the child data. The scatterplots of IFDC by ND and IFDC by WF for these random samples are shown in Figures 4 and 5. Both ND and WF show extreme variability for small vocabulary sizes, and reduced variability in larger vocabularies (heteroscedasticity). For example, for the smallest vocabulary size the mean ND value could be anywhere between <15 to >28 and the mean WF value could be anywhere between <10 to >1000.

Fig. 4. Scatterplot of neighborhood density by vocabulary size for 900 simulated cases.

Fig. 5. Scatterplot of word frequency by vocabulary size for 900 simulated cases.

Second, from the 900 simulated children, five random samples of 208 cases (comparable to our start point for this study) were generated for comparison with our real dataset. Figures 6 and 7 show the scatterplots of these five random samples with the scores from our real cases. For ND the actual child data does not map onto the random samples, having a more linear distribution, indicating that the results of this study are authentic rather than epiphenomenal, that is, these are non-random developmental profiles. For ND, the children appear to score higher than the random samples. For WF, there appears to be more overlap in mean scores of the actual data with some of the random samples. For WF, the children appear to score lower than some of the random samples.

Fig. 6. Scatterplot of neighborhood density by vocabulary size for five random samples and actual child data.

Fig. 7. Scatterplot of word frequency by vocabulary size for five random samples and actual child data.

In order to find statistical evidence for this visual pattern, two one-way ANOVAs were run to determine whether or not the distributions differed among groups. (Levene's test of equality of error variances was not violated; F(5, 1242)=1·09, p=0·36.) For ND, there was a significant difference between the actual child data and all random samples although the effect size was small, but no differences among the random samples (F(1, 5)=38·56, p<0·01, partial ŋ 2=0·13). For WF the difference was not significant (F(1, 5)=2·21, p=0·051, partial ŋ 2=0·009). Box plots of ND and WF for simulated and real samples (95% confidence intervals for group means on the ordinal axis) are shown in Figure 8. The statistical analysis shows that the probability that the child distributions for both WF and ND arose by chance was <0·06.

Fig. 8. Error bar plots for neighborhood density and word frequency for five random samples and actual child data.

From these simulations we can conclude that the results for the 208 children were not artifactual but truly reflect the propensity of children with small vocabularies to learn words from dense neighborhoods in the ambient language. The result was not as clear-cut for WF, where the scores had a 5% chance of occurring at random.

Comparison with English

Table 4 summarizes the findings for the two languages. First, for both languages ND and WF together account for a high proportion of the variance in vocabulary scores, and ND was a strong predictor, having an inverse relationship with vocabulary size. When the samples were split into low and high vocabulary groups, there is a slight difference between English and French. For the low vocabulary groups, there is marked variability in ND and WF values for both languages. For the high vocabulary group, while together the variables accounted for a high proportion of the variance in vocabulary size (44% and 48%), the contributions of the two predictors differ. WF accounted for one-third of the variance in vocabulary size for English but ND accounted for one-third of the variance in vocabulary for French. This difference may simply reflect the degree of variability in scores for the two languages for vocabulary size in the normal range for this age group. The relationships between WF and ND and WF and vocabulary size are discussed in Stokes (Reference Stokes2010) . While the relationships are weak, this is an issue that requires further exploration.

Table 4. Variance in vocabulary scores accounted for by neighborhood density and word frequency in English and French for the total sample and low and high vocabulary groups

Notes: * p<0·05, ** p<0·01.

a MCDI and the IFDC indicate the MacArthur-Bates Communicative Development Inventory (Fenson et al., Reference Fenson, Dale, Reznick, Thal, Bates and Hartung1993; Fenson et al., Reference Fenson, Marchman, Thal, Dale, Reznick and Bates2007), British version (Klee & Harrison, Reference Klee and Harrison2001) and the French Communicative Development Inventory (Kern, Reference Kern2003; Kern & Gayraud, Reference Kern and Gayraud2010).

Summary of findings

Neighborhood density had a strong inverse relationship with vocabulary size and predicted 53% and 47% of the variance in vocabulary size for French and English respectively. Word frequency did predict vocabulary size, but it only accounted for 9% (French) and 14% (English) of the variance once neighborhood density had been accounted for. Children with CDI scores more than 1 SD below the mean for their age had significantly higher ND values and significantly lower WF values than children with average to large lexicons.

DISCUSSION

This research addressed three questions: How much variance in vocabulary size is accounted for by neighborhood density and word frequency together and independently in French-speaking two-year-old children? Is there a significant difference between children with small and large vocabularies in neighborhood density and word frequency? Are the distributions for English and French similar? Our hypotheses were that ND would be a very strong predictor of vocabulary size, that children with small vocabularies would be significantly different from the children with average−large lexicons in ND and WF values, and that the results for English and French would be similar. All hypotheses were confirmed.

Predicting vocabulary size

For this French sample, ND and WF together accounted for 62% of the variance in vocabulary size. ND alone accounted for 53% of the variance in vocabulary size, and WF contributed 9%. This strength of this relationship is unprecedented in the search for factors that predict vocabulary size. The first study for the English sample (Stokes & Klee, Reference Stokes and Klee2009) compared demographic, cognitive, behavioral and psycholinguistic predictors of vocabulary size and found that nonword repetition scores (36%), sex (5%) and age (4%) all contributed uniquely to variance accounted for in vocabulary size. The second study (Stokes, Reference Stokes2010) reported that ND and WF together accounted for 61% of the variance in vocabulary size, with ND accounting for 47% of unique variance. The French data confirmed the findings for English, that ND is a very strong predictor of vocabulary size and that children at the earliest point of lexical development are more likely to learn words that come from dense neighborhoods in the ambient language (Coady & Aslin, Reference Coady and Aslin2003; Storkel, Reference Storkel2004). This does not mean that if we examined the actual expressive lexicons of children with small vocabularies (within-child lexicons) we would find that all of the words sounded the same. They don't, although Dollaghan (Reference Dollaghan1994) showed that of all the monosyllabic words on an earlier version of parent checklists, 84% of the words had a least one neighbor.

As is the case for English, French-speaking children with small vocabularies seemed to be learning words that came from dense phonological neighborhoods in the ambient language. Given the phonotactic differences between the two languages, it was possible that the results for French would differ from those for English. Recall that the percentage of open, closed and vowel only word structures (ending in a vowel, a consonant or comprising vowel only) is 55·71%, 26·09% and 17·22% for French, and 27·48%, 55·71% and 16·81% for English. With ND being defined as lead, rhyme and consonant neighbors, it was possible that this difference in word structure would contribute to across-language differences, but none emerged. Further exploration of neighborhood types could be interesting.

The lexicons of children with low vocabulary scores were qualitatively different from those of children with higher vocabulary scores. The question is why. There are detailed accounts of both behavioral (Saffran & Graf Estes, Reference Saffran and Graf Estes2006) and computational evidence (Christiansen, Onnis, & Hockema, Reference Christiansen, Onnis and Hockema2009) of statistical learning of phonological detail in early lexical development. Children with small vocabularies, who are slow to expand their lexicons, are learning words that have many phonological neighbors in the input stream (overlapping word forms; Saffran & Graf Estes, Reference Saffran and Graf Estes2006). We suggest that late vocabulary learning is indicative of slowness to attune to the statistical regularities of the input language, but once attunement has been achieved, and constrained statistical learning has developed as a mechanism for learning first words, some of these toddlers continue to employ this strategy for an extended period of time (Extended Statistical Learning). This could be indicative of protolexical learning (Swingley, Reference Swingley2005). Swingley proposed that success with statistical learning enabled infants to generate a protolexicon of word forms that enabled less labored word learning, releasing cognitive processing to allow sound−meaning mapping. This constrained statistical learning may well be a necessary step en route to developing a lexicon. At some period, this constrained learning mechanism must be loosened, broadened or abandoned to allow mapping of words from sparser neighborhoods. How might this be achieved?

Our theory is that in early lexical learning, words from dense neighborhoods provide a recognizable familiar phoneme stream which is less taxing of short-term memory abilities than words from sparse neighborhoods (Saffran & Graf Estes, Reference Saffran and Graf Estes2006; Swingley, Reference Swingley2005). Words from dense neighborhoods presumably lay down representations in long-term memory that can be called on to aid new word learning. Mirman, Graf Estes and Magnuson (Reference Mirman, Graf Estes and Magnuson2010), in a study of network learning, demonstrated that high transitional probability generated distinct phonological representations that allowed for fast learning of novel ‘words’ that were partly comprised of the high-probability syllables. High-density words may serve the same function perhaps in a processing mechanism akin to redintegration (Gathercole, Reference Gathercole1999; Stokes, Wong, Fletcher & Leonard, Reference Stokes, Wong, Fletcher and Leonard2006; Stuart & Hulme, Reference Stuart, Hulme, Thorne and Page2009). These representations are organized into lexical networks (Vitevitch, Reference Vitevitch2008). In early lexical learning, words that come from dense neighborhoods need not be networked to each other, for example ball and bird are both of high densities (43 and 31 type neighbors respectively in British English) and both are learned early in development (Dale & Fenson, Reference Dale and Fenson1996). Representations would not be linked initially, but eventually their neighborhoods would be linked due to links formed from shared neighbors (e.g. bird – bed – fed – fell – fall – ball) as is shown in the network example (generated using Pajek; Batagelj & Mrvar, Reference Batagelj and Mrvar1998), in Figure 9.

Fig. 9. An example of how bird and ball can be linked within a lexicon.

We believe that it is at this point that the child's constrained statistical learning mechanism is relaxed or broadened, as the overlap in words creates larger networks that facilitate new word learning. Thus long-term memory (stored phonological structures of words) begins to influence new word learning. Recall that some children with very small lexicons had ND values within the range of their more able peers. It is possible that these children have failed to yet use a statistical learning mechanism, restricting word learning to highly functional words. Confirmation of this hypothesis awaits analysis of actual words in the lexicons of these children. Obviously this is speculative and requires testing with longitudinal data from children. Only analyses of within-child lexicons will answer this question.

Finally, it is important to note that this study on French confirmed the surprising results for WF found for English, that of a direct relationship with vocabulary size, whereas prior research (Storkel, Reference Storkel2004) predicted that small lexicons would be comprised of highly frequent words. As vocabulary size was positively correlated with age, we found the same results that Goodman et al. (Reference Goodman, Dale and Li2008) found for their entire word set. It is possible that our results resemble the whole-sample results from Goodman et al. because of the impact of adjectives and verbs in the current study. From Goodman et al. we know that verbs and adjectives tend to be acquired later than common nouns and that the correlation between the two former categories and word frequency was r(90)=0·22 and r(55)=0·28 respectively, compared with common nouns at r(256)=0·55 (see Figure 1 and Table 2 in Goodman et al., Reference Goodman, Dale and Li2008). Focusing only on open-class words, Goodman et al. found, as had Storkel (Reference Storkel2004), that words of higher frequency were learned earlier. Further investigation is warranted to unpack how word frequency and lexical categories interact in word learning.

Limitations and further research

There are a number of possible limitations that should be considered in this study. First, phonological strings are not the only determinants of word learning. There are a host of other variables that play a role, for example, conceptual, linguistic, social-pragmatic and perceptual sources of information (Booth & Waxman, Reference Booth and Waxman2008; Thiessen et al., Reference Thiessen, Hill and Saffran2005) that have not been explored here. Second, we considered only monosyllables for derivation of ND as other researchers have done. Multisyllabic words could be included but the effect is likely to be a skew in ND values because many multisyllabic words do not have neighbors (e.g. popcorn). Third, we studied only expressive vocabulary, not receptive vocabulary, and then only from parent-reported lexicons. Examining receptive and expressive vocabularies in the same children would be instructive. Fourth, it would also be instructive to carefully track the expanding lexicons of individual children to describe more finely the relationship between WF and lexical growth. Fifth, the distributions of neighborhood types should be examined (e.g. lead versus rhyme versus consonant neighborhoods; Zamuner, Reference Zamuner2009) to test where the mapping occurs most easily. Finally, longitudinal data, computational modeling and experimental studies of word learning (see Saffran, Reference Saffran and Johnson2009) should be used to examine the hypothesis proposed to account for the findings. It is possible that the relationship between neighborhood density and vocabulary size is an inverted U shape. Lexicons from a large sample of younger children would shed light on this possibility.

Summary

We have suggested that children who are slow to learn vocabulary (albeit typically developing in every other respect) may be slow to capitalize on the statistical regularities of the input language, and slow to use a statistical learning mechanism, and once they do so, may be slow to move on from this, thereby showing a period of extended statistical learning. A network model of neighborhood densities was invoked to explain how a statistical learning mechanism could be broadened or loosened to enable learning of words from sparse neighborhoods in the input stream. The theory is also able to account for late talkers who appear to be learning sparse words early in lexical development. The theory should account for other cases of late talkers, and for children learning other languages. This work may move us forward to examining intersections across experimental, computational and corpora-based research in language learning.

References

REFERENCES

Aslin, R. N. & Newport, E. L. (2008). What statistical learning can and can't tell us about language acquisition. In Colombo, J., McCardle, P. & Freund, L. (eds), Infant pathways to language: Methods, models and research directions, 1529. Hove: Psychology Press.Google Scholar
Baayen, R. H., Piepenbrock, R. & Gulikers, L. (1995). The CELEX Lexical Database (CD-ROM). Linguistic Data Consortium, University of Pennsylvania, Philadelphia, PA.Google Scholar
Batagelj, V. & Mrvar, A. (1998). Pajek: A program for large network analysis. Connections 2, 4757, available at <http://pajek.imfm.si/doku.php> accessed 10 December 2010.Google Scholar
Bishop, D. V. M., Price, T. S., Dale, P. S. & Plomin, R. (2003). Outcomes of early language delay: II. Etilogy of transient and persistent language difficulties. Journal of Speech, Language, and Hearing Research 46, 561–75.CrossRefGoogle Scholar
Booth, A. E. & Waxman, S. R. (2008). Taking stock as theories of word learning take shape. Developmental Sciences 11, 185–94.Google Scholar
Charles-Luce, J. & Luce, P. A. (1990). Similarity neighborhoods of words in young children's lexicons. Journal of Child Language 17, 205215.CrossRefGoogle ScholarPubMed
Christiansen, M. H., Onnis, L. & Hockema, S. A. (2009). The secret is in the sound: From unsegmented speech to lexical categories. Developmental Science 12, 388–95.CrossRefGoogle ScholarPubMed
Coady, J. A. & Aslin, R. N. (2003). Phonological neighbourhoods in the developing lexicon. Journal of Child Language 30, 441–69.CrossRefGoogle ScholarPubMed
Dale, P. S. & Fenson, L. (1996). Lexical development norms for young children. Behavior Research Methods, Instruments & Computers 28, 125–27.CrossRefGoogle Scholar
De Cara, B. & Goswami, U. (2002). Similarity relations among spoken words: The special status of rimes in English. Behavior Research Methods, Instruments, & Computers 34, 416–23.Google Scholar
Desmarais, C., Sylvestre, A., Meyer, F., Bairati, I. & Rouleau, N. (2008). Systematic review of the literature on characteristics of late-talking toddlers. International Journal of Language and Communication Disorders 43, 361–89.Google Scholar
Dollaghan, C. A. (1994). Children's phonological neighbourhoods: Half empty or half full? Journal of Child Language 21, 257–72.CrossRefGoogle ScholarPubMed
Fenson, L., Dale, P. S., Reznick, J. S., Thal, D., Bates, E., Hartung, J. P. et al. (1993). MacArthur Communicative Development Inventories: User's guide and technical manual. Baltimore: Brookes.Google Scholar
Fenson, L., Marchman, V. A., Thal, D., Dale, P. S., Reznick, J. S. & Bates, E. (2007). MacArthur-Bates Communicative Development Inventories: User's guide and technical manual, 2nd edn.Baltimore: Brookes.Google Scholar
Fourgeron, C. & Smith, C. L. (1999). French. Handbook of the International Phonetic Association, 7881. Cambridge: Cambridge University Press.Google Scholar
Gathercole, S. E. (1999). Cognitive approaches to the development of short-term memory. Trends in Cognitive Science 3, 410–19.Google Scholar
Gierut, J. A. & Dale, R. A. (2007). Comparability of lexical corpora: Word frequency in phonological generalization. Clinical Linguistics & Phonetics 21, 423–33.CrossRefGoogle ScholarPubMed
Goodman, J. C., Dale, P. S. & Li, P. (2008). Does frequency count? Parent input and the acquisition of vocabulary. Journal of Child Language 35, 515–31.Google Scholar
Haspelmath, M., Dryer, M. S., Gil, D. B. & Comrie, B. (eds)2008. The world atlas of language structures online. Munich: Max Planck Digital Library. Available online at <http://wals.info/feature/1> accessed 21 January, 2010.Google Scholar
Hohle, B., Bijeljac-Babic, R., Herold, B., Weissenborn, J. & Nazzi, T. (2009). Language specific prosodic preferences during the first half year of life: Evidence from German and French infants. Infant Behavior and Development 32, 262–74.CrossRefGoogle ScholarPubMed
Jusczyk, P. W., Luce, P. A. & Charles-Luce, J. (1994). Infants' sensitivity to phonotactic patterns in the native language. Journal of Memory and Language 33, 630–45.Google Scholar
Kern, S. (2003). Le compte-rendu parental au service de l'évaluation de la production lexicale des enfants français entre 16 et 30 mois. Glossa 85, 4862.Google Scholar
Kern, S. & Gayraud, F. (2010). IFDC. Grenoble: Les editions la cigale.Google Scholar
Klee, T. & Harrison, C. (2001). CDI Words and Sentences: Validity and preliminary norms for British English. Paper presented at Child Language Seminar, University of Hertfordshire, England.Google Scholar
Kuhl, P. (2004). Early language acquisition: Cracking the speech code. Nature Reviews: Neuroscience 5, 831–43.CrossRefGoogle ScholarPubMed
Kuhl, P. K., Conboy, B. T., Coffey-Corina, S., Padden, D., Rivera-Gaxiola, M. & Nelson, T. (2008). Phonetic learning as a pathway to language: New data and native language magnet theory expanded (NLM-e). Philosophical Transactions of the Royal Society B 363, 9791000.CrossRefGoogle ScholarPubMed
Ladefoged, P. (1999). American English. Handbook of the International Phonetic Association, 4144. Cambridge: Cambridge University Press.Google Scholar
Lany, J. & Saffran, J. R. (2010). From statistics to meaning: Infants' acquisition of lexical categories. Psychological Science 21, 284–91.CrossRefGoogle ScholarPubMed
Luce, P. A. & Pisoni, D. B. (1998). Recognizing spoken words: The neighborhood activation model. Ear & Hearing 19, 136.CrossRefGoogle ScholarPubMed
Maddieson, I. (2008a). Consonant inventories. In Haspelmath, M., Dryer, M. S., Gil, D. & Comrie, B. (eds), Chapter 1. Available online at <http://wals.info/feature/1> accessed 21 January 2010.+accessed+21+January+2010.>Google Scholar
Maddieson, I. (2008b). Consonant−vowel ratio. In Haspelmath, M., Dryer, M. S., Gil, D. & Comrie, B. (eds), Chapter 3. Available online at <http://wals.info/feature/1> accessed 21 January 2010.+accessed+21+January+2010.>Google Scholar
Mirman, D., Graf Estes, K. & Magnuson, J. S. (2010). Computational modeling of statistical learning: Effects of transitional probability vs. frequency and links to word learning. Infancy 15, 471–86.CrossRefGoogle Scholar
Munson, B. & Solomon, N. P. (2004). The effect of phonological neighborhood density on vowel articulation. Journal of Speech, Language, and Hearing Research 47, 1048–58.CrossRefGoogle ScholarPubMed
New, B., Brysbaert, M., Veronis, J. & Pallier, C. (2007). The use of film subtitles to estimate word frequencies. Applied Psycholinguistics 28, 661–77.CrossRefGoogle Scholar
Odeh, R. E. (1972). The generalized Mann−Whitney U-Statistic. Journal of the Royal Statistical Society. Series C (Applied Statistics) 21, 348–51.Google Scholar
Paul, R. (1996). Clinical implications of the natural history of slow expressive language development. American Journal of Speech-Language Pathology 5, 5–21.CrossRefGoogle Scholar
Rescorla, L. (2002). Language and reading outcomes to age 9 in late-talking toddlers. Journal of Speech, Language, and Hearing Research 45, 360–71.CrossRefGoogle ScholarPubMed
Saffran, J. R. (2002). Constraints on statistical language learning. Journal of Memory and Language 47, 172–96.Google Scholar
Saffran, J. R. (2003). Statistical language learning: Mechanisms and constraints. Current Directions in Psychological Science 12, 110–14.CrossRefGoogle Scholar
Saffran, J. R. (2009). What is statistical learning, and what statistical learning is not. In Johnson, S. (ed.), Neuroconstructivism, 180–95. Oxford Scholarship Online Monographs.Google Scholar
Saffran, J. R. & Graf Estes, K. (2006). Mapping sound to meaning: Connections between learning about sounds and learning about words. Advances in Child Development and behavior 34, 138.CrossRefGoogle ScholarPubMed
Scarborough, R. A. (2004). Coarticulation and the structure of the lexicon. Unpublished doctoral dissertation, University of California, Los Angeles. Available at <www.linguistics.ucla.edu/faciliti/research/scarb_diss.pdf>; accessed 7 May 2009.;+accessed+7+May+2009.>Google Scholar
Stokes, S. F. (2010). Neighborhood density and word frequency in toddlers. Journal of Speech, Language, and Hearing Research 53, 670–83.CrossRefGoogle ScholarPubMed
Stokes, S. F. & Klee, T. (2009). Factors that influence vocabulary development in two-year-old children. Journal of Child Psychology and Psychiatry 50, 498505.CrossRefGoogle ScholarPubMed
Stokes, S. F., Wong, A. M-Y., Fletcher, P. & Leonard, L. B. (2006). Nonword repetition and sentence repetition as clinical markers of specific language impairment: The case of Cantonese. Journal of Speech, Language, and Hearing Research 49, 219–36.CrossRefGoogle ScholarPubMed
Storkel, H. L. (2004). Do children acquire dense neighborhoods? An investigation of similarity neighborhoods in lexical acquisition. Applied Psycholinguistics 25, 201221.CrossRefGoogle Scholar
Storkel, H. L. (2008). First utterances. In Rickheit, G. & Strohner, H. (eds), Handbook of communication competence, 125–47. Berlin: Mouten de Gruyter.Google Scholar
Storkel, H. L. (2009). Developmental differences in the effects of phonological, lexical and semantic variables on word learning by infants. Journal of Child Language 36, 291321.CrossRefGoogle ScholarPubMed
Stuart, G. P. & Hulme, C. (2009). Lexical and semantic influences on immediate serial recall: A role for redintegration. In Thorne, A. & Page, M. (eds), Interactions between short-term and long-term memory in the verbal domain, 157–76. New York: Psychology Press.Google Scholar
Swingley, D. (2003). Phonetic detail in the developing lexicon. Language and Speech 46, 265–94.CrossRefGoogle ScholarPubMed
Swingley, D. (2005). Statistical clustering and the contents of the infant vocabulary. Cognitive Psychology 50, 86132.CrossRefGoogle ScholarPubMed
Thiessen, E. D., Hill, E. A. & Saffran, J. R. (2005). Infant-directed speech facilitates word segmentation. Infancy 7, 5371.Google Scholar
Vitevitch, M. S. (2008). What can graph theory tell us about word learning and lexical retrieval? Journal of Speech, Language, and Hearing Research 51, 408422.CrossRefGoogle ScholarPubMed
Wright, R. (2004). Factors of lexical competition in vowel articulation. In Local, J., Ogden, R. & Temple, R. (eds), Papers in Laboratory Phonology VI, 7587. Cambridge: Cambridge University Press.Google Scholar
Zamuner, T. S. (2009). The structure and nature of phonological neighborhoods in children's early lexicons. Journal of Child Language 36, 3–21.Google Scholar
Figure 0

Table 1. Selected phonological features of English and French

Figure 1

Table 2. Age (months) and sex breakdown for the sample

Figure 2

Fig. 1. Scatterplot of neighborhood density by vocabulary size.

Figure 3

Fig. 2. Scatterplot of word frequency by vocabulary size.

Figure 4

Table 3. Table of coefficients for the multiple regression predicting IFDC scores

Figure 5

Fig. 3. Error bar plots for low and high vocabulary groups for neighborhood density and word frequency.

Figure 6

Fig. 4. Scatterplot of neighborhood density by vocabulary size for 900 simulated cases.

Figure 7

Fig. 5. Scatterplot of word frequency by vocabulary size for 900 simulated cases.

Figure 8

Fig. 6. Scatterplot of neighborhood density by vocabulary size for five random samples and actual child data.

Figure 9

Fig. 7. Scatterplot of word frequency by vocabulary size for five random samples and actual child data.

Figure 10

Fig. 8. Error bar plots for neighborhood density and word frequency for five random samples and actual child data.

Figure 11

Table 4. Variance in vocabulary scores accounted for by neighborhood density and word frequency in English and French for the total sample and low and high vocabulary groups

Figure 12

Fig. 9. An example of how bird and ball can be linked within a lexicon.