A large body of research has documented differences in linguistic ability between monolingual and bilingual adults on tests that require lexical access and fluency. For example, bilinguals produce fewer words than monolinguals on verbal fluency tests (Gollan, Montoya & Werner, Reference Gollan, Montoya and Werner2002; Rosselli, Ardila, Araujo, Weekes, Caracciolo, Padilla & Ostrosky-Solis, Reference Rosselli, Ardila, Araujo, Weekes, Caracciolo, Padilla and Ostrosky-Solis2000), require longer response times than monolinguals on picture naming tasks (Gollan, Fennema-Notestine, Montoya & Jernigan, Reference Gollan, Fennema-Notestine, Montoya and Jernigan2007; Gollan, Montoya, Fennema-Notestine & Morris, Reference Gollan, Montoya, Fennema-Notestine and Morris2005; Roberts, Garcia, Desrochers & Hernandez, Reference Roberts, Garcia, Desrochers and Hernandez2002), make more errors on a speeded version of the task (Bialystok, Craik & Luk, Reference Bialystok, Craik and Luk2008a), experience more tip-of-the-tongue episodes than monolinguals (Gollan & Acenas, Reference Gollan and Acenas2004; Gollan & Silverberg, Reference Gollan and Silverberg2001), and produce longer reaction times than monolinguals on a lexical decision task (Bijeljac Babic, Biardeau & Grainger, Reference Bijeljac Babic, Biardeau and Grainger1997; Ransdell & Fischler, Reference Ransdell and Fischler1987). These differences are even reported when bilinguals respond in their first and dominant language (Ivanova & Costa, Reference Ivanova and Costa2008). All of these tasks depend on vocabulary level and therefore point to a potential challenge for bilinguals in linguistic knowledge and fluent language production. Most studies of bilingual language processing and language proficiency compare bilingual performance in their two languages, determining, for example, the relation between the dominant and non-dominant languages (e.g., Costa, Reference Costa, Kroll and B De Groot2005; Gollan, Salmon, Montoya & da Pena, Reference Gollan, Salmon, Montoya and da Pena2010). The question for the present study, however, is different: Do bilingual and monolingual adults differ in vocabulary level in the language of the community? This possibility is rarely assessed in the studies reporting differences in levels of linguistic ability noted above.
There are both theoretical and clinical reasons why this issue is important. Theoretically, performance on verbal and, to some extent, cognitive tasks depends crucially on levels of linguistic proficiency. In psycholinguistic studies of linguistic processing and modeling, vocabulary size influences outcomes; for example, word identification depends on the presence and strength of lexical neighbors (e.g., McClelland & Elman, Reference McClelland and Elman1986; Seidenberg & McClelland, Reference Seidenberg and McClelland1989). Moreover, models of language comprehension and production have been based on such tasks as lexical decision, verbal fluency, and picture naming, all of which are sensitive to basic vocabulary knowledge and are performed differently by bilinguals. If bilingual speakers do not have language proficiency equivalent to monolingual speakers, then such models may need to be revised to accommodate proficiency differences associated with bilingualism.
An example of the influence of vocabulary knowledge on linguistic performance comes from the verbal fluency task. The basic procedure is to ask participants to generate as many words as possible in 60 seconds, words that satisfy a criterion determined either by the category (semantic fluency) or the initial letter of the word (phonological fluency). Vocabulary size is positively related to performance in these tasks in both younger and older monolingual adults (Hedden, Lautenschlager & Park, Reference Hedden, Lautenschlager and Park2005; Salthouse, Reference Salthouse1993). However, bilingual participants typically produce fewer words than monolingual participants, even if they are allowed to produce words in both languages (Gollan et al., Reference Gollan, Montoya and Werner2002). In a behavioral study of monolingual and bilingual young adults, performance of the bilinguals depended on their receptive vocabulary scores in English, the language of testing (Bialystok, Craik & Luk, Reference Bialystok, Craik and Luk2008b). More dramatically, the actual process by which individual words were retrieved was shown in a time-course analysis to be different for bilinguals than monolinguals and different for bilinguals with higher receptive vocabulary than for those with a lower receptive vocabulary (Luo, Luk & Bialystok, Reference Luo, Luk and Bialystok2010). Thus, language production is affected not only by bilingualism but also by vocabulary level in the language of testing, placing limits on the generalizability of theoretical models of language comprehension and production.
The possibility that there are pervasive differences in vocabulary size of monolingual and bilingual adults also has clinical implications. Tests used in neuropsychological assessments for diagnosis are often based on language production, but the clinical interpretations of performance on these tests are determined by monolingual norms. Thus, standard versions of the verbal fluency such as in the Delis–Kaplan Executive Function Battery (DKEFS; Delis, Kaplan & Kramer, Reference Delis, Kaplan and Kramer2001) and the Controlled Oral Word Association Test (COWAT; Strauss, Sherman & Spreen, Reference Strauss, Sherman and Spreen2006) are used diagnostically to determine the integrity of brain function and the presence of neurological disease. Clinicians, however, are unlikely to make adjustments in the standardized scores to account for bilingualism or more generally even determine whether or not a patient is bilingual. The real possibility that standard assessment is misleading for bilingual patients creates the potential for misinterpretation at best and misdiagnosis at worst. Since vocabulary is key to these results, it is important to establish whether or not there are systematic differences in vocabulary size for monolingual and bilingual adults.
One standardized method to evaluate vocabulary level is the Peabody Picture Vocabulary Test (PPVT–III). Participants are shown a plate with four pictures and indicate which one corresponds to a word spoken by the experimenter. Items become progressively more difficult and testing stops when an error criterion has been met and raw scores are converted to standard scores based on the participant's age. These standard scores have a population mean of 100 and standard deviation of 15.
Individual studies that have compared monolingual and bilingual participants for PPVT test scores in English have had mixed results. In some studies monolingual and bilingual participants achieved equivalent scores (e.g., Bialystok, Craik, Klein & Viswanathan, Reference Bialystok, Craik, Klein and Viswanathan2004) but in others, monolinguals outperformed bilinguals in the language of testing (e.g., Portocarrero, Burright & Donovick, Reference Portocarrero, Burright and Donovick2007). Thus, the results are inconsistent and it is difficult to extrapolate to the population. The present approach is to combine data from individual studies to estimate the population distribution of a vocabulary score obtained from a large sample of monolingual and bilingual participants. Because the PPVT is a standardized test administered under identical procedures, scores from multiple studies can be combined without regard for the individual study in which the test was administered. Thus, the analysis is a large sample investigation of vocabulary size and is not a meta-analysis of previous studies.
In a recent study, we aggregated the PPVT scores from 1,738 children between three and ten years old (Bialystok, Luk, Peets & Yang, Reference Bialystok, Luk, Peets and Yang2010). As with adults, individual studies produce mixed results for the relation between receptive vocabulary levels for monolingual and bilingual participants, but by examining the data for a large sample of children, estimates come closer to population norms. In the study with children, there was clear evidence that for all ages included, monolingual children obtained significantly higher scores on the receptive vocabulary tests than did bilinguals. More importantly, the scores for both groups were distributed normally, with monolinguals obtaining a mean score of 106.8 and standard deviation of 12.3, and bilinguals, a mean score of 96.3 and standard deviation of 13.0. These results indicate that on average, bilingual children have a smaller vocabulary in the language of testing than do comparable monolinguals.
In the present study, we applied this method to vocabulary data from adults to determine whether such consistent population differences can be found in adulthood.
Method and results
Data were aggregated from 20 studies conducted by the first author over a period of seven years in which the PPVT–III had been administered to monolingual and bilingual adults. The sample included 1,605 participants between the ages of 17 and 89 years. There were 797 monolingual speakers of English (mean age = 34.8 years, SD = 21.1) and 808 bilingual participants who spoke English plus another language (mean age = 33.1 years, SD = 20.1), with no age difference between groups, F(1, 1603) = 2.8, MSE = 423.5, n.s. The non-English language included a large variety of European and Asian languages with no concentration on any one language. All participants lived in an English-speaking multicultural city, reported using both English and their other language on a daily basis, and stated they had high proficiency in both languages. Younger adults had typically been bilingual since at least beginning school and older adults since at least the age of 20 years old (i.e., at least for 50 years). Although English is most often the second language learned (L2), because of the daily use of both languages in an English community English is often the dominant language, making the distinction between L1 and L2 difficult to assess, In the study with children (Bialystok et al., Reference Bialystok, Luk, Peets and Yang2010), a comparison of Asian and non-Asian languages indicated no difference in English PPVT scores. Each of the studies from which these participants were extracted included measures to match monolingual and bilingual participants on cognitive variables, but because those measures were different across studies, it is not possible to aggregate cognitive scores for this analysis. However, since the comparability of cognitive level for monolingual and bilingual participants was confirmed individually for each study, it can be assumed that the total sample of monolingual and bilinguals participants is also equivalent on such measures.
The analysis was based on the age-corrected standard scores from the PPVT–III. The relative frequency distributions of standard scores for monolingual and bilingual participants are presented in Figure 1. Univariate statistics indicated that these frequencies were normally distributed, with skewness indices of 0.30 and 0.01 for monolinguals and bilinguals, respectively. A perfectly normal Gaussian distribution would result in a skewness index of zero, so the near-zero values indicate that the distributions approximated normality. Note that ANOVA is robust with respect to small deviations from normality in a large sample such as this, so simple tests of normality cannot be interpreted. A one-way ANOVA showed that the mean vocabulary score for the monolinguals was significantly higher than that for the bilinguals, F(1, 1603) = 138.4, MSE = 164.7, p < .0001. The statistical significance of the difference was confirmed in the non-overlapping 95% confidence intervals for monolinguals (104.4, 106.2) and bilinguals (96.8, 98.6).
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160921030115-08376-mediumThumb-S136672891100040X_fig1g.jpg?pub-status=live)
Figure 1 Distribution of PPVT–III standard scores for monolingual and bilingual participants.
Because the age range was large, participants were divided into subgroups of younger and older adults to determine whether the pattern was consistent across the lifespan. Table 1 presents PPVT standard scores for the monolingual and bilingual participants by age group. A two-way ANOVA for age group and language group showed that older adults scored higher than young adults, F(1, 1601) = 24.7, MSE = 162.3, p < .0001, and monolinguals scored higher than bilinguals, F(1, 1601) = 105.8, MSE = 162.3, p < .0001, with no interaction, F < 1.
Table 1. Mean standard score for the PPVT–III (and standard deviation) for monolingual and bilingual participants by age group.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160921030115-08853-mediumThumb-S136672891100040X_tab1.jpg?pub-status=live)
Discussion
An analysis of a sample of over 1,600 participants showed that monolinguals obtained significantly higher receptive vocabulary scores than bilinguals in the language generally used for empirical testing, clinical assessment, and theoretical model building. These results replicate those from a similar study with children up to the age of 10 years (Bialystok et al., Reference Bialystok, Luk, Peets and Yang2010). Together, the two studies point to a lifelong shift in receptive vocabulary level in the community language for individuals who continue to function in another language as well. Although there is no reason to believe that bilinguals know fewer words in total than their monolingual peers, the evidence from the two studies indicates a significant difference in vocabulary size for the community language.
Although the distribution of PPVT scores approximates the Gaussian distribution for both monolingual and bilingual participants, the shape of the bilingual curve deviates from normality slightly more than does the monolingual curve. In contrast, both distributions were perfectly normal in the study with children (Bialystok et al., Reference Bialystok, Luk, Peets and Yang2010). This difference is likely attributable to the greater heterogeneity of adult bilinguals than is the case for children. The great majority of the bilingual children in our previous study spoke English at school and in the community and another language at home; the adult bilinguals in the present study had more diverse language histories. For example, the adults became bilingual at different ages and under different circumstances, factors that affect proficiency in the new language. In a study of 157 young adults, early bilinguals obtained higher PPVT scores than late bilinguals, all else being equal (Luk, de Sa & Bialystok, Reference Luk, de Sa and Bialystok2011). Similarly, the study by Portocarrero et al. (Reference Portocarrero, Burright and Donovick2007) reported a negative correlation between age of arrival in the United States and PPVT score. Therefore, unlike the children in the previous study for whom English was largely the dominant language, the adults in the present study have a greater variety of language backgrounds and language dominance patterns. Importantly, however, the distribution of scores demonstrates that neither of these factors overrides the general pattern showing a shift in the mean score by language group.
The results also showed an equivalent effect in both the monolingual and bilingual groups in which the older adults obtained higher vocabulary scores than the younger adults. This pattern is consistent with research showing that vocabulary size increases through late middle age, resulting in larger a vocabulary for older adults (review in Burke & Shafto, Reference Burke, Shafto, Craik and Salthouse2008). Thus, in spite of an absolute difference in vocabulary level between monolinguals and bilinguals, the normal processes associated with aging and cognitive change were maintained.
Interpretations of research and outcomes of clinical diagnoses make assumptions about language proficiency in general and vocabulary size in particular that may not apply to bilingual populations. The results of the present study point to the importance of objectively determining the vocabulary level of bilinguals in the language in which testing or assessment is taking place. Although the present results have no direct clinical application, they carry significant implications for theoretical interpretations and clinical decisions.