Introduction
Bilingual and multilingual individuals vary widely in their exposure to, and socially distributed use of, language, particularly if they live in highly multilingual locations. Individual differences in static language experiences, such as age of language acquisition (AoA; e.g., Flege, Munro & MacKay, Reference Flege, Munro and MacKay1995; Gullifer, Chai, Whitford, Pivneva, Baum, Klein & Titone, Reference Gullifer, Chai, Whitford, Pivneva, Baum, Klein and Titone2018; Kousaie, Chai, Sander & Klein, Reference Kousaie, Chai, Sander and Klein2017; Luk, De Sa & Bialystok, Reference Luk, De Sa and Bialystok2011; Piske, MacKay & Flege, Reference Piske, MacKay and Flege2001; Subramaniapillai, Rajah, Pasvanis & Titone, Reference Subramaniapillai, Rajah, Pasvanis and Titone2018; Titone, Libben, Mercier, Whitford & Pivneva, Reference Titone, Libben, Mercier, Whitford and Pivneva2011), and current language experiences, such as amount of second language (L2) exposure (e.g., Gullifer et al., Reference Gullifer, Chai, Whitford, Pivneva, Baum, Klein and Titone2018; Hartanto & Yang, Reference Hartanto and Yang2016; Hofweber, Marinis & Treffers-Daller, Reference Hofweber, Marinis and Treffers-Daller2016; Jylkkä, Soveri, Wahlström, Lehtonen, Rodriguez-Fornells & Laine, Reference Jylkkä, Soveri, Wahlström, Lehtonen, Rodriguez-Fornells and Laine2017; Pivneva, Mercier & Titone, Reference Pivneva, Mercier and Titone2014; Prior & Gollan, Reference Prior and Gollan2011; Subramaniapillai et al., Reference Subramaniapillai, Rajah, Pasvanis and Titone2018; Titone, Gullifer, Subramaniapillai, Rajah & Baum, Reference Titone, Gullifer, Subramaniapillai, Rajah, Baum and Bialystok2017), drive linguistic performance and executive control abilities. However, there is not yet consensus on the best practices for measuring current language experience (Baum & Titone, Reference Baum and Titone2014; Gollan, Weissberger, Runnqvist, Montoya & Cera, Reference Gollan, Weissberger, Runnqvist, Montoya and Cera2012; Surrain & Luk, Reference Surrain and Luk2017; Takahesu Tabori, Mech & Atagi, Reference Takahesu Tabori, Mech and Atagi2018; Tomoschuk, Ferreira & Gollan, Reference Tomoschuk, Ferreira and Gollan2018) despite its theoretical importance (Abutalebi & Green, Reference Abutalebi and Green2016; Green & Abutalebi, Reference Green and Abutalebi2013), in large part because bilingualism and bilingual experience are not homogenous constructs.
There is now a long history of measuring the impact of bilingual experience on language acquisition and processing, often through the use of self-report assessment instruments that tap into various constructs (see e.g., Bahrick, Hall, Goggin, Bahrick & Berger, Reference Bahrick, Hall, Goggin, Bahrick and Berger1994; Li, Sepanski & Zhao, Reference Li, Sepanski and Zhao2006; Li, Zhang, Tsai & Puls, Reference Li, Zhang, Tsai and Puls2014; Marian, Blumenfeld & Kaushanskaya, Reference Marian, Blumenfeld and Kaushanskaya2007; McLaughlin, Reference McLaughlin1977). However, standard operating procedures at present generally involve the computation of language proficiency along a single dimension (e.g., L2 proficiency). Few studies, particularly in cognitive domains, assess the impact of other background measures commonly elicited by standardized language history questionnaires, such as daily exposure to known languages or daily language use in various communicative contexts (see e.g., critiques raised by Gollan et al., Reference Gollan, Weissberger, Runnqvist, Montoya and Cera2012; Surrain & Luk, Reference Surrain and Luk2017; Tomoschuk et al., Reference Tomoschuk, Ferreira and Gollan2018). Moreover, few researchers treat language experience measures in a continuous manner (see e.g., critiques raised by Baum & Titone, Reference Baum and Titone2014), opting instead to dichotomize continuous variables into discrete groups. Although grouping may be warranted in some cases, such as when two distinct populations are being compared to address a particular question, it can be problematic in others, leading to a loss of information and poor statistical estimates, particularly when the phenomena under investigation exist on a continuum (MacCallum, Zhang, Preacher & Rucker, Reference MacCallum, Zhang, Preacher and Rucker2002). One likely reason for these practices is that the sheer number of variables characterizing individual differences in L2 experience can be overwhelming and highly correlated, thus creating challenges for researchers searching for a single dimension upon which to focus.
Here, we hope to overcome some of these limitations by offering an innovative way to characterize current language experience by capitalizing on measures often assessed but not evaluated, which, when combined with other data reduction techniques like principal component analysis, can yield a manageable set of individual differences variables. This approach involves a measure of language entropy that indexes the relative balance or diversity in the daily usage of two or more languages. Higher entropy values relate to more balanced language use and greater language diversity.
Practically, language diversity can vary among global locations; among communicative contexts within a location; and, crucially, among individuals. For example, while some geographic areas house a bilingual population, the bilinguals may tend function in a compartmentalized fashion, restricting use of specific languages to specific communicative contexts and avoiding language mixing. In other areas, bilinguals may tend to function in an integrated fashion, where all languages are used regardless of the communicative context and where language mixing (either across or within utterances) is common (see e.g., Beatty-Martínez & Dussias, Reference Beatty-Martínez and Dussias2017). Integrated bilingual language usage is the reputation of Montréal, the site of the present study (Heller, Reference Heller1982; Higgins, Reference Higgins, Leap and Boellstorff2004; Lamarre, Reference Lamarre2002). The multilingual nature of Montréal is apparent to any resident or visitor and is generally supported by language demographic data from the Canadian census (Statistics Canada, 2017).
On the census, when Montréalers are asked to identify their “most commonly used language(s)”, a vast majority report using French across communicative contexts (approximately 60–70% of respondents depending on the context), but a fair percentage also report the primary use of two or more languages (approximately 4–11%) in these contexts (proportionally more than Canada generally; 2–5%). These proportional data are illustrated in Figure 1A. In Figure 1B we illustrate the same proportional data as language entropy, which gives a sense of how language diversity varies by communicative context and geographic area. Across all contexts, Montréal exhibits higher entropy, and thus higher diversity than Québec, largely because of the dominance of the French language in Québec. Interestingly, Montréal has equal or lower language diversity relative to Canada as a whole for home-related contexts (i.e., mother tongue and primary home language). However, for the work-related context, Montréal exhibits substantially higher language diversity, indicative of more integrated usage in this context.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210119073730145-0852:S1366728919000026:S1366728919000026_fig1.png?pub-status=live)
Fig. 1. Plot of census data related to most common language used across available communicative contexts for the following geographic locations: Montréal (city), Québec (province), and Canada (country). We obtained count data from Statistics Canada (2017) for the following questions: What is the language that this person first learned at home in childhood and still understands? (i.e., mother tongue on the horizontal axis), What language does this person speak most often at home? (i.e., home language on the horizontal axis), In this job, what language did this person use most often? (i.e., work language on the horizontal axis). We transformed these data to proportions (illustrated on Panel A) and computed language entropy (illustrated on Panel B). Of note, for each question, respondents could report the common use of multiple languages. In these cases, we condensed the categories to “Other – Two languages” and “Other – Three languages.”
We note two caveats here that may result in the underestimation of language diversity in Montréal as illustrated. First, the census lacks questions about other communicative contexts that may reveal higher language diversity (e.g., social settings). Second, when Montréalers are asked about “other commonly used languages” (besides the most commonly used language(s); not illustrated here), English in Montréal experiences a significant boost in usage relative to other languages in the broader Canadian context. Together, these data support the notion that Montréal is highly bilingual and that there is substantial variability in language diversity among geographic locales and among different communicative contexts. Moreover, language entropy can provide a concrete estimate of diversity while simultaneously reducing the complexity of the data, making data visualization and modeling more tractable. Importantly, variability in language diversity holds theoretical implications for behavior, brain structure, and brain function (Abutalebi & Green, Reference Abutalebi and Green2016; Beatty-Martínez & Dussias, Reference Beatty-Martínez and Dussias2017; Green & Abutalebi, Reference Green and Abutalebi2013; Gullifer et al., Reference Gullifer, Chai, Whitford, Pivneva, Baum, Klein and Titone2018; Hartanto & Yang, Reference Hartanto and Yang2016; Yang, Hartanto & Yang, Reference Yang, Hartanto and Yang2016; Poarch, Vanhove & Berthele, Reference Poarch, Vanhove and Berthele2018), and our recent work shows that this variability is measurable at the level of the individual (Gullifer et al., Reference Gullifer, Chai, Whitford, Pivneva, Baum, Klein and Titone2018).
Recent psycholinguistic and neurocognitive perspectives of bilingualism, such as the adaptive control hypothesis (Abutalebi & Green, Reference Abutalebi and Green2016; Green & Abutalebi, Reference Green and Abutalebi2013), predict that individual differences in how bilinguals use their two or more languages across different social settings (i.e., interactional context) are critically important in determining how bilinguals represent, access, and control those languages. For example, bilinguals who use their two languages in primarily single language contexts (i.e., compartmentalized bilinguals) are predicted to have different language and executive control demands relative to bilinguals who use their two languages in dual language contexts (i.e., integrated bilinguals). We note that, while novel, this focus is a successor in the spirit of seminal work on language mode by Grosjean (Reference Grosjean1997, Reference Grosjean and Nichol2001). Accordingly, recent work has begun to examine the social diversity of language use in relation to executive control capacity and language processing by sampling various groups of participants from locations that are known to differ in the social diversity of language use (e.g., Beatty-Martínez & Dussias, Reference Beatty-Martínez and Dussias2017) or by computing difference scores between self-report usage measures for the first language (L1) and second language (e.g., Birdsong, Gertken & Amengual, Reference Birdsong, Gertken and Amengual2012; Hartanto & Yang, Reference Hartanto and Yang2016, Poarch et al., Reference Poarch, Vanhove and Berthele2018). While innovative, such approaches are limited in the ability to assess nuanced differences in the social diversity of language usage that likely exist at the level of the individual within a population or that arise from situations where people use more than two languages. Thus, despite the practical and theoretical significance of the social diversity of language use among bilinguals, the field has not converged upon an optimal way of characterizing this source of individual variation (see also Fricke, Zirnstein, Navarro-Torres & Kroll, Reference Fricke, Zirnstein, Navarro-Torres and Kroll2019; Kroll, Dussias & Bajo, Reference Kroll, Dussias and Bajo2018; Takahesu Tabori et al., Reference Takahesu Tabori, Mech and Atagi2018).
We propose language entropy as an innovative way to measure individual differences in the social diversity of language use, including the interactional context of language usage. Entropy is a concept with its roots in physics: a property of physical systems that is proportional to number of different configurations, or states, of those systems. The concept was adapted for information theory by Claude Shannon as a means to quantify information content or uncertainty (Shannon, Reference Shannon1948). Entropy has been used previously in psycholinguistics to quantify lexical and syntactic complexity (e.g., del Prado Martín, Kostić & Baayen, Reference del Prado Martín, Kostić and & Baayen2004; Hale, Reference Hale2003; Levy, Reference Levy2008), and we recently applied language entropy to study the neurocognition of bilingualism using resting state functional connectivity (see Gullifer et al., Reference Gullifer, Chai, Whitford, Pivneva, Baum, Klein and Titone2018). Entropy is useful psychometrically as it yields a continuous measure of diversity, and it is computed as a function of the probability with which a set of events or states occur. Information about individuals’ language use or exposure within various communicative contexts (e.g., language use at home, work, in social setting, etc.) is frequently elicited by the standard language history questionnaires that have become ubiquitous within the field (Anderson, Mak, Chahi & Bialystok, Reference Anderson, Mak, Chahi and Bialystok2018; Birdsong et al., Reference Birdsong, Gertken and Amengual2012; Dunn & Fox Tree, Reference Dunn and Fox Tree2009; Li et al., Reference Li, Zhang, Tsai and Puls2014; Marian et al., Reference Marian, Blumenfeld and Kaushanskaya2007), and these data are inherently proportional (or, in the case that data are collected via Likert scale, can be converted to a proportion). Thus, language entropy can be straightforwardly assessed for each communicative context, or globally across all contexts. We provide a fully-documented R package that computes language entropy (Gullifer & Titone, Reference Gullifer and Titone2018), available at: https://github.com/jasongullifer/languageEntropy.
Language entropy values range from 0 to log n, where n is the number of languages that entropy is computed over (e.g., with two languages, max entropy is 1; with three, it is approximately 1.585). A communicative context that is completely compartmentalized, where an individual reports using only one language, will have an entropy of 0, signifying no language diversity and very high predictability of an upcoming language within this context (i.e., a single-language interactional context). In contrast, a communicative context that is completely integrated, where two or more languages are used in perfect balance, will have maximum entropy, signifying high language diversity and very low predictability of an upcoming language within this context. Language entropy ranges continuously between 0 and the maximumFootnote 1, allowing for the assessment of language use that falls in between compartmentalized and integrated dual language contexts.
Thus, the goal of this paper is to establish a proof of concept, where we demonstrate the utility of using entropy as an estimate for the social diversity of language use in two ways. First, we show that entropy exhibits substantial variation across speakers and communicative contexts. Second, we address a classic question in L2 acquisition and bilingualism: namely, what factors predict self-perceived L2 accentedness and L2 abilities (e.g., similar to work by Flege et al., Reference Flege, Munro and MacKay1995). To this end, we analyze data from a large sample of bilinguals/multilinguals (N = 507) drawn from the highly multilingual city of Montréal, QC, Canada, and we replicate a classic finding using self-report data: that L2 AoA and L2 current exposure predict self-perceptions of L2 accentedness and L2 abilities. Crucially, we show that the social diversity of language use exhibits additional predictive power over these classic predictors for both outcome measures. Moreover, for L2 accentedness ratings, the impact of social diversity interacts with L2 AoA. Together, these results suggest that the social diversity of language usage is an important variable for future studies to consider. In the discussion we highlight potential applications for future research, guided by the adaptive control hypothesis.
Method
Participants
We analyzed language history data collected in the McGill Language and Multilingualism (MLL) lab over several years. From 2008 to 2015, approximately 507 bilingual or multilingual participants were tested who reported detailed language history information (including the relative exposure to and use of two or more languages). We report a qualitative analysis of participant characteristics in the results section below.
Materials
All participants in this sample completed a language background questionnaire adapted from the LEAP-Q (Marian et al., Reference Marian, Blumenfeld and Kaushanskaya2007) or LHQ 2.0 (Li et al., Reference Li, Zhang, Tsai and Puls2014), allowing us to probe language usage within the Montréal context.
Language history information
For the purposes of the analysis and entropy computation, we extracted several types of background measures outlined below. We extracted basic demographic information, including L2 AoA and L2 exposure, which served as predictors in the analysis. We extracted data on “language exposure in different usage contexts” for the purposes of computing language entropy, which served as another class of predictors in the analysis. Finally, we extracted self-reported L2 accent perception and L2 abilities, which served as outcome measures in the analysis. For some of these extracted measures, we applied data reduction techniques (i.e., aggregation or principal component analysis). A summary of the data, including aggregate measures or components, is available in Table 1.
Table 1. Participant language history information.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210119073730145-0852:S1366728919000026:S1366728919000026_tab1.png?pub-status=live)
Basic demographic information
In the questionnaire, participants reported basic information about their demographics and language use. We extracted classic measures of L2 exposure, such as L2 AoA (based on the onset of learning) and global exposure to the L1, L2, and L3 (third language). Global L2 exposure is frequently used as a covariate in the MLL lab (e.g., Pivneva et al., Reference Pivneva, Mercier and Titone2014; Subramaniapillai et al., Reference Subramaniapillai, Rajah, Pasvanis and Titone2018), and these global exposure measures did not factor in to the computation of language diversity to allow for a comparison of the measures.
Language exposure in different usage contexts
Participants reported the extent to which they used the L1, L2, and L3 in a variety of communicative contexts in the home, at work, in social settings, for reading, and for speaking. The questionnaire elicited language use at home, work, and in social settings via Likert scales (e.g., “Please rate the amount of time you use each language at home”), with a score of 1 indicating “no usage at all” and a score of 7 indicating “usage all the time”. We baselined Likert data at 0 by subtracting 1 from each response. Thus, a value of 0 reflects “no usage at all.” We converted these data to proportions of usage by dividing a given language's score by the sum total of the scores within context. For example, a participant who reported (after baselining) the following data for language usage at home, L1: 6, L2: 5, L3: 0, would receive the following proportions for the home context, L1: 6/11, L2: 5/11, L3: 0/11.
Language use for reading and speaking were collected through percentage of use (“What percentage of time would you choose to speak each language?”), which totaled to 100% within a particular context. We converted percentages to proportions, and we used this proportional usage data to compute the diversity of language use in each context.
L2 accent perception
Participants reported the extent to which they believe they have an accent in the L2 and the extent to which they believe others identify them as nonnative speakers based on their accent. This information was elicited through two questions using seven-point Likert scales (“How much of a foreign accent do you have in L2?”, 1 indicates no accent, 7 indicates a strong accent; and “Please rate how frequently others identify you as a non-native speaker based on your ACCENT in French”, 1 indicates never, 7 indicates all the time). Across the sample, the self-accent perception and other accent perception were positively correlated (Spearman rho: 0.79, p < 0.05), and we computed a mean accent perception score (M: 3.62, SD: 1.79). We used this score as a dependent variable in the analyses.
L2 abilities
Participants answered a series of 20 questions that probed self-rated abilities in the L2 and L1, including speaking, reading, writing, translating, listening, pronunciation, fluency, vocabulary knowledge, grammatical knowledge, and overall competence. To reduce the complexity of these data, we conducted a principal components analysis (three components, determined via scree plot, with oblimin rotation) using the psych package {Revelle:2017um} for R (R Core Team, 2017). Variables related to the L2 loaded onto one component, and this component explained 39% of the variance in the data. Variables related to the L1 loaded onto two other components, and each component explained 15% of the variance in the data. We extracted the scores for the L2 component to serve as an index of self-rated L2 abilities, and we used this as a dependent variable in the analysis.
Computing language entropy
For each usage context (see “Language exposure in different usage contexts”, above), we computed Shannon entropy (H) using the following equation $H = {\rm \;} -\mathop \sum \nolimits_{i = 1}^n P_ilog_2\lpar {P_i} \rpar $ and the methods available in the languageEntropy R package (Gullifer & Titone, Reference Gullifer and Titone2018). Here, n represents the total possible languages within the context (i.e., 3) and P i is the proportion that languagei is used within a context. To illustrate, if hypothetical bilingual reports using French 80% of the time and English 20% of the time within the work context, one computes language entropy by summing together 0.80*log 2(0.80) and 0.20*log 2(0.20), then multiplying by -1 to yield a positive entropy value. Thus, the hypothetical individual's language entropy in the work context would be approximately 0.72.
Theoretically, the entropy distribution has a minimum value of 0 that occurs when the proportion of usage for a given language is 1.0, representing a completely compartmentalized context. The distribution has a maximum value equal to log n (approximately 1.585 for three languages) when the proportion of use for each language is equivalent, representing a completely integrated context.
Our procedure resulted in five entropy scores for each participant, which pertained to language entropy for home, work, social, reading, and speaking. To reduce complexity of these data, we conducted a principal components analysis on the entropy data (two components, determined via scree plot, with oblimin rotation). The first component comprised reading, speaking, home, and social entropy, and this component explained 44% of the variance in the data. The second component comprised work with some cross-loading from social entropy, and this component explained 21% of the variance in the data. We extracted the component scores for each participant to serve as indices of language entropy at work and language entropy everywhere else.
Results
The data were prepared, plotted, and analyzed in R (R Core Team, 2017) using tidyverse (Wickham, Reference Wickham2017).
Qualitative analysis of participant characteristics
First, we offer a qualitative analysis of participant characteristics related to static historical language experience (i.e., L2 AoA) and current language experience (e.g., L2 exposure and language entropy), reported in Table 1.
In terms of static historical language experience, of the sampled 507 participants, we identified 51 individuals as simultaneous bilinguals, who acquired the L2 at or before 1 year of age, and 456 individuals as sequential bilinguals. Thus, the majority of the sample reported acquiring their L2 before the age of 15 (N = 494). Based on L2 AoA, we identified 240 participants as native English speakers, and 267 participants as native French speakers.
In terms of current language experience across the sample, participants reported, on average, being exposed to the L2 for roughly one third of the day. However, there was substantial variation in this measure indicated by the high standard deviation (M = 33.82, SD = 20.67). Participants reported, on average, minimal daily exposure to an L3 (M = 2.38, SD = 5.58). A qualitative analysis of language entropy measures suggests that, overall, participants were relatively integrated in terms of their bilingualism (see Table 1 for means and standard deviations). For example, 80-20 bilingualism would be reflected in an entropy value of 0.72 (approximately the average language entropy across contexts). Again there was substantial variation across individuals within contexts, indicated by the large standard deviation. While not reported in the table, participants spanned the whole range of the entropy spectrum from completely compartmentalized (entropy: 0) to fully integrated (entropy: 1.585). There was also variation in language entropy across usage contexts, in that entropy was higher and less variable in social and work contexts relative to other contexts, with for example 65-35 bilingualism reflected in an entropy value of 0.94. We note that L2 AoA was generally not highly correlated with the entropy measures (range of Spearman rho: −0.17–0.03), though L2 exposure was (range: 0.13–0.44). See Supplemental Figure S1 (Supplementary Material) for an illustration of the bivariate correlations between variables related to language experience, variables related to language entropy, and language entropy components. The results of this qualitative analysis pattern well with the components identified in the principal components analysis above: namely, that the work context loads onto a separate component with some cross-loading from the usage context. See Table 1 (Language entropy) for descriptive statistics.
Quantitative analyses assessing the utility of language entropy as a predictor
We used multiple linear regression to predict mean L2 accent ratings and scores on the L2 abilities component as dependent variables. For each dependent measure, we fit three nested models and selected the best model via a model comparison procedure. The first model (base model) included the L2 AoA and L2 exposure as fixed effects. The second model (additive model) additionally included non-work entropy and work entropy components. The third model (interaction model) additionally included all two-way interactions with L2 AoA (i.e., L2 exposure * L2 AoA, non-work entropy * L2 AoA, and work entropy * L2 AoA). Model comparisons using chi-squared tests assessed whether the addition of entropy measures and interactions with L2 AoA significantly improved model fit. We then examined the best-fitting models to assess the direction, magnitude, and significance of each fixed-effect slope estimate. All predictors in the models were centered and standardized.
Given the relatively few individuals in the sample who reported an L2 AoA greater than 15 years, it was possible that our results could be driven by the presence of outliers or high leverage points. Consequently, we conducted additional analyses using robust linear regression with Huber weights to attenuate the influence of observations with high residuals, allowing us to minimize undue influence from outliers while maintaining the continuous nature of the L2 AoA measure without removing participants from the sample. The general pattern of results did not change under robust linear regression, suggesting that the patterns are stable.
L2 accentedness
For the L2 accentedness model, the addition of entropy components improved model fit (χ 2(2) = 56.62, p < 0.05), and the addition of two-way interactions further improved model fit (χ 2(3) = 30.47, p < 0.05). Thus, the two entropy components explained unique variance relative to L2 AoA and L2 exposureFootnote 2, and there were significant interactions between L2 AoA and other predictors (see Table 2 for model outputs).
Table 2. Model outputs for the three nested models predicting mean L2 accentedness ratings. Model comparisons indicate that model 3 (interaction model) was the best-fitting model. We report 95% confidence intervals for each point estimate in parentheses.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210119073730145-0852:S1366728919000026:S1366728919000026_tab2.png?pub-status=live)
Note: *p < 0.1; **p < 0.05; ***p < 0.01
Inspection of the interaction model (adjusted R 2 = 0.30, Intercept = 3.627, SEM = 0.067, t(499) = 54.239, p < 0.05, 95% CI [3.496, 3.758]) showed a main effect of L2 AoA (b = 0.695, SEM = 0.068, t(499) = 10.184, p < 0.05, 95% CI [0.562, 0.829]), indicating that later L2 AoA was associated with higher L2 accentedness ratings. There was a main effect of L2 exposure (b = −0.445, SEM = 0.072, t(499) = 6.138, p < 0.05, 95% CI [−0.587, −0.303]), indicating more L2 exposure was associated with lower L2 accentedness ratings. There was a main effect of non-work entropy (b = −0.323, SEM = 0.077, t(499) = 4.222, p < 0.05, 95% CI [−0.473, −0.173]), indicating that higher non-work entropy (i.e., more integration) was associated with lower L2 accentedness ratings. There was no significant main effect of work entropy (b = −0.078, SEM = 0.071, t(499) = 1.112, p > 0.05, 95% CI [−0.217, 0.060]). There was an interaction between L2 AoA and L2 exposure (b = 0.142, SEM = 0.061, t(499) = 2.338, p < 0.05, 95% CI [0.023, 0.261]), indicating that exposure effects were of greater magnitude at earlier L2 AoA. There was an interaction between L2 AoA and non-work entropy (b = 0.169, SEM = 0.072, t(499) = 2.339, p < 0.05, 95% CI [0.027, 0.310]), indicating that non-work entropy effects were of greater magnitude at earlier L2 AoA. There was an interaction between L2 AoA and work entropy (b = −0.186, SEM = 0.076, t(499) = 2.437, p < 0.05, 95% CI [−0.336, −0.036]), indicating higher work entropy was associated with lower ratings of L2 accentedness at later L2 AoA.
L2 abilities
For the L2 abilities model, the addition of entropy components improved model fit (χ 2(2) = 36.52, p < 0.05), but the addition of two-way interactions did not further improve model fit (χ 2(3) = 0.83, p > 0.05). Thus, the two entropy components explained unique variance relative to L2 AoA and L2 exposureFootnote 2 and there was no evidence for interactions with L2 AoA (see Table 3 for model outputs).
Table 3. Model outputs for the three nested models predicting scores on the L2 abilities component. Model comparisons indicate that model 2 (additive model) was the best-fitting model. We report 95% confidence intervals for each point estimate in parentheses.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210119073730145-0852:S1366728919000026:S1366728919000026_tab3.png?pub-status=live)
Note:*p < 0.1; **p < 0.05; ***p < 0.01
Inspection of the additive model (adjusted R 2 = 0.40, Intercept = 0.000, SEM = 0.035, t(502) = 0.000, p = 1.00, 95% CI [−0.067, 0.067]) showed that there was a main effect of L2 AoA (b = −0.336, SEM = 0.035, t(502) = 9.681, p < 0.05, 95% CI [−0.404, −0.268]), indicating that later L2 AoA was associated with lower L2 ability ratings. There was a main effect of L2 exposure (b = 0.359, SEM = 0.037, t(502) = 9.651, p < 0.05, 95% CI [0.286, 0.432]), indicating more L2 exposure was associated with higher L2 ability ratings. There was a main effect of non-work entropy (b = 0.257, SEM = 0.039, t(502) = 6.525, p < 0.05, 95% CI [0.180, 0.334]), indicating that higher non-work entropy (i.e., more integration) was associated with higher L2 ability ratings. There was a main effect of work entropy (b = 0.075, SEM = 0.036, t(502) = 2.061, p < 0.05, 95% CI [0.004, 0.146]), indicating that higher work entropy (i.e., more integration) was associated with higher L2 ability ratings.
Discussion
Here, we applied the concept of entropy to formalize an index of social diversity of language use that would serve as a novel indicator of individual differences in current language experience. This measure, language entropy, allows for the continuous estimation of compartmentalized to integrated language use within and across bilingual communicative contexts, with higher entropy values corresponding to more integrated language usage. We computed language entropy for various contexts for a large sample (N = 507) of bilingual/multilingual speakers of French and English drawn from the population of Montréal. Our first, qualitative analysis provides support for the idea that bilingual language use in Montréal is highly diverse, particularly in social and work contexts, in line with common knowledge and available data the Canadian census in Montréal (Statistics Canada, 2017). Next, we showed that language entropy is an important predictor for self-perceived markers of L2 success, namely self-perceived L2 accentedness and L2 abilities, in line with recent theoretical perspectives on the neurocognition of bilingualism which state that the context of language usage is an important determinant of the way in which bilinguals represent, access, and control the two languages (Abutalebi & Green, Reference Abutalebi and Green2016; Green & Abutalebi, Reference Green and Abutalebi2013).
Our qualitative analysis suggests that while our sample was homogeneous in the sense that all individuals learned their L2 during early childhood or adolescence (the majority consisting of relatively early bilinguals who acquired the two languages before the age of 15), the population was quite heterogeneous in terms of daily usage of their languages across communicative contexts. Overall, entropy values indicated a general pattern of integrated language use, but there was variation across individuals within communicative contexts, suggesting that the people in our sample differed in whether they used their languages in a compartmentalized or integrated manner. Moreover, communicative contexts were also variable in terms of language entropy, with social and work contexts exhibiting higher language entropy (i.e., diversity), indicative of more integrated language use than the other contexts, notably the home context. This finding is compatible with data from the Canadian census in Montréal (Figure 1), showing lower language diversity in home contexts than in work contexts. The principal component analysis on entropy measures for each of the five communicative contexts further suggested that there may be two components underlying the social diversity of language use (at least for our sample drawn from Montréal): first, a more global language diversity (the non-work entropy component), and second, language diversity in the work context (the work entropy component).
Interestingly, the qualitative analysis also showed that language entropy values were not highly correlated with L2 AoA. This finding suggests that regardless of whether an individual acquired their second language early or later in young adulthood, they could come to use their languages in either a compartmentalized or integrated manner. More research is needed to determine whether this is also the case for individuals who acquire a second language later in life.
Crucially, we demonstrated the utility of the social diversity of language use as estimated by language entropy as a predictor, by testing whether individual differences in the two entropy components (i.e., work and non-work) predicted self-report data for L2 accentedness and L2 abilities over and above classic predictors of language experience: L2 AoA and L2 exposure. Our models indicated that early L2 AoA and high L2 exposure were associated with decreased L2 accentedness and increased L2 abilities ratings, consistent with previous work suggesting that the amount of language exposure (whether early or otherwise) and is an important predictor of L2 success (e.g., Flege et al., Reference Flege, Munro and MacKay1995). Importantly, we also found that both language entropy components improved model fit over and above L2 AoA and L2 exposure.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210119073730145-0852:S1366728919000026:S1366728919000026_fig2.png?pub-status=live)
Fig. 2. Partial effects plots for the L2 accentedness model. A. Plot of the interaction between L2 AoA and L2 exposure. B. Plot of the interaction between L2 AoA and the general entropy component. C. Plot of the interaction between L2 AoA and the work entropy component. Confidence intervals illustrate 1 SEM.
The addition of the entropy components explained unique variance related to L2 accentedness and L2 abilities. Although the R-squared improvement was modest, the effects were significant, and they patterned in similar and sensible directions across the two models. Specifically, higher scores on the entropy components related to decreased L2 accentedness ratings and increased L2 abilities ratings, suggesting that integrated language use is associated with self-report outcome measures traditionally thought to reflect L2 success (but see below for limitations) apart from the classic measures of L2 AoA and L2 exposure. For accentedness ratings, there were further interactions between static historical language experience (L2 AoA) and indices of current experience (L2 exposure and the entropy components), suggesting that the greatest impact of ongoing language experience may depend on the specific communicative context in which the language is acquired or most frequently used. For example, early bilinguals tend to acquire both languages in the home or in school, which were the environments that were most predictive for early bilinguals in our sample (represented by scores on our general entropy component). In contrast, late bilinguals may tend to acquire and use the L2 with associates at work, a context that tends to be more linguistically diverse overall in Montréal and that was the most predictive for late bilinguals in our sample. Overall, these results provide a first step in validating language entropy as a measure that is predictive of variance related to bilingual language representation.
One might argue that although the language entropy components explain additional variance in the models presented here, the associations between the components and the outcome measures were highly similar to that of L2 exposure (i.e., more language exposure is associated with better language outcomes), obviating the calculation of more complex measures such as language entropy. In our view, the fact that language entropy patterns well with L2 exposure but explains unique variance for a large sample of participants functions to validate the measure psychometrically and in relation to other known constructs. In future studies of bilingualism and bilingual language processing that employ different methodologies (e.g., online language processing tasks, executive control tasks, etc.), language entropy may pattern differently from global L2 exposure (e.g., see predicted differences between interactional contexts in Abutalebi & Green, Reference Abutalebi and Green2016; Green & Abutalebi, Reference Green and Abutalebi2013). To the extent that language entropy is an important predictor in these future studies, this work will be foundational in providing confidence that it is the construct of relative language balance behind language entropy explaining variance and not some other confounded variable. Furthermore, although entropy required more up-front feature engineering compared to simply using L2 exposure, it may actually yield simpler model specification and interpretation. For example, in the case that language balance is an important predictor theoretically (e.g., if some experimental effect peaks at 50-50 balanced bilingualism), one could model this effect by adding an additional nonlinear effect of L2 exposure, using more degrees of freedom and potentially making interpretation less straightforward. The case becomes even more complex if the researcher is interested in language diversity among multilinguals as opposed to bilinguals. In sum, language entropy can be an efficient way to model language balance and diversity among bilinguals and multilinguals.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210119073730145-0852:S1366728919000026:S1366728919000026_fig3.png?pub-status=live)
Fig. 3. Partial effects plots for the L2 abilities model. A. Plot effect of L2 AoA. B. Plot of the effect of L2 exposure. C. Plot of the effect of the general entropy component D. Plot of the effect of the work entropy component. Confidence intervals illustrate 1 SEM.
Of note, we wish to be careful in associating either of the outcome measures here (L2 accentedness and L2 abilities) strongly with objective or true language proficiency for at least two reasons. First, the way in which we assessed L2 accentedness and abilities was through self-report, a method that has been scrutinized recently because self-perceptions can be tainted by often stigmatized aspects of language usage (Gollan et al., Reference Gollan, Weissberger, Runnqvist, Montoya and Cera2012; Surrain & Luk, Reference Surrain and Luk2017; Tomoschuk et al., Reference Tomoschuk, Ferreira and Gollan2018). Second, even when these factors are measured objectively, through for example a language production task, they can be subject to a variety of psycholinguistic and sociolinguistic influences that may be unrelated to language proficiency per se. For example, phonetic productions are subject to cross-language competition (e.g., Goldrick, Runnqvist & Costa, Reference Goldrick, Runnqvist and Costa2014) and phonetic convergence (e.g., Pardo, Reference Pardo2006). That being said, the fact that the results pattern well with studies with arguably more objective data (i.e., accent ratings made on production data by research assistants; e.g., Flege et al., Reference Flege, Munro and MacKay1995) suggests some validity in the self-report measure in tapping into the construct with our sample of speakers. Crucially, we would argue that the language experience variables that we included as predictors here (e.g., L2 AoA, L2 exposure, and language use in various contexts) are less likely to be subject to the limitations associated with self-report, as they constitute self-report measures about relatively objective facts about language use, such onset and amount of language use, without tapping into subjective, self-evaluative feelings about one's own language ability relative to other speakers. Moreover, we have recently demonstrated that language entropy is related to functional brain organization and executive control abilities within a similar sample of bilingual speakers (Gullifer et al., Reference Gullifer, Chai, Whitford, Pivneva, Baum, Klein and Titone2018), providing further support for the efficacy of these variables as predictors of objective outcome measures.
A limitation of the present results that influences generalizability is related to the information that we did not elicit on the language history questionnaire, the lack of which could bias our estimates of language entropy. For example, estimates of entropy here could be biased depending on the amount of time an individual spends in each communicative context. if a hypothetical participant spends most of their daily hours in one integrated context and a minority of their time in several other compartmentalized contexts, then an aggregate measure of entropy for that participant (e.g., their component score across multiple contexts) would underestimate the extent to which that participant's entropy score reflects their true integratedness. When the amount of time spent in each communicative context is available, this information could be used to weight entropy scores accordingly. Moreover, we did not elicit information about code-switching behavior, another interactional context predicted to be of critical importance by instantiations of the adaptive control hypothesis (Abutalebi & Green, Reference Abutalebi and Green2016; Green & Abutalebi, Reference Green and Abutalebi2013; Green & Wei, Reference Green and Wei2014). Nor did we elicit information about translation experience, language brokering, or other important communicative contexts that may come to influence language and executive control (e.g., Dong & Liu, Reference Dong and Liu2016; Dong & Zhong, Reference Dong and Zhong2017; López & Vaid, Reference López and Vaid2018). Thus, at this stage more work is needed to determine the generalizability this measure is to other populations of interest. Crucially, entropy is a general measure that provides a measure of information related to states in a system and it could be easily adapted to apply to other communicative contexts/language usage environments where questions of diversity are at issue. Our recommendation is that future work should investigate these issues by eliciting information about contexts that are relevant to the population(s) of interest (e.g., language brokering for heritage populations) to ensure accurate estimates for that population.
An alternative explanation for the importance of the entropy as a predictor is that it accounts for unique variance simply because it incorporates information about L3 usage (a substantial portion of the sample reported knowledge of an L3), whereas L2 exposure and L2 AoA do not. To test this hypothesis, we conducted additional model comparisons on a set of models that included a measure of non-L1 exposure (i.e., the sum of L2 and L3 exposure), and the pattern of results remained highly similar to those reported above. Specifically, for the L2 accentedness model, the model with the interactions remained the best-fitting model (χ 2(3) = 29.63, p < 0.05) and the direction and significance of the estimates of that model were the same as reported above. For the L2 abilities model, the model with additive effects of the entropy components remained the best-fitting model (χ 2(2) = 26.17, p < 0.05). The direction of all estimates were the same as reported above, though slope of the work entropy component was no longer significantly different from 0 (b = 0.053, SEM = 0.037, t(502) = 1.417, p = 0.16, 95% CI [−0.020, 0.125]), suggesting that some variance in this component was accounted for by the inclusion of L3 exposure for this model. In sum, the importance of the non-work entropy component in both models and the work entropy component in the L2 accentedness model cannot be completely attributed to the mere fact that they incorporate additional information about L3 usage.
Instead, language entropy provides information about the relative balance in language usage. Alternatively, one could think of language entropy as providing information related to the number of different “language states”, or specific configurations of language use, that a bilingual might find themselves in over the course of daily life. Compartmentalized bilinguals with low language entropy across their communicative contexts experience few unique language states and may be relatively certain about which particular language will occur given the communicative context. In contrast, integrated bilinguals with high language entropy across their communicative contexts experience a greater number of language states and may have decreased certainty about when a particular language will occur at any given time.
Thus, throughout daily life, compartmentalized and integrated bilinguals (and those in between) may experience different rates of competition between their languages and may come to control that competition between their languages in different ways. For example, while compartmentalized bilinguals may experience competition between their languages, they may become adept at inhibitory control to suppress competition from the irrelevant language. By contrast, highly integrated bilinguals may actually benefit from cross-language activation, allowing them to flexibly switch between their different languages as the context demands; and indeed suppressing a language may lead to disfluency if that language is required. These are general predictions made by the Adaptive Control Hypothesis (Abutalebi & Green, Reference Abutalebi and Green2016; Green & Abutalebi, Reference Green and Abutalebi2013) which posits that an individual's cognitive system in charge of language representation and control adapts to meet the demands of one's interactional context (i.e., situations which reflect particular patterns of use such as contexts that involve the use of two languages vs. one language). Thus, the social diversity of language use, characterized using entropy, appears to be a fruitful way to measure interactional context among individuals.
The present data were collected on a large sample of bilingual/multilingual individuals living in a diverse linguistic environment. An important question is the extent to which language entropy generalizes to other populations of bilinguals and monolinguals. In our view language entropy is a general measure of diversity that should nicely characterize speakers in other locales as well to the extent that critical information about language usage in relevant communicative contexts is available for or can be elicited from these individuals in a non-stigmatized manner (e.g., for individuals living in locations where bilingualism is not valued). Thus, future research should investigate the extent to which groups of speakers who live in locales with different interactional contexts (e.g., primarily single language contexts vs. dual language contexts) exhibit differences in language entropy.
In order to facilitate future research and compute language entropy, researchers can use the methods available in the languageEntropy package (Gullifer & Titone, Reference Gullifer and Titone2018). The package includes instructions, examples, and help files for each of the provided functions. We offer a final pragmatic note to researchers who might try using language entropy to index language diversity. It would be fruitful to investigate the correlations between entropy measures for different contexts for the sample in question. In our sample, the majority of the entropy measures in each context patterned well together – with the exception of work entropy, yielding two principal components. However, this may not be the case with every sample, and we would caution researchers against applying the same criteria on a different sample from a different location.
Characterizing and quantifying individual differences related to language entropy may have implications for debates within the field of bilingualism, such as whether bilingual (relative to monolingual) experience leads to changes compared to monolinguals in general cognitive capacities (Bialystok, Craik, Klein & Viswanathan, Reference Bialystok, Craik, Klein and Viswanathan2004; de Bruin, Treccani & Sala, Reference de Bruin, Treccani and Sala2015; Paap, Johnson & Sawi, Reference Paap, Johnson and Sawi2015; Takahesu Tabori et al., Reference Takahesu Tabori, Mech and Atagi2018; Titone et al., Reference Titone, Gullifer, Subramaniapillai, Rajah, Baum and Bialystok2017). Many studies weighing in on either side of the debate rely upon comparisons between groups of speakers, and often fail to acknowledge the ways in which the bilingual populations sampled from around the globe differ from one another, including along the dimension of social diversity of language use (e.g., Baum & Titone, Reference Baum and Titone2014; Gullifer et al., Reference Gullifer, Chai, Whitford, Pivneva, Baum, Klein and Titone2018; Titone et al., Reference Titone, Gullifer, Subramaniapillai, Rajah, Baum and Bialystok2017). Thus, language entropy may provide a means to more accurately characterize bilingual populations when used in addition to other classic measures like L2 AoA and L2 exposure. A final point is that even less attention is dedicated to the ways in which monolinguals may differ from one another. Monolinguals can also exhibit diversity in language usage and exposure in the form of register switching or to the extent that they ambiently hear other languages that they do not speak. Language entropy may be fruitful in investigating diversity within monolingual populations as well. Thus, social diversity measures, such as entropy, that characterize language experience in more nuanced and realistic ways may be crucial for clarifying inconsistent evidence pertinent to such ongoing debates.
Author ORCIDs
Jason W. Gullifer, 0000-0003-2959-9072
Supplementary Material
For supplementary material accompanying this paper, visit https://doi.org/10.1017/S1366728919000026
Acknowledgements
This work was supported by the Natural Sciences and Engineering Research Council of Canada (individual Discovery Grant, 264146 to Titone); the Social Sciences and Humanities Research Council of Canada (Insight Development Grant, 01037 to Gullifer & Titone); the National Institutes of Health (Postdoctoral training grant, F32-HD082983 to Gullifer, Titone, and Klein); the Centre for Research on Brain, Language & Music.
The authors thank Ashley M. DaSilva, Denise Klein, Pauline Palma, Mehrgol Tiv, Janet Van Hell, Robert D. Vincent, Naomi Vingron, and Junyan Wei for fruitful discussions regarding (language) entropy.