Hostname: page-component-745bb68f8f-lrblm Total loading time: 0 Render date: 2025-02-11T10:16:17.403Z Has data issue: false hasContentIssue false

Majority language vocabulary and nonword repetition skills in children attending minority language immersion education

Published online by Cambridge University Press:  14 October 2022

Vasiliki Chondrogianni*
Affiliation:
School of Philosophy, Psychology and Language Sciences, University of Edinburgh, Edinburgh, Scotland, United Kingdom
Florence Judge-Clayden
Affiliation:
School of Philosophy, Psychology and Language Sciences, University of Edinburgh, Edinburgh, Scotland, United Kingdom
Morna Butcher
Affiliation:
NHS Ayrshire and Arran, Glasgow, Scotland, United Kingdom
*
*Corresponding author. Email: v.chondrogianni@ed.ac.uk
Rights & Permissions [Opens in a new window]

Abstract

The present study examined nonword repetition (NWR) and comprehension/production of single-word vocabulary in the majority language (English) in six- to eight-year-old English-Gaelic emergent bilingual children attending Gaelic-medium primary education (GMPE) (primary years 2 and 3). GMPE is an immersion education model found in Scotland where the minority language, Gaelic, is the language of instruction, whereas English is the majority community language, not supported in school in the first three years of primary schooling. All children spoke English as their first language (L1) and had varying exposure to the second language (L2) Gaelic outside the school setting. We investigated how individual factors, such as age and frequency of exposure, and psycholinguistic factors, such as task modality, word class (for vocabulary), frequency, length, and prosody (for NWR), influenced performance. Later exposure to Gaelic and higher nonverbal IQ were contributing predictors of performance for English vocabulary. For NWR, participants with earlier and longer exposure to Gaelic who also used both languages at home more performed significantly better on this task. These results suggest that minority language immersion education differentially affects majority language abilities; later L2 onset benefited L1 vocabulary learning, whereas earlier L2 onset boosted L1 NWR skills in these bilinguals.

Type
Original Article
Copyright
© The Author(s), 2022. Published by Cambridge University Press

Although education in a majority second language (L2) is a necessity for children who speak a minority language (e.g., children from an immigrant background growing up in a country where the minority language is not the language of education or the wider community), for children speaking a majority language (e.g., English in the UK), attending educational programs in an L2 that is a minority language in the community is an educational choice. Generalizing across various immersion education programs, children in minority language immersion programs constitute a distinct type of bilingual children not least because teaching of the majority language within the school context is usually suspended, especially in the early years (usually the first three years of primary school), while minority language use is prioritized to promote its revitalization and maintenance (Paradis et al., Reference Paradis, Genesee and Crago2011).

A question that arises for majority language children attending minority language immersion education is how the majority language that is unsupported during the early primary school years develops during that time. A breadth of studies across different countries seems to suggest that majority language children who participate in immersion education acquire typical levels of native language development by the end of primary schooling (Bialystok, Reference Bialystok2018; Paradis et al., Reference Paradis, Genesee and Crago2011 for overviews). However (at least), two aspects of bilingual development remain particularly less well understood in an immersion education context. First, most studies focusing on the comparison between majority language students in immersion education and their monolingual peers in mainstream education have used standardized language assessments. This does not allow us to investigate the psycholinguistic and individual factors that may affect language development in majority language children in minority immersion education. This is particularly important for emergent bilingual children in the early years, when the majority language is not supported yet in the school setting (Montanari, Reference Montanari2014; Padilla et al., Reference Padilla, Fan, Xu and Silva2013; Paradis et al., Reference Paradis, Genesee and Crago2011). Second, vocabulary and nonword repetition (NWR), albeit related, have been shown to be differentially affected by bilingualism-related factors, such as the age (AoE) and length of exposure (LoE) to a particular language (Chondrogianni, Reference Chondrogianni, Miller, Bayram, Rothman and Serratrice2018; Golberg et al., Reference Golberg, Paradis and Crago2008; Paradis, Reference Paradis2011). Although both factors influence language outcomes in bilinguals, vocabulary has been shown to be more susceptible to exposure effects than NWR for sequential bilingual children, when tested in their weaker language (Chondrogianni & Marinis, Reference Chondrogianni and Marinis2011; Oller et al., Reference Oller, Pearson and Cobo-Lewis2007; Paradis et al., Reference Paradis, Genesee and Crago2011). How exposure factors affect majority language outcomes in emergent bilingual children attending minority L2 immersion education remains controversial with mixed results (V. C. Gathercole & Thomas, Reference Gathercole and Thomas2009; Rhys & Thomas, Reference Rhys and Thomas2013).

Vocabulary and NWR are not only related (S. E. Gathercole et al., Reference Gathercole, Willis, Emslie and Baddeley1992; Masoura & Gathercole, Reference Masoura and Gathercole1999) but are also good predictors of academic ability (S. E. Gathercole et al., Reference Gathercole, Willis, Emslie and Baddeley1992). Vocabulary knowledge is pivotal for young children’s language development and an important predictor of later academic skills, in particular early literacy and reading (Duff et al., Reference Duff, Reen, Plunkett and Nation2015). NWR as an index of phonological short-term memory (STM) (S. E. Gathercole & Baddeley, Reference Gathercole and Baddeley1990) and also of sublexical representations, such as phonotactics and syllable structure (Jones et al., Reference Jones, Gobet and Pine2007), refers to the individual’s ability to temporarily retain phonemes in the working memory in the correct order while creating a motor plan to articulate auditory information. This ability is not only predictive of the individual’s ability to learn new words (S. E. Gathercole et al., Reference Gathercole, Willis, Emslie and Baddeley1992) but also the more words an individual knows, the better they are at producing nonwords by making use of lexical and sublexical information (Szewczyk et al., Reference Szewczyk, Marecka, Chiat and Wodniecka2018). For that reason, NWR correlates with various aspects of grammatical and vocabulary development, which in turn predict academic attainment (e.g., Gathercole et al., Reference Gathercole, Willis, Emslie and Baddeley1992).

In studies with monolingual children and, increasingly, with bilingual children, assessing vocabulary comprehension and production in the majority first (L1) or second language (L2) in mainstream education, psycholinguistic factors such as the class a word belongs to (noun or verb), or when it is estimated to be acquired, also known as the subjective estimation of the age of acquisition of a word (AoA), have been shown to influence language development (Haman et al., Reference Haman, Łuniewska, Pomiechowska, Armon-Lotem, de Jong and Meir2015; Hansen et al., Reference Hansen, Simonsen, Łuniewska and Haman2017; van Wonderen & Unsworth, Reference van Wonderen and Unsworth2020). Similarly, monolingual and bilingual children’s ability to correctly repeat nonwords is affected by how many phonological neighbors a nonword has, and hence, how likely it is to be phonologically similar to a real word, its prosody, and phonotactic probability (Chiat, Reference Chiat, Armon-Lotem, de Jong and Meir2015; Chiat & Polišenská, Reference Chiat and Polišenská2016; Szewczyk et al., Reference Szewczyk, Marecka, Chiat and Wodniecka2018). However, in existing studies the focus has been on the majority L1 (for monolingual children) or L2 (for sequential bilingual children) which are supported within the school context. Whether and how these factors interact in the same way and to the same degree in the majority L1 of bilingual children attending L2 minority immersion education remains open to investigation.

In Scotland, immersion education has been instantiated in the form of Gaelic-medium education (GME). GME is a type of immersion education that aims to revitalize Scottish Gaelic (Gaelic from this point onwards), one of the heritage languages of the country, by increasing its use among younger generations through Gaelic immersion education. In the present study, we investigated whether individual factors, such as the child’s current age, and exposure factors, such as the age when they started learning Gaelic and their degree of exposure to the two languages, affect the development of the majority L1 (English) vocabulary and NWR skills of pupils attending minority L2 (Gaelic) education in the early primary school years. We also investigated how psycholinguistic, item-level factors, such as the class (noun vs verb) a word belongs to and its estimated AoA (for vocabulary), the number of syllables, phonotactic structure, and prosody of a nonword affect performance on vocabulary and NWR, respectively. By focusing on the early years of immersion education, what is called “early total immersion” (Genesee, Reference Genesee2006), we were able to investigate how these language and academic-related skills develop in majority language pupils and how they are modulated by child- and item-specific factors in these emergent bilingual children (Paradis et al., Reference Paradis, Genesee and Crago2011).

Majority language abilities of children in immersion education

There is a broad consensus in the literature that the majority language skills of children in minority language immersion education are generally on a par with the language skills of children in mainstream majority language education by the time they reach later primary school years (Bialystok, Reference Bialystok2018; Paradis et al., Reference Paradis, Genesee and Crago2011 for reviews of relevant studies). This conclusion seems to hold regardless of cultural context or linguistic distance between the languages in immersion programs (Montanari, Reference Montanari2014; Padilla et al., Reference Padilla, Fan, Xu and Silva2013; Paradis et al., Reference Paradis, Genesee and Crago2011). This is attributed to three main reasons. First, the exposure that majority language students receive in the community dominant language seems to offset any effects of lack of educational support in immersion education, at least in the early years of immersion education (Paradis et al., Reference Paradis, Genesee and Crago2011). Second, the support the majority language receives in the later years of immersion education (e.g., after year 3 in primary education) compensates for any early lag (Genesee, Reference Genesee2006). Finally, academic and literacy skills developed in one language, even the minority L2, seem to transfer rather quickly and boost performance in the majority language (Cummins, Reference Cummins2000).

However, the bilingualism-related variables that may influence majority language outcomes in immersion education, such as age and length of exposure to the minority language through immersion education, are less well understood. The timing of L2 immersion has been shown to affect language outcomes in both the majority and the minority language. Studies with L1 English–L2 French-speaking children in the early years of French-Canadian immersion education have shown that those with an early L2 French immersion lag behind their monolingual English-speaking peers in mainstream majority language education on their L1 English performance on standardized assessments targeting literacy and mathematics compared to children with later L2 immersion, although performance improves with age (Genesee, Reference Genesee2006). Conversely, minority language immersion in the preschool and early primary school years leads to better performance in the minority language compared to late immersion, for example, at the beginning of secondary school (Paradis et al., Reference Paradis, Genesee and Crago2011), including immersion education for revitalization, such as in the case of Irish (Ó Muircheartaigh & Hickey, Reference Ó Muircheartaigh and Hickey2008).

In Scotland, GME is an immersion model with its main aim the revitalization of Gaelic, a minority language of the Celtic language family with an official status, since the Gaelic Language Act was introduced by the Scottish government in 2005, but under immense pressure from the majority, societal language, English. According to the most recent Census (2011), only 1.1% of the Scottish population can speak Gaelic (approx. 58,000 speakers out of the total population of 87,000 who are able to speak/read/understand Gaelic), but there was an increase between 2001 and 2011 in the proportion of young people who speak the language (National Records of Scotland, 2015, p. 9), mostly boosted by GME provision. GME is an immersion model found in Scotland that spans across preschool, primary, and secondary education and targets the acquisition of both Gaelic and English with the view to make children fully bilingual by the time they enter secondary education. Gaelic is prioritized in the first three years (P1-P3) of Gaelic-medium primary education (GMPE), and English is introduced slowly in lessons from P4 (O’Hanlon et al., Reference O’Hanlon, Paterson and McLeod2012). Pupils entering GMPE come from a variety of backgrounds. Most pupils come from families with no Gaelic at home and are immersed in Gaelic at school (Stephen et al., Reference Stephen, McPake, McLeod, Pollock and Carroll2010). A small proportion of parents (approximately 18%) are native speakers of Gaelic (O’Hanlon et al., Reference O’Hanlon, Paterson and McLeod2012). Nursery provision may or may not be attached to school(s) in areas that offer GMPE, and when it is, it may not be offered full-time. As a result, pupils enter primary schools with mixed prior experience of formal instruction in Gaelic. The combination of its minority status along with its use in primarily educational contexts places GME more on a par with other immersion programs where language revitalization through education is the main vehicle of this effort (as in some indigenous language immersion programs (Comajoan-Colomé & Coronel-Molina, Reference Comajoan-Colomé and Coronel-Molina2021)), rather than immersion programs where the minority language is a majority language elsewhere, and potentially spoken by millions of speakers (Baker, Reference Baker and Kaplan2010). An example of the latter would be the case of French immersion programs in Canada, with French being considered a minority in some Canadian provinces, but it is spoken by millions of people in France and globally.

Two studies to date have investigated the majority language (English) abilities of pupils in GMPE (Johnstone et al., Reference Johnstone, Harlen, MacNeil, Stradling and Thorpe1999; O’Hanlon et al., Reference O’Hanlon, Paterson and McLeod2013). Both studies investigated children in mid-primary school grades (P5-P7), that is children aged between 9 and 11 years old, using results from state-wide standardized assessments on science, mathematics, and English literacy abilities. Their findings converge that pupils in GMPE performed similarly to (mathematics or science), or in some cases outperformed (reading) their peers in English mainstream education. These results are in line with existing findings from the international literature on immersion vs. monolingual education provision, which suggest that children in later years in immersion education achieve comparable performance in their majority language (e.g., see Paradis et al., Reference Paradis, Genesee and Crago2011).

Given that both studies, however, focused on children in mid-primary education (children aged between 9 and 11 years old), we currently know little about how specific language areas of children in GMPE develop, and which individual factors modulate this development in the early years of primary school, that is, the first three years of schooling. To address this gap, in the present study, we tested children in what is called the first level of primary school in Scotland, namely P2 and P3 (aged between 6 and 8 years old), to examine how children’s majority language abilities develop in the early primary years, before the majority language is introduced into the curriculum, and once they have had at least one year of instruction in GMPE. We also evaluated how frequency and exposure to English and Gaelic outside the school context to Gaelic currently and before they started primary school affected performance on two specific linguistic domains that are predictors of linguistic and educational attainment, that is vocabulary knowledge and NWR. In the next section, we review how degree, length, and age of exposure to the two languages have been shown to influence the two language areas examined in the present study, namely vocabulary and NWR.

Linguistic and item-specific variables affecting vocabulary and NWR in bilingual children

Vocabulary knowledge encompasses the words the child can comprehend (receptive vocabulary) and produce (expressive vocabulary). Previous studies with children in majority language mainstream education have shown that both monolingual and L1 minority–L2 majority bilingual children performed better on receptive (comprehension) than expressive (production) tasks, but bilinguals had a larger gap between the two tasks compared to monolingual children (Gibson et al., Reference Gibson, Oller, Jarmulonowicz and Ethington2012; Kaushanskaya & Marian, Reference Kaushanskaya and Marian2009; Oller et al., Reference Oller, Pearson and Cobo-Lewis2007). Children’s better performance on receptive compared with expressive tasks has been attributed to the different psycholinguistic processes involved in comprehending versus producing a word (Levelt, Reference Levelt2001; Yan & Nicoladis, Reference Yan and Nicoladis2009). Although accessing semantic or phonological representations may be sufficient for success on receptive tasks, in an expressive task, participants are not only required to retrieve the correct lexical item but also articulate it, rendering production more demanding than comprehension (Levelt, Reference Levelt2001; Yan & Nicoladis, Reference Yan and Nicoladis2009). In the case of sequential bilingual children assessed in their L2, the receptive-expressive discrepancy has been attributed to the weak or underspecified semantic or phonological representations that may characterize bilingual children’s lexicon due to reduced language exposure, also known as the weaker links hypothesis (Gollan et al., Reference Gollan, Montoya, Cera and Sandoval2008). Reduced language exposure may impact on how strong the links are between lexical access and semantic or phonological representations. This, in turn, may impact on bilingual children’s ability to access words in the lexicon during comprehension and, subsequently, produce them (Gibson et al., Reference Gibson, Peña, Bedore and McCarter2020). When exposure to a language is reduced, as is possible in a bilingual context, the discrepancy between receptive and expressive abilities may also increase (Gibson et al., Reference Gibson, Peña, Bedore and McCarter2020).

Knowing a word also entails knowing the word class it belongs to, that is, whether a word is a noun or a verb. Nouns have been reported as earlier acquired compared to verbs (Haman et al., Reference Haman, Łuniewska, Pomiechowska, Armon-Lotem, de Jong and Meir2015; Łuniewska et al., Reference Łuniewska, Wodniecka, Miller, Smolík, Butcher, Chondrogianni, Hreich, Messarra, Razak, Treffers-Daller, Yap, Abboud, Talebi, Gureghian, Tuller and Haman2019) and children learn nouns more easily and have larger vocabulary sizes for nouns than verbs (Altman et al., Reference Altman, Goldstein and Armon-Lotem2017; Haman, Wodniecka, et al., Reference Haman, Wodniecka, Marecka, Szewczyk, Białecka-Pikul, Otwinowska, Mieszkowska, Łuniewska, Kołak, Miękisz, Kacprzak, Banasik and Foryś-Nogala2017). In tasks usually administered to preschool or school-aged children, nouns depict concrete and imageable objects, which are perceptually easier to learn and identify than verbs, which require understanding events or actions and a relationship between participating entities (McDonough et al., Reference McDonough, Song, Hirsh-Pasek, Golinkoff and Lannon2011).

Vocabulary learning is also influenced by the age at which a word is acquired (AoA). Although AoA can be measured in different ways by looking for example at child corpora (Juhasz, Reference Juhasz2005), AoA can also be estimated using subjective, native speakers’ reports of when they thought they learnt a particular word (Haman et al., Reference Haman, Łuniewska, Pomiechowska, Armon-Lotem, de Jong and Meir2015). This measure, despite its subjectivity, has been found to consistently and reliably correlate with parental reports on young children’s lexical development across speakers and languages from a wide typological spectrum, regardless of their majority or minority status (Haman et al., Reference Haman, Łuniewska, Pomiechowska, Armon-Lotem, de Jong and Meir2015; Łuniewska et al., Reference Łuniewska, Wodniecka, Miller, Smolík, Butcher, Chondrogianni, Hreich, Messarra, Razak, Treffers-Daller, Yap, Abboud, Talebi, Gureghian, Tuller and Haman2019). This is because subjective AoA indirectly measures how long a child has been using a word and how entrenched the use of a word might be. The earlier a word is acquired the more experience the speaker has with using this word in various contexts and the easier it is to access it and entails that a child has had more time to practice with (comprehend and produce) a particular word. This may explain why AoA has been found to very significantly predict accuracy on nouns and verbs in expressive and receptive vocabulary tasks similar to the ones used in the present study with bilingual children, with the earlier the AoA, the higher the accuracy on the word (Altman et al., Reference Altman, Goldstein and Armon-Lotem2017; van Wonderen & Unsworth, Reference van Wonderen and Unsworth2020). However, Haman et al. (Reference Haman, Łuniewska, Hansen, Simonsen, Chiat, Bjekić, Blažienė, Chyl, Dabašinskienė, Engel de Abreu, Gagarina, Gavarró, Håkansson, Harel, Holm, Kapalková, Kunnari, Levorato, Lindgren and Armon-Lotem2017) found no significant correlations between subjective AoA and expressive vocabulary in the 5- to 6-year-old British English monolingual children in their study. AoA only correlated with expressive vocabulary skills with verbs but not with nouns. This may have been because the monolingual children in their study had ceiling accuracy on the comprehension tasks.

NWR tasks require the participant to store, manipulate, and repeat phonological information of items that are not real words and which participants have not previously heard. As such, they assess phonological STM on top of phonological processing and articulatory skills (S. E. Gathercole & Baddeley, Reference Gathercole and Baddeley1990). At the same time, NWR tasks tap into an individual’s word knowledge, as well as knowledge of the language-specific (LS), sublexical properties of words, such as word syllable structure, phonotactics, and prosody (stress) patterns (Chiat, Reference Chiat, Armon-Lotem, de Jong and Meir2015; Szewczyk et al., Reference Szewczyk, Marecka, Chiat and Wodniecka2018). Because repetition of nonwords requires children to learn a new, yet made-up, lexical item, factors that affect how children learn native words may affect performance on the NWR task. Children learn language on many levels, from statistical information about frequency of phonemes or combinations of phonemes, to whole words and syntax (Saffran & Wilson, Reference Saffran and Wilson2003; Sundara et al., Reference Sundara, Polka and Molnar2008). Specifically, sublexical representations containing phoneme combinations occurring more or less frequently in a language may support children’s NWR skills (S. E. Gathercole et al., Reference Gathercole, Willis, Emslie and Baddeley1992; Masoura & Gathercole, Reference Masoura and Gathercole1999). This level of phonological knowledge has been proposed by theories of phonological knowledge, such as the EPAM-VOC and CLASSIC, that assume that the presentation of nonwords activates sequences of phonemes in the long-term memory that are present in the nonword (e.g., see (Jones et al., Reference Jones, Gobet and Pine2007). This explains why young children are sensitive to a sublexical property, such as the probability with which syllables within a word co-occur, known as transitional probability, and why this helps to identify words from a stream of spoken language (Cutler & Carter, Reference Cutler and Carter1987; Thiessen & Saffran, Reference Thiessen and Saffran2003), including in bilingual contexts (Sundara et al., Reference Sundara, Polka and Molnar2008). Ngram frequency is another sublexical index similar to transitional probability but exceeds transitional probability in that it reflects how often phonemes, not just syllables, are found together in different-sized combinations in real words (Szewczyk et al., Reference Szewczyk, Marecka, Chiat and Wodniecka2018). Because phonemic Ngrams measure phonotactic probability taking into consideration the frequency of phoneme sequences at many different lengths, they are much better at capturing subtle lexical and sublexical properties. For that reason, Ngram frequency has been shown to be a strong predictor for higher NWR accuracy beyond more qualitative ratings, such as word-likeness (Munson et al., Reference Munson, Kurtz and Windsor2005) and when consonant cluster complexity is reduced (Szewczyk et al., Reference Szewczyk, Marecka, Chiat and Wodniecka2018). Consonant cluster complexity is another phonological factor that has been shown to determine children’s performance on NWR tasks, with nonwords containing more consonant clusters leading to lower accuracy than nonwords with fewer consonant clusters (Marshall & van der Lely, Reference Marshall and van der Lely2009; Szewczyk et al., Reference Szewczyk, Marecka, Chiat and Wodniecka2018). This is because consonant clusters are phonologically complex, and producing consonant clusters requires a range of complex and rapidly changing oromotor movement than Consonant-Vowel (CV) sequences (MacNeilage & Davis, Reference MacNeilage and Davis2000).

Prosody is a sublexical property that refers to language-specific stress patterns and has been shown to affect word learning and also performance on NWR tasks, with children preferring both words and nonwords with language-specific prosody (Chiat, Reference Chiat, Armon-Lotem, de Jong and Meir2015). For example, children acquiring English show a preference for words and nonwords with trochaic foot structure, in which a stressed (strong) syllable is followed by an unstressed (weak) syllable (SW), a prosodic pattern found in 90% of bisyllabic English words (Roy & Chiat, Reference Roy and Chiat2004). In a study targeting prosody, five- to eight-year-old monolingual children performed better on a task with typical prosody of the target language than on a task with evenly-stressed nonwords (Archibald et al., Reference Archibald, Gathercole and Joanisse2009; Archibald & Gathercole, Reference Archibald and Gathercole2007). Furthermore, children had higher whole-item accuracy and syllable number accuracy on words with typical prosody (first-syllable stress) compared to with atypical prosody (second-syllable stress) (Dollaghan et al., Reference Dollaghan, Biber and Campbell1993; Roy & Chiat, Reference Roy and Chiat2004).

Finally, in the view of NWR as an index of phonological STM (S. E. Gathercole & Baddeley, Reference Gathercole and Baddeley1990), word length, as measured by the number of syllables in a word, is the most commonly reported predictor variable in NWR tasks. This is because phonological STM load increases as the number of syllables increases (S. E. Gathercole & Baddeley, Reference Gathercole and Baddeley1990; Szewczyk et al., Reference Szewczyk, Marecka, Chiat and Wodniecka2018). In previous studies, as the number of syllables increased from two to five, NWR accuracy decreased in both monolingual and bilinguals (Boerma et al., Reference Boerma, Leseman, Wijnen and Blom2017; Chiat, Reference Chiat, Armon-Lotem, de Jong and Meir2015; dos Santos & Ferré, Reference dos Santos and Ferré2018; Gathercole et al., Reference Gathercole, Willis, Emslie and Baddeley1992; Thordardottir & Brandeker, Reference Thordardottir and Brandeker2013).

In the present study, we examined how emergent bilingual children’s performance on the comprehension and production of single-word vocabulary is influenced by word class (nouns vs verbs) and subjective AoA, as estimated by native speakers of English. We also examined how performance on NWR tasks targeting quasi-universal (QU) and language-specific properties is influenced by sublexical properties, such as Ngram frequency, transitional probability, and prosody, by phonological STM, measured by the number of syllables and consonants in the nonword, as well as by phonological or articulatory complexity, measured by presence of consonant clusters in nonwords.

Relationship between vocabulary and NWR

Whether the sublexical processing and memory skills implicated during a NWR task are driving forces behind vocabulary growth or the result of a larger vocabulary is highly debated in the literature (Metsala, Reference Metsala1999; Verhagen et al., Reference Verhagen, Boom, Mulder, de Bree and Leseman2019), and the direction of their relationship seems to change as a function of age (Rispens & Baker, Reference Rispens and Baker2012). NWR can be a predictor of L1 lexical acquisition in the early years (Gathercole et al., Reference Gathercole, Willis, Emslie and Baddeley1992). As children get older, however, the strength of the relationship between NWR and vocabulary development seems to decline. The strong relationship between NWR and vocabulary development seems to fade away after the first school year and other cognitive skills, with long-term memory (LTM) playing a more important role in later L1 vocabulary development. This is because older monolingual children can rely on lexical and sublexical knowledge available in their LTM. This knowledge supports then STM and facilitates vocabulary learning thanks to access to this lexical and sublexical information (Archibald & Gathercole, Reference Archibald and Gathercole2006; Metsala, Reference Metsala1999; Verhagen et al., Reference Verhagen, Boom, Mulder, de Bree and Leseman2019). Although a NWR task as a measure of phonological STM abilities does not rely on lexical materials and minimizes the influence of long-term lexical representations, it engages other sublexical representations. With more language exposure, familiarity with more frequent combinations of phonemes increases, so larger chunks of phonemes are learnt. This means that learning new words or nonwords is easier because the working memory is maintaining fewer, longer phoneme chunks (Jones, Reference Jones2016). In previous research, differences seen in NWR task performance between phoneme chunks with high and low probability were best predicted by vocabulary size, showing that vocabulary size may reflect lexical and sublexical knowledge (Munson et al., Reference Munson, Kurtz and Windsor2005; Szewczyk et al., Reference Szewczyk, Marecka, Chiat and Wodniecka2018).

This relationship also may explain why phonological skills both improve and are improved by vocabulary size in bilingual children (Masoura & Gathercole, Reference Masoura and Gathercole1999; Paradis, Reference Paradis2011). Larger receptive vocabulary in both monolingual and bilingual children significantly predicted their ability to learn real and nonwords in the target language (both the L1 and the L2) (Kan & Kohnert, Reference Kan and Kohnert2012; Roy & Chiat, Reference Roy and Chiat2004; Sorenson Duncan & Paradis, Reference Sorenson Duncan and Paradis2016; Szewczyk et al., Reference Szewczyk, Marecka, Chiat and Wodniecka2018). Earlier and longer exposure to the language tested may explain this connection, as they improve both vocabulary and NWR (Antonijevic-Elliott et al., Reference Antonijevic-Elliott, Lyons, O’ Malley, Meir, Haman, Banasik, Carroll, McMenamin, Rodden and Fitzmaurice2020; Gathercole et al., Reference Gathercole, Thomas, Roberts, Hughes, Hughes and Gathercole2013; Sorenson Duncan & Paradis, Reference Sorenson Duncan and Paradis2016). In a study relevant for our purposes, Nicolay and Poncelet (Reference Nicolay and Poncelet2013) reported that NWR is differentially related to minority L2 expressive and receptive vocabulary in the first three years of French immersion primary education in Canada. Namely, the link between NWR and L2 expressive vocabulary was stronger and lasted longer than the link between NWR and L2 receptive vocabulary. Finally, bilingual children with the same L1 vocabulary sizes as monolingual children were significantly better at learning nonwords and their meanings than monolingual children, suggesting that there may be a bilingual effect for nonword learning. This may point to bilingual children being more efficient when encoding phonologically unfamiliar information (Kaushanskaya & Marian, Reference Kaushanskaya and Marian2009).

Individual differences and development of vocabulary and NWR in bilingual children

Vocabulary development is a language domain that correlates highly with the development of other language and cognitive domains and academic achievement. At the same time, vocabulary knowledge, operationalized as receptive and expressive vocabulary size (that is, the number of words a speaker comprehends and produces) is highly susceptible to the amount and quality of input an individual receives in both monolingual (Weisleder & Fernald, Reference Weisleder and Fernald2013) and bilingual contexts (Chondrogianni, Reference Chondrogianni, Miller, Bayram, Rothman and Serratrice2018 for a review). Studies with bilingual school-age children attending majority L2 programs have shown that those with a later age of L2 exposure (AoE) have a steeper L2 vocabulary learning spurt compared to their bilingual peers with a lower L2 AoE (Chondrogianni & Marinis, Reference Chondrogianni and Marinis2011; Golberg et al., Reference Golberg, Paradis and Crago2008). This has been attributed to two factors: (i) to children’s cognitive maturity when exposed to the L2 at a later age, which allows them to consolidate lexical information faster, and, (ii) to the positive, in this case, interdependence between languages that allows children to build on concepts already acquired in their first language (Golberg et al., Reference Golberg, Paradis and Crago2008) (but also see (Chondrogianni, Reference Chondrogianni, Miller, Bayram, Rothman and Serratrice2018; Unsworth, Reference Unsworth2013) for a discussion of studies reporting negative interdependence across different language domains in simultaneous and sequential bilingual children).

The language of schooling has been shown to interact with the home language to affect vocabulary outcomes in both the majority and the minority language. Studies have shown that although the language of schooling affects vocabulary size, the language used at home will have larger effects on vocabulary development, especially for minority languages, as it reinforces the contexts outside the school context where the minority language can be used, (Dijkstra et al., Reference Dijkstra, Kuiken, Jorna and Klinkerberg2016; V. C. Gathercole et al., Reference Gathercole, Thomas and Hughes2008; V. C. Gathercole & Thomas, Reference Gathercole and Thomas2009; Hoff et al., Reference Hoff, Rumiche, Burridge, Ribot and Welsh2014; O’Toole et al., Reference O’Toole, Ní Shíthigh, Molamphy and Walsh2019; Smithson et al., Reference Smithson, Paradis and Nicoladis2014). Additionally, speaking one of the two languages at home prior to attending immersion education may allow more time for the L1 vocabulary skills to be consolidated before the onset of L2 education. This is in line with studies reporting that children with later majority L2 immersion showed greater gains in the minority L1 (Schwartz et al., Reference Schwartz, Moin and Leikin2012).

Turning to studies investigating the majority language vocabulary skills in children attending minority language immersion education for revitalization, two studies with Welsh-English bilingual children from two to fifteen years of age showed that older children with higher exposure to English at home had higher vocabulary scores on a standardized assessment of single-word vocabulary comprehension for British English (V. C. Gathercole et al., Reference Gathercole, Thomas, Roberts, Hughes, Hughes and Gathercole2013; Rhys & Thomas, Reference Rhys and Thomas2013) compared to more Welsh-dominant peers in Welsh-immersion education, who reached age-appropriate English receptive vocabulary by adolescence. In the study Gathercole and Thomas (Reference Gathercole and Thomas2009) with Welsh-English bilingual children using a non-standardized shorter version of the Welsh and English standardized receptive vocabulary tasks adopted in the studies mentioned above, the authors reported that seven-year-old bilingual children from English-speaking homes attending Welsh-medium schools performed on a par with children with the same home language background attending Welsh-English bilingual schools. Note that at the age of seven years, the majority language English is not supported in Welsh-medium education. These studies from the Welsh context suggest that exposure to the majority language outside the school context (in the home in this case) suffices to offset any lack of instruction in the school setting, even when standardized assessments are used to assess the majority language.

Whether performance on NWR is susceptible to exposure factors is more controversial, and results depend on the design of the NWR tasks and what exactly they measure (Chiat, Reference Chiat, Armon-Lotem, de Jong and Meir2015). Generally, bilingual children perform better on NWR tasks compared to vocabulary or morphosyntactic tasks, which tap into LS knowledge (Chiat, Reference Chiat, Armon-Lotem, de Jong and Meir2015; Thordardottir & Brandeker, Reference Thordardottir and Brandeker2013). Bilingual children’s performance on NWR also improves with age (Chiat & Polišenská, Reference Chiat and Polišenská2016; Sorenson Duncan & Paradis, Reference Sorenson Duncan and Paradis2016), although longer LoE to the majority L2 is also an important predictor for L2 NWR performance, and in some studies, over and above the child’s current age (Sorenson Duncan & Paradis, Reference Sorenson Duncan and Paradis2016). Numerous recent studies with bilingual children learning the majority L2 have shown that, when children are assessed on NWR tasks designed to not favor L2-specific sublexical properties, such as LS syllable structure and phonotactics, or to avoid language-specific consonant clusters, such as the case of QU NWR task developed by (Chiat, Reference Chiat, Armon-Lotem, de Jong and Meir2015; Chiat & Polišenská, Reference Chiat and Polišenská2016), bilingual children perform better on these tasks than on the LS ones (Boerma et al., Reference Boerma, Chiat, Leseman, Timmermeister, Wijnen and Blom2015) and may not differ from their monolingual peers, especially after controlling for L2 receptive vocabulary (Antonijevic-Elliott et al., Reference Antonijevic-Elliott, Lyons, O’ Malley, Meir, Haman, Banasik, Carroll, McMenamin, Rodden and Fitzmaurice2020; Chiat & Polišenská, Reference Chiat and Polišenská2016; Engel de Abreu et al., Reference Engel de Abreu, Cruz-Santos, Tourinho, Martin and Bialystok2012; Meir & Armon-Lotem, Reference Meir and Armon-Lotem2017). In two other studies using NWR tasks similar to the present study, current age was not a significant predictor for the QU and the LS NWR task performance in the bilingual and monolingual children (Antonijevic-Elliott et al., Reference Antonijevic-Elliott, Lyons, O’ Malley, Meir, Haman, Banasik, Carroll, McMenamin, Rodden and Fitzmaurice2020; Szewczyk et al., Reference Szewczyk, Marecka, Chiat and Wodniecka2018); conversely, earlier AoE to the majority L2 (English) in the case of bilingual children and receptive vocabulary in the case of monolingual children were significant individual variables in each study respectively, although the effect of age may have been masked by the stronger vocabulary differences between the groups (Szewczyk et al., Reference Szewczyk, Marecka, Chiat and Wodniecka2018).

Finally, individual differences on vocabulary learning and NWR can be modulated by nonverbal intelligence, as a proxy for analytic reasoning. Nonverbal IQ has shown to significantly and positively correlate with receptive vocabulary in both the L1 and L2 (Daller & Ongun, Reference Daller and Ongun2017; Paradis, Reference Paradis2011), but has weak or no correlations with NWR tasks (Conti-Ramsden et al., Reference Conti-Ramsden, Botting and Faragher2001; Szewczyk et al., Reference Szewczyk, Marecka, Chiat and Wodniecka2018). The discrepancy between two measures may be attributed to the close link between analytic reasoning and vocabulary skills reported for both monolingual and bilingual contexts (Paradis, Reference Paradis2011).

It is important to note that in previous studies investigating the development of NWR in bilingual children of primary school age, these bilingual children were tested in the majority L2 which is supported at school, for example, in L2 English, and the child’s L1 was different from the majority L2. These studies differ from our study where English is the children’s L1 but it is not supported in the school context. In the present study, and because we are focusing on English as the majority language of the home and the wider community, we expect exposure variables (LoE and AoE) to play a lesser role for the children in our sample, especially in the case of the QU NWR task, whereas age at the time of testing will remain a contributing factor, as it also affects monolingual children’s performance on NWR tasks. We return to more detailed predictions in the relevant section below.

Present study

In the present study, we examined the individual, linguistic, and item-related factors that affect the comprehension and production of English vocabulary and NWR in English-Gaelic bilingual children in GMPE. In GMPE, the first three years (P1-P3) of primary school (age five to eight years) are taught in L2 Gaelic, after which point L1 English is introduced into the curriculum (O’Hanlon et al., Reference O’Hanlon, Paterson and McLeod2012). The two year-groups that we targeted in the current study attended primary years 2 and 3 at the time of testing, what is considered to be part of the first level in Scottish National Curriculum for Excellence (Scotland, 2021). Currently, how these factors affect majority language acquisition in GMPE remain to date unexplored. Specifically, we asked the following research questions:

  1. 1. How do item-level factors such as word class and AoA, and child-level factors, such as current age, length (LoE), and frequency of exposure (FoE) to English and Gaelic currently and at an early age, as well as nonverbal IQ modulate English-Gaelic bilingual children lexical production and comprehension?

  2. 2. How do item-level (word length, Ngram frequency, prosody, and phonological complexity) and child-level factors affect English-Gaelic children’s ability to repeat nonwords that have language-specific and QU properties?

  3. 3. What is the relationship between English vocabulary and NWR in English-Gaelic bilingual children attending the first level of GMPE?

Predictions

Starting with the item-level factors for vocabulary, we expected children to have better receptive than expressive skills and to perform better on nouns and words with lower AoA compared to verbs (Altman et al., Reference Altman, Goldstein and Armon-Lotem2017; Haman et al., Reference Haman, Łuniewska, Hansen, Simonsen, Chiat, Bjekić, Blažienė, Chyl, Dabašinskienė, Engel de Abreu, Gagarina, Gavarró, Håkansson, Harel, Holm, Kapalková, Kunnari, Levorato, Lindgren and Armon-Lotem2017). Better performance on comprehension than on production was expected because lexical retrieval and articulation implicated during production are more effortful than accessing lexical or phonological representations, implicated during comprehension (Gibson et al., Reference Gibson, Peña and Bedore2014; Levelt, Reference Levelt2001). We also expected children to perform better on nouns than on verbs not only because (concrete) nouns are generally acquired earlier and are easier to depict than events (McDonough et al., Reference McDonough, Song, Hirsh-Pasek, Golinkoff and Lannon2011), but also because nouns tend to have less marked morphosyntactic and distributional properties (Haman et al., Reference Haman, Łuniewska, Pomiechowska, Armon-Lotem, de Jong and Meir2015; Haman, Łuniewska, et al., Reference Haman, Łuniewska, Hansen, Simonsen, Chiat, Bjekić, Blažienė, Chyl, Dabašinskienė, Engel de Abreu, Gagarina, Gavarró, Håkansson, Harel, Holm, Kapalková, Kunnari, Levorato, Lindgren and Armon-Lotem2017), which may impact on acquisition. Finally, we expected words with lower AoA to elicit higher performance as children would have had more time to practice with earlier than with later acquired words (Łuniewska et al., Reference Łuniewska, Haman, Armon-Lotem, Etenkowski, Southwood, Anđelković, Blom, Boerma, Chiat, de Abreu, Gagarina, Gavarró, Håkansson, Hickey, de López, Marinis, Popović, Thordardottir, Blažienė and Ünal-Logacev2016, Reference Łuniewska, Wodniecka, Miller, Smolík, Butcher, Chondrogianni, Hreich, Messarra, Razak, Treffers-Daller, Yap, Abboud, Talebi, Gureghian, Tuller and Haman2019).

Turning to the NWR task, we expected that nonword length as a measure of phonological STM will influence performance across the different NWR subtasks. Specifically, we expected words with fewer syllables to have higher accuracy than words with more syllables if increase of word length negatively impacts on phonological STM (Chiat, Reference Chiat, Armon-Lotem, de Jong and Meir2015; S. E. Gathercole & Baddeley, Reference Gathercole and Baddeley1990). In terms of the sublexical properties of nonwords, we anticipated that participants would perform better on the QU with prosody (PS) set than the QU set without prosody given the facilitating role of prosody in (non)word learning (Archibald & Gathercole, Reference Archibald and Gathercole2007; Archibald et al., Reference Archibald, Gathercole and Joanisse2009; Roy & Chiat, Reference Roy and Chiat2004). We also expected performance on the QU set to be better than that on the LS set due to the overall less marked and articulatory easier nonword structure in the QU set (CVCV) and the absence of consonant clusters compared to the LS set (Boerma et al., Reference Boerma, Chiat, Leseman, Timmermeister, Wijnen and Blom2015). In the LS set, we expected Ngram frequency, as a subtle measure of sublexical representations, syllable number, as a measure of phonological STM, and receptive vocabulary scores, as a measure of the child’s English lexical knowledge, to be significant predictors. Specifically, we predicted that children would perform better on items that have a higher Ngram frequency, fewer consonants (Szewczyk et al., Reference Szewczyk, Marecka, Chiat and Wodniecka2018), fewer syllables (Boerma et al., Reference Boerma, Chiat, Leseman, Timmermeister, Wijnen and Blom2015), and typical prosody (Roy & Chiat, Reference Roy and Chiat2004). We also expected lexical and phonological skills to be positively associated, as the ability to repeat nonwords in younger children is related to their ability to learn new words (Masoura & Gathercole, Reference Masoura and Gathercole1999). In addition, learning new words (i.e., increase in vocabulary size) should provide more phonotactic/sublexical knowledge and make NWR easier (Szewczyk et al., Reference Szewczyk, Marecka, Chiat and Wodniecka2018).

Turning to the child-level factors affecting performance on vocabulary, we predicted that older children with higher IQ (Daller & Ongun, Reference Daller and Ongun2017), earlier, longer, and more frequent exposure to the majority L1 English and later and less frequent exposure to the minority L2 Gaelic (Armon-Lotem & Ohana, Reference Armon-Lotem and Ohana2017) would perform better on the English expressive and receptive vocabulary tasks (Gathercole et al., Reference Gathercole, Thomas, Roberts, Hughes, Hughes and Gathercole2013). This is because the more exposure bilingual children have to consolidate the home language vocabulary (English in this case), the better they perform on related tasks (V. C. Gathercole et al., Reference Gathercole, Thomas, Roberts, Hughes, Hughes and Gathercole2013; Golberg et al., Reference Golberg, Paradis and Crago2008). However, we expected that the strong language exposure effects generally reported for other bilingual children, especially for bilingual children of a heritage background attending majority schools (Chondrogianni, Reference Chondrogianni, Miller, Bayram, Rothman and Serratrice2018), may not be as pronounced in the present study. Specifically, reduced language exposure may lead to weaker links between lexical access and phonological representations. These weaker links have been reported for bilingual children in previous studies when tested in their minority L1 or majority L2 (Gibson et al., Reference Gibson, Peña, Bedore and McCarter2020). However, these effects may not be observed to the same degree in the present study, where the emergent children were coming from predominantly majority L1 English-speaking homes and were tested in their majority home language that they were regularly using outside the school context.

For the NWR task, we expected that older children would perform better than younger children (Chiat & Polišenská, Reference Chiat and Polišenská2016), when they have higher receptive vocabulary scores (Szewczyk et al., Reference Szewczyk, Marecka, Chiat and Wodniecka2018) and when they have higher exposure to the language tested (majority language in this case) (Antonijevic-Elliott et al., Reference Antonijevic-Elliott, Lyons, O’ Malley, Meir, Haman, Banasik, Carroll, McMenamin, Rodden and Fitzmaurice2020; Sorenson Duncan & Paradis, Reference Sorenson Duncan and Paradis2016), as children’s STM skills and sublexical knowledge improves with age and language exposure. If bilingualism enhances STM skills, we anticipated children who have more bilingual exposure to perform better on the NWR task (Morales et al., Reference Morales, Calvo and Bialystok2013). We also expected that performance on the NWR task may also be driven by overlap in sublexical properties between Gaelic and English and specifically, prosody. Namely, given that Gaelic words also display a trochaic stress pattern (Lamb, Reference Lamb2001; Nance & Maolalaigh, Reference Nance and Maolalaigh2019), we expected this overlap in language-specific sublexical properties between the two languages to reinforce performance on the NWR task conditions that tap into prosody. We also expected language exposure to differentially affect correct repetitions across the three subtasks of the NWR task. This is because the QU subtests were constructed to minimize similarity with any specific language and have a CV structure that is present in many different languages. As a result, they should be the least sensitive to language exposure. In contrast, we expected the language-specific subtasks to be more susceptible to language exposure,Footnote 1 and that higher exposure in English (L1) would lead to better performance in both LS NWR task and the vocabulary task (Paradis, Reference Paradis2011; Szewczyk et al., Reference Szewczyk, Marecka, Chiat and Wodniecka2018).

Participants

Fifty-five children (31 female) aged between six and eight years old were recruited from three Gaelic-Medium primary schools in three different locations in Scotland, UK, that is Edinburgh, Glasgow, and the Isle of Skye. Parents provided information about children’s language use and history using the Questionnaire for Parents of Bilingual Children (PABIQ) (Tuller, Reference Tuller, Armon-Lotem, de Jong and Meir2015). All children were exposed to the majority L1 English from birth and had varied exposure to the minority L2 Gaelic outside the school context. Twenty-four children were exposed to Gaelic before their third birthday, and all children had large variation in their frequency of exposure to Gaelic (FoE-GA) at an early age (early FoE-GA) and before attending primary school, which may or may not have coincided with nursery attendance. In contrast, children’s exposure to English (FoE-EN) at home was relatively high, both at an early age and during primary school. As English was the dominant language for all children, we used the Clinical Evaluation of Language Fundamentals (CELF-5) (Semel & Wiig, Reference Semel and Wiig2017) to identify and exclude children with potential Developmental Language Disorder. Five children had a score of one standard deviation or greater below the age-appropriate norms and, hence, were excluded from the final sample. Two other children were excluded because they did not complete all experimental tasks. No other children were excluded on account of reported hearing deficits or serious ear infections. This resulted in a final sample of 48 children. Descriptive data for the background variables can be found in Table 1.

Table 1. Participant background information

Note. AoE = Age of exposure; LoE = Length of Exposure; FoE = frequency of exposure; EN = English; GA = Gaelic; CELF = Clinical Evaluation of Language Fundamentals; Raven’s CPM = Raven’s Color Progressive Matrices.

Materials

Background assessments

Language and nonverbal abilities

To examine children’s general language abilities and to screen for language disorders, we administered the core component of the Clinical Evaluation for Language Fundamentals 5 (CELF-5 (Semel & Wiig, Reference Semel and Wiig2017). The core component of the CELF comprises the Sentence Structure, Word structure, Recalling Sentences, and Formulating Sentences. Children’s raw scores were transformed into standard scores with a 90% CI. Children’s nonverbal IQ was assessed using the Raven’s Colored Progressive matrices (Raven, Reference Raven2003), which is suitable for children aged between five to eleven years. Children complete three sets of twelve puzzles, which required them to choose one of six options. The maximum raw score was 36. Raw scores were used for the statistical analysis in the present study to also account for the role of age.

Background questionnaire

To evaluate children’s exposure to English and Gaelic, we administered the Questionnaire for Parents of Bilingual Children (PABIQ) (Tuller, Reference Tuller, Armon-Lotem, de Jong and Meir2015). Parents reported at what age children were exposed to English and Gaelic and estimated how frequently (“Always,” “Usually,” ‘Sometimes,” “Rarely” or “Never”) English and Gaelic were spoken in the home before primary school attendance (early FoE-EN/GA) and currently (current FoE), and these corresponded to percentages (100%, 75%, 50%, 25%, 0%). In terms of early FoE-EN, only five reported that they usually (75%) spoke English at home, whereas the rest reported that they always spoke English at home. For early FoE-GA, the majority of parents reported that they sometimes (50%) spoke Gaelic in the home or usually (75%) speak Gaelic at home, whereas 15 reported that they rarely (25%, N = 8) or never (N = 7) spoke Gaelic at home. No parents/guardians in our sample were native speakers of Gaelic; they had mostly acquired Gaelic as an L2, if at all. Children had large variation in their AoE to Gaelic, as well as how frequently they were exposed to Gaelic before formal schooling started and currently. Out of the forty-eight children in the sample, seven children had no early or current exposure of Gaelic in the home or early in the nursery, and four children had exposure of Gaelic in the home but did not attend a bilingual English-Gaelic nurseries. All remaining children had some exposure to Gaelic in the home, albeit to variable degree, and attended bilingual English-Gaelic nurseries. Children’s current exposure to Gaelic and English (current FoE-EN/GA) was less variable, as most families spoke English at home, whereas input in Gaelic was provided in the school context (Table 1). We estimated children’s socioeconomic status (SES) based on the years of maternal education. Most mothers had a university degree or university and further training, and only four mothers had only secondary education or vocational training, suggesting that the children in the present sample belonged to mid-to-high SES (Table 1).

Experimental tasks

Expressive and receptive vocabulary tasks

To assess children production and comprehension of nouns and verbs, we used the English version of the Litmus Cross-Linguistic Lexical Task (CLT) (Haman et al., Reference Haman, Łuniewska, Pomiechowska, Armon-Lotem, de Jong and Meir2015). The CLT consists of four subparts: Nouns receptive/Verbs receptive, Nouns expressive/Verbs expressive. In the two CLT-receptive subtasks (CLT-R), children saw four pictures and pointed to the picture which corresponded to the word they heard. In the two CLT-expressive subtasks (CLT-E), children saw one picture and were asked to name it. Each set contained 32 items controlled for imageability and AoA (mean 3.98; SD 1.08; 1.99 – 7.56), as reported in Haman et al. (Reference Haman, Łuniewska, Pomiechowska, Armon-Lotem, de Jong and Meir2015).

Although this task was originally administered to younger monolingual British English-speaking children aged between 5;2 and 6;9 years (Haman, Łuniewska, et al., Reference Haman, Łuniewska, Hansen, Simonsen, Chiat, Bjekić, Blažienė, Chyl, Dabašinskienė, Engel de Abreu, Gagarina, Gavarró, Håkansson, Harel, Holm, Kapalková, Kunnari, Levorato, Lindgren and Armon-Lotem2017), we decided to use the task with the older children in our study for the following reasons.Footnote 2 First, the adoption of the English CLT was part of a larger project investigating the language abilities of English-Gaelic-speaking children in GMPE. As part of this project, we developed a Scottish Gaelic version of the CLT that had the same design criteria as the English CLT. The robustness and comparability of the research design between the English and the Gaelic tasks allow us to investigate how similar factors affect vocabulary development in the two languages of the bilingual individual. The results of this study are currently being published elsewhere. Second, the aim of the present study was not to establish norms on vocabulary comprehension and production or to examine the validity of this task as a measure of proficiency in this group of children (see adaptations of the CLT by van Wonderen and Unsworth (Reference van Wonderen and Unsworth2020) with 4- to 9-year-old monolingual Spanish and Dutch-speaking children with that aim); rather, the aim of the study was to investigate what child- and item-level variables affect emergent bilingual children’s performance on vocabulary and NWR, and how these variables may differentially modulate performance on these two related skills.

NWR task

Children’s phonological STM and sublexical skills were assessed using the British English version of the Crosslinguistic Nonword Repetition Task (CL-NWRT) (Antonijevic-Elliott et al., Reference Antonijevic-Elliott, Lyons, O’ Malley, Meir, Haman, Banasik, Carroll, McMenamin, Rodden and Fitzmaurice2020; Chiat, Reference Chiat, Armon-Lotem, de Jong and Meir2015; Chiat et al., Reference Chiat, Polišenská, Yanushevskaya and Antonijevic2020; Chiat & Polišenská, Reference Chiat and Polišenská2016). In this task, children saw a broken necklace on the computer screen and were asked to assist with making three new necklaces, bead by bead, by repeating “magic” words. For each pre-recorded “magic” word that children heard and repeated back, a bead appeared on the screen, regardless of the accuracy of the repetition. Children only heard each nonword once. The CL-NWRT comprises three sets: the QU, the QU with Prosody-Specific (PS), and the Language-Specific (LS) set. Each component had different items with item-specific variables such as number of syllables and prosody being manipulated (see Table 2 for more detail). In the LS task, Ngram frequency and transitional probability were also manipulated following Chiat (Reference Chiat, Armon-Lotem, de Jong and Meir2015).

Table 2. Overview of the Crosslinguistic nonword repetition task (CL-NWRT) (British English Version) (Chiat, Reference Chiat, Armon-Lotem, de Jong and Meir2015)

Both the Litmus CLT and the CL-NWRT were developed during the COST Action IS0804 “Language impairment in a multilingual society: Linguistic patters and roadmap to assessment” (2008–2013, www.bi-sli.org).

Procedure

Children were tested individually in a quiet room in their primary school over three sessions on separate days by a Speech and Language Therapist and two research assistants. The background and experimental tasks were presented in a randomized order. In all experimental tasks, the words were pre-recorded by a British English female speaker. All children’s responses to the tasks were recorded on a digital voice recorder and later transcribed. This study received ethical permission from the Research Ethics Committee in the School of Philosophy, Psychology, and Language Sciences at the University of Edinburgh, UK (21-1920/1).

Coding and scoring

All responses to the CLT-E were coded as correct following the CLT instruction manual (Haman et al., Reference Haman, Łuniewska, Pomiechowska, Armon-Lotem, de Jong and Meir2015) (see supplementary material for examples). The CL-NWR task responses were transcribed into the International Phonetic Alphabet and scored by an independent rater. A second independent rater reviewed 10% of the transcriptions, chosen randomly. The interrater reliability whole-item rate was 95.5%. In the present study, we focused on whole-item scoring (Chiat, Reference Chiat, Armon-Lotem, de Jong and Meir2015). If the child’s response had all and only the same phonemes in the correct order as the target nonword, it was marked as correct, for example, dula – /’du,lɑ/. Any additions (/’du,lɑm/), omissions (/’u,lɑ/), and substitutions (/’du,mɑ/) were marked as incorrect (Antonijevic-Elliott et al., Reference Antonijevic-Elliott, Lyons, O’ Malley, Meir, Haman, Banasik, Carroll, McMenamin, Rodden and Fitzmaurice2020; Szewczyk et al., Reference Szewczyk, Marecka, Chiat and Wodniecka2018).

Data analysis

Data were analyzed using R-Studio (version 3.6.0) (R Core Team, 2021) and the R packages “lme4” (Bates et al., Reference Bates, Mächler, Bolker and Walker2015), “Hmisc” [version 4.4-0 (Harrell & Dupont, Reference Harrell and Dupont2020)] and “MuMIn,” [version 1.43.6]. For each research question, a generalized logistic mixed model was constructed. Because of the large number of predictors, testing the maximal model with all fixed effects and random effects was not possible. Therefore, we used an overfitting anti-conservative model which contained all variables as fixed effects and only by-participant and by-item random intercepts (Szewczyk et al., Reference Szewczyk, Marecka, Chiat and Wodniecka2018). We decreased the number of variables using a backwards stepwise selection procedure. At each step, we compared alternate models with each variable removed in turn and chose the model with the lowest Akaike information criterion (AIC), which reflected a better goodness of fit while accounting for the complexity of the model. When the AIC value no longer decreased, the most important predictors were found, and this was the final model for the task. Interactions or non-linear contributions were found by building a further set of models and comparing to these final models through likelihood ratio comparisons. Marginal and conditional R2 values of each final model were calculated using the function “r.squaredGLMM” from the package “MuMIn.” The marginal R2 indicated how much variance was explained by the fixed effects while the conditional R2 indicated the variance explained by both fixed and random effects.

Results

Initial analysis: collinearity checks

Prior to constructing the various models, we examined whether there was collinearity among the various predictor variables, which could cancel out potential predictor outcomes. For the item- and participant-related predictors (Table 3), we looked at the Spearman correlation coefficients as not all variables had normal distribution (Tables 4 and 5).Footnote 3 There was high collinearity between certain participant-related predictors, for example, between AoE-Gaelic, LoE-Gaelic, and early FoE-Gaelic. There was also a high correlation between certain item-related predictors, such as between Ngram Frequency and Number of Syllables. As each variable measured different observations and could uniquely contribute to the CLT and NWR task performance, we decorrelated any variables correlated with a rho-value ≥ 0.7. Where two variables (A and B) were highly correlated to each other and only to each other, variable A was converted into residuals of a linear model (A∼B) and variable B was left intact. If variable B was taken out of the model, the residuals of variable A were then replaced by the original variable A. Where three variables (variable A, B, and C) were highly correlated to each other, two models were made, one with A and B and one with A and C. One of each pair was converted into residuals as above; then, final models were compared using AIC.

Table 3. Participant- and item-related variables for the Crosslinguistic Lexical Task (CLT) and the Crosslinguistic Nonword Repetition task (CL-NWRT)

Table 4. Spearman Correlation Coefficients for participant-related predictors in NWRT and CLT

Note. AoE = Age of exposure; LoE = Length of exposure; FoE = Frequency of exposure; PS = Prosody specific; QU = Quasi-universal; LS = Language specific; RS = raw scores; CLT-R/E = Crosslinguistic Lexical task Receptive/Expressive.

Table 5. Spearman Correlation Coefficients for item-related predictors in the Crosslinguistic Nonword Repetition task (CL-NWRT)

Crosslinguistic lexical task

On average, children had 91% accuracy across the receptive and expressive CLT tasks (83-98%; SD 4%). Items were correctly answered by 91% of children (17-100%; SD 17%). This shows that most children answered most items correctly, which indicates a ceiling effect, especially when observing that children reached the maximum probability of answering correctly (1.00) and that there was a smaller variance in the receptive tasks (see Table 6 and Figure 1).

Table 6. Mean (in %) accuracy in the Crosslinguistic Lexical Task (CLT)

Figure 1. Mean Proportion Accuracy on the CLT-Receptive/Expressive and Overall.

To understand the relative and unique contribution of the item- and child-related variables to the CLT-R and CLT-E separately and to avoid reporting here multiple three-way interactions between categorical (Task Type) and continuous background variables (e.g., AoE) that would need to be further unpacked, we constructed different models for each modality after checking for initial interactions. This decision was further reinforced by an original analysis that revealed an interaction between Word class and Task Type and a significantly higher performance on the CLT-R compared to CLT-E (t(47) = 14.512, p < 0.001).

The best models for predicting performance on the CLT-R and CLT-E tasks are presented in Table 7. Across all models, the item-related variables, such as Word Class and AoA, as well as the participant variance (conditional R 2), explained more of the variance than the child-related variables, such as AoE, which in both models contributed to their explanatory variance but was not a significant predictor. Additionally, the model for CLT-E had greater explanatory adequacy than CLT-R due to children’s ceiling accuracy on CLT-R.

Table 7. Best fitting models for the Crosslinguistic Lexical Task (CLT)

Note. AoA = Age of acquisition word; AoE-GA = Age of exposure to Gaelic; RS = raw score.

NWR task

On average, children’s overall accuracy across the NWRT was 63% (32-88%; SD 14%). Items were correctly answered by 63% of children (19-95%; SD 21% SD) showing that there were no floor or ceiling effects (see Table 8 and Figure 2).

Table 8. Descriptive statistics for the Crosslinguistic Nonword Repetition task (CL-NWRT)

Figure 2. Mean Proportion Accuracy on the CL-NWRT (QU = Quasi-universal, PS = prosody-specific, LS = language-specific, and overall).

To understand the relative and unique contribution of variables in children’s performance on the NWR tasks, we compared performance across the three different NWR sets. Children scored significantly higher on the QU set than the PS or LS set (p < 0.001). Because performance differed so greatly between the sets, and the LS set had different item-related variables to QU and PS sets, three models were built, one each for the QU, PS, and LS sets. Furthermore, to answer research question 3, namely whether receptive and expressive vocabulary scores contributed to the NWR performance, we entered the vocabulary scores (percentage accuracy per participant) as fixed effects in the models.

Table 9 presents the best fitting models for predicting performance on the NWR QU task, PS task, and the LS task, respectively. Across all models, the item-related variables, such as number of syllables (QU and PS sets) and Ngram frequency (LS sets), as well as the inclusion of the participant variance (conditional R2), explained more of the variance than child-related variables (LoE-GA and AoE-GA).

Table 9. Best fitting models for the quasi-universal, the prosody-specific, and the language-specific sets of the Crosslinguistic NWR task

Note. LoE-GA = Length of exposure to Gaelic; AoE-GA = Age of exposure to Gaelic; early FoE-EN = early frequency of exposure to English; CLT-E = Crosslinguistic Lexical Task – Expressive.

Discussion

In the present study, we examined the child- and item-level factors that affected the majority language receptive and expressive vocabulary and NWR skills of English-Gaelic six- to eight-year-old bilingual children attending Gaelic-medium immersion education. Specifically, we examined how emergent bilingual children’s performance on the comprehension and production of single-word vocabulary was influenced by word class (nouns vs verbs) and subjective AoA, as estimated by native speakers of English. We also examined how performance on NWR tasks targeting QU and language-specific properties is modulated by sublexical properties, such as Ngram frequency, transitional probability, and prosody, by phonological STM, measured by the number of syllables and consonants in the nonword, as well as by phonological or articulatory complexity, measured by presence of consonant clusters in nonwords. We examined how these item-level factors interact with the child’s age and degree and age of exposure to the two languages.

Lexical skills in bilingual children in immersion education

For vocabulary, we expected the children in our study to perform overall better on the receptive task compared to the expressive task given the different cognitive demands between production and comprehension and further phonetic and articulatory processes imposed during production (Gibson et al., Reference Gibson, Jarmulowicz and Oller2018; Levelt, Reference Levelt2001; Yan & Nicoladis, Reference Yan and Nicoladis2009). Although access of lexical and phonological representations may suffice during comprehension, this is not enough for successful performance during production (Gibson et al., Reference Gibson, Peña, Bedore and McCarter2020), giving, thus, rise to a discrepancy in performance between the two tasks. We also expected children to perform better on nouns than on verbs given the well-documented differences between the two word classes in their developmental trajectory (Altman et al., Reference Altman, Goldstein and Armon-Lotem2017; Haman, Wodniecka, et al., Reference Haman, Wodniecka, Marecka, Szewczyk, Białecka-Pikul, Otwinowska, Mieszkowska, Łuniewska, Kołak, Miękisz, Kacprzak, Banasik and Foryś-Nogala2017; Sheng & McGregor, Reference Sheng and McGregor2010), because concrete nouns are inherently morphosyntactically simpler, and due to the way objects and actions are depicted in picture-based tasks. The single-picture format of the task may favor nouns, because they label one item rather than a relational concept, as in the case of verbs, which is what makes nouns easier for children to learn when younger (McDonough et al., Reference McDonough, Song, Hirsh-Pasek, Golinkoff and Lannon2011). In addition, the depiction of events in the case of verbs may leave more room for naming errors, and this may cause the larger gap between noun and verb performance in the expressive tasks (Altman et al., Reference Altman, Goldstein and Armon-Lotem2017).

The results from the L1 English–L2 Gaelic bilingual children in our study were mixed. Although nouns overall elicited higher accuracy than verbs, children exhibited a discrepancy between comprehension and production only for verbs but not for nouns. The difference between receptive and expressive vocabulary skills only for verbs but not for nouns in this group of emergent bilingual children tested in their majority L1 differentiates them from other groups of bilingual children tested in their minority L1 or the majority L2. In these studies, receptive-expressive differences are reported for both word classes, albeit to a different degree (Altman et al., Reference Altman, Goldstein and Armon-Lotem2017; Gibson et al., Reference Gibson, Oller, Jarmulonowicz and Ethington2012; Haman, Łuniewska, et al., Reference Haman, Łuniewska, Hansen, Simonsen, Chiat, Bjekić, Blažienė, Chyl, Dabašinskienė, Engel de Abreu, Gagarina, Gavarró, Håkansson, Harel, Holm, Kapalková, Kunnari, Levorato, Lindgren and Armon-Lotem2017; Hansen et al., Reference Hansen, Simonsen, Łuniewska and Haman2017; Oller et al., Reference Oller, Pearson and Cobo-Lewis2007; van Wonderen & Unsworth, Reference van Wonderen and Unsworth2020). The lack of discrepancy between the two modalities for nouns but not for verbs may be related to the older age of the children in the present study compared to previous studies, which led to overall higher accuracy compared with previous studies. However, the fact that the emergent bilingual children in our study continued to have lower expressive skills for verbs reinforces the argument that inherent morphosyntactic properties of verbs, their different acquisitional trajectories from nouns, and methodological considerations related to how verbs are depicted or the responses that they elicit (Altman et al., Reference Altman, Goldstein and Armon-Lotem2017; Haman, Łuniewska, et al., Reference Haman, Łuniewska, Hansen, Simonsen, Chiat, Bjekić, Blažienė, Chyl, Dabašinskienė, Engel de Abreu, Gagarina, Gavarró, Håkansson, Harel, Holm, Kapalková, Kunnari, Levorato, Lindgren and Armon-Lotem2017; McDonough et al., Reference McDonough, Song, Hirsh-Pasek, Golinkoff and Lannon2011) lead to reduce performance.

We also expected subjective AoA to influence children’s comprehension and production accuracy (Łuniewska et al., Reference Łuniewska, Haman, Armon-Lotem, Etenkowski, Southwood, Anđelković, Blom, Boerma, Chiat, de Abreu, Gagarina, Gavarró, Håkansson, Hickey, de López, Marinis, Popović, Thordardottir, Blažienė and Ünal-Logacev2016, Reference Łuniewska, Wodniecka, Miller, Smolík, Butcher, Chondrogianni, Hreich, Messarra, Razak, Treffers-Daller, Yap, Abboud, Talebi, Gureghian, Tuller and Haman2019). Subjective AoA indirectly measures how long a child has been using a word and how entrenched the use of a word might be. This is because the earlier a word is acquired the more experience the speaker has with using this word in various contexts and the easier it us to access it. This may also explain why subjective AoA strongly correlates with parental reports on young children’s lexical development (Łuniewska et al., Reference Łuniewska, Haman, Armon-Lotem, Etenkowski, Southwood, Anđelković, Blom, Boerma, Chiat, de Abreu, Gagarina, Gavarró, Håkansson, Hickey, de López, Marinis, Popović, Thordardottir, Blažienė and Ünal-Logacev2016). In the present study, subjective AoA significantly predicted performance on both the expressive and the receptive tasks. This is in line with previous studies with bilingual children (Altman et al., Reference Altman, Goldstein and Armon-Lotem2017; van Wonderen & Unsworth, Reference van Wonderen and Unsworth2020) but with what was reported in Haman et al. (Reference Haman, Łuniewska, Hansen, Simonsen, Chiat, Bjekić, Blažienė, Chyl, Dabašinskienė, Engel de Abreu, Gagarina, Gavarró, Håkansson, Harel, Holm, Kapalková, Kunnari, Levorato, Lindgren and Armon-Lotem2017) study with monolingual British English children, where AoA only correlated with verb production, even though the children in that study were younger than the ones in our study. This may be due to the different profiles of the children in the two studies: emergent bilingual children in our study versus entirely monolingual in the Haman et al. (Reference Haman, Łuniewska, Hansen, Simonsen, Chiat, Bjekić, Blažienė, Chyl, Dabašinskienė, Engel de Abreu, Gagarina, Gavarró, Håkansson, Harel, Holm, Kapalková, Kunnari, Levorato, Lindgren and Armon-Lotem2017) study. Additionally, in our study, AoA was a better predictor than the child’s current age, as it explained age effects while accounting for differences in difficulty across items. Interestingly, in previous studies which found current age a significant predictor, subjective AoA was not used as predictor (Armon-Lotem & Ohana, Reference Armon-Lotem and Ohana2017; Gathercole et al., Reference Gathercole, Thomas, Kennedy, Prys, Young, Viñas Guasch, Roberts, Hughes and Jones2014). Since both current age and AoA are potential indicators of how much experience a child has had with a word, including them both as predictors in a single model may lead to AoA winning over current age. Future studies would benefit from trying to disentangle the distinct contribution of these factors to word learning.

Turning to the child-level predictors, AoE to Gaelic, the minority L2, was a contributing, yet not significant, predictor in both receptive and expressive tasks. This may indicate that with later L2 exposure, the majority L1 English was spoken as the only language at home over a longer time, and hence, children had more time to consolidate their L1 vocabulary skills before being immersed in the minority L2, as reported in previous studies on bilingual children (Chondrogianni, Reference Chondrogianni, Miller, Bayram, Rothman and Serratrice2018; Golberg et al., Reference Golberg, Paradis and Crago2008). Frequency of exposure in the home language, which was also the majority language of the community, namely English, did not predict performance on majority language vocabulary. The relationship between home language use and vocabulary development is widely reported in studies where English is the majority L2, and there is more variation in its use at home because another minority language is spoken in this context (Armon-Lotem & Ohana, Reference Armon-Lotem and Ohana2017; Gathercole et al., Reference Gathercole, Thomas, Kennedy, Prys, Young, Viñas Guasch, Roberts, Hughes and Jones2014). We were thus expecting exposure to modulate performance here as well, albeit to a lesser extent given the majority language status of the children in our sample. In the present study, however, and in contrast to previous studies, the use of English at home was quite high with little variation. Because the children in the present study displayed high performance on English vocabulary tasks and exposure to English was high as well, there was little variation in the data that is usually needed to give rise to correlations between two variables.

Given that in the present study, language exposure measures contributed to but did not strongly predict children’s accuracy on the expressive and receptive vocabulary tasks, the relationship between bilingual exposure and how it affects the vocabulary skills of this emergent bilingual group is harder to gauge. It seems that for the emergent bilingual children in our study the link between lexical access and phonological representations in the majority language remains strong, especially for nouns, and despite the lack of formal instruction of the majority language in the school setting. This finding differentiates them from other studies with sequential bilingual children tested in their minority L1 or majority L2 where weak links between lexical access and phonological representations have been reported, giving rise to receptive-expressive differences (Gibson et al., Reference Gibson, Peña, Bedore and McCarter2020; Gollan et al., Reference Gollan, Montoya, Cera and Sandoval2008). At the same time, the six- to- eight-year-old bilingual children in our study exhibited similar performance to the monolingual children in the Haman et al. (Reference Haman, Łuniewska, Hansen, Simonsen, Chiat, Bjekić, Blažienė, Chyl, Dabašinskienė, Engel de Abreu, Gagarina, Gavarró, Håkansson, Harel, Holm, Kapalková, Kunnari, Levorato, Lindgren and Armon-Lotem2017) study, especially in the production task where their performance was not at ceiling (approx. 76% on verbs). This is despite the fact that the children in our study were two years older than the ones in the Haman et al. (Reference Haman, Łuniewska, Hansen, Simonsen, Chiat, Bjekić, Blažienė, Chyl, Dabašinskienė, Engel de Abreu, Gagarina, Gavarró, Håkansson, Harel, Holm, Kapalková, Kunnari, Levorato, Lindgren and Armon-Lotem2017) study. Future studies could investigate whether and under which conditions the link between lexical access and phonological representations becomes less strong in the majority language, if at all, for bilingual children in immersion education, and whether certain word classes are affected more than others.

Finally, nonverbal IQ was a significant predictor only of expressive vocabulary skills, with children with higher Raven’s scores having larger expressive vocabularies. The relationship between nonverbal IQ and vocabulary development is well documented in monolingual studies (Lervåg et al., Reference Lervåg, Dolean, Tincas and Melby-Lervåg2019) and has been confirmed in previous studies in bilingual children (Daller & Ongun, Reference Daller and Ongun2017). In this study, the relationship between nonverbal IQ and expressive vocabulary may be due to the higher variance in performance in the latter compared to the receptive task. Namely, and as displayed in Figure 1, the receptive vocabulary component elicited almost ceiling performance with little variation, whereas the expressive vocabulary component of the task gave rise to more variation. Given that there was also more variation in the nonverbal IQ task, as displayed in Table 1, the expressive part of the CLT and the nonverbal IQ task correlated more than the receptive CLT component and the nonverbal IQ task in the present study.

Overall, in this study, item-related variables significantly modified the performance of English-Gaelic emergent bilingual children, as the coefficients in the models showed, whereas participant-related variables were important for improving the fit of the model but were not significant themselves. This may be due to the fact that children’s vocabulary skills in the majority language were already high and, with the rather homogeneous degree of exposure to the majority language in the home, the current variation in the present sample gave rise to exposure as a contributing but not a significantly modulating factor.

NWR skills of bilingual children in immersion education

In the NWR task, we predicted that children’s accuracy will be affected by the number of syllables in a nonword, a measure of phonological STM (S. E. Gathercole & Baddeley, Reference Gathercole and Baddeley1990). These predictions were borne out for the QU and PS sets: as the number of syllables increased, the phonological STM load increased, leading to lower accuracy on lengthier nonwords (Boerma et al., Reference Boerma, Chiat, Leseman, Timmermeister, Wijnen and Blom2015; dos Santos & Ferré, Reference dos Santos and Ferré2018; S. E. Gathercole et al., Reference Gathercole, Willis, Emslie and Baddeley1992; Szewczyk et al., Reference Szewczyk, Marecka, Chiat and Wodniecka2018; Thordardottir & Brandeker, Reference Thordardottir and Brandeker2013). However, this was not the case for the LS set, where number of syllables was not a predictor. This may be related to the interplay between NWR as a measure of phonological STM and the sublexical properties that we examined in the present NWR task, which we discuss in more detail below.

Looking at the sublexical properties more closely, we found that simple CVCV structure and lack of consonant clusters in the QU set led better performance compared with the PS and the LS sets (Boerma et al., Reference Boerma, Chiat, Leseman, Timmermeister, Wijnen and Blom2015; Chiat, Reference Chiat, Armon-Lotem, de Jong and Meir2015). However, our results did not confirm our prediction that children would be facilitated by prosody and, hence, perform better on the PS than the QU set. With very early and high exposure to the majority L1 English at home and in the community, we expected that children in this age range would perform better on nonwords with typical prosody (PS) than nonwords with even stress (QU) given the facilitatory role of these sublexical properties and as it is also reported in previous studies with monolingual (Archibald et al., Reference Archibald, Gathercole and Joanisse2009; Archibald & Gathercole, Reference Archibald and Gathercole2007; Roy & Chiat, Reference Roy and Chiat2004) and bilingual children (Boerma et al., Reference Boerma, Chiat, Leseman, Timmermeister, Wijnen and Blom2015). The significantly better performance on the QU set than the PS set in the present study could be because the administration of the three sets was fixed and children always completed the QU task after the PS task. Because the two tasks contain the same nonwords, repetition possibly improved words in the second set (QU) despite the lack of prosody.

Typical prosody was also not a predictor in the final model for the LS set. In previous studies, children performed better on nonwords with typical prosody compared to atypical prosody (Dollaghan et al., Reference Dollaghan, Biber and Campbell1993; Roy & Chiat, Reference Roy and Chiat2004). This different result might be because Ngram frequency could have explained more variance in this study, as it varied by item, providing a subtler measure of sublexical properties, whereas prosody varied only across sets.

Finally, we did not see consonant clusters modulate performance on the LS set, contrary to what was reported in previous studies with monolingual children (Szewczyk et al., Reference Szewczyk, Marecka, Chiat and Wodniecka2018). This could be because the consonant clusters in the present English LS task were not as challenging as in other tasks reported in previous studies (Gathercole et al., Reference Gathercole, Frankish, Pickering and Peaker1999; Szewczyk et al., Reference Szewczyk, Marecka, Chiat and Wodniecka2018). Furthermore, the medium-to-high correlation between number of syllables, consonants, and Ngram frequency in the current study suggests that Ngram frequency may explain variance that number of syllables or consonants may also explain (Gathercole et al., Reference Gathercole, Frankish, Pickering and Peaker1999). This may be because as we learn larger chunks of phonemes that occur more often in the language (Jones, Reference Jones2016), working memory is able to more easily maintain fewer, longer and familiar phoneme chunks found in higher Ngram nonwords, than many, short and less familiar phoneme chunks (from nonwords with low Ngram frequency) (Szewczyk et al., Reference Szewczyk, Marecka, Chiat and Wodniecka2018). Such an explanation would be compatible with phonological models, such as the EPAM-VOC and CLASSIC, that assume that the presentation of nonwords activates sequences of phonemes in the long-term memory that are present in the nonword (e.g., see (Jones et al., Reference Jones, Gobet and Pine2007).

Turning to the child-level predictors, we predicted that increased age, receptive vocabulary scores, and exposure to the majority language would predict better LS set performance but may not affect performance on the QU and PS sets, as these two tap into more universal phonological properties of(non)word structure (Chiat & Polisenksa 2016; Szewczyk et al., Reference Szewczyk, Marecka, Chiat and Wodniecka2018). Looking at each individual variable in turn, current age was not a predictor in any final model of the NWR task. This was in line with our prediction, as we were not expecting any child-level variables to be significant for the QU set. In the PS and LS sets, we anticipated current age to improve performance. However, it seems that, in this study, other factors such as AoE to L2 Gaelic and receptive vocabulary scores were more significant child-related predictors, similarly to previous studies (Antonijevic-Elliott et al., Reference Antonijevic-Elliott, Lyons, O’ Malley, Meir, Haman, Banasik, Carroll, McMenamin, Rodden and Fitzmaurice2020; Szewczyk et al., Reference Szewczyk, Marecka, Chiat and Wodniecka2018). Interestingly, LoE to the minority L2 Gaelic and AoE to L2 Gaelic were significant predictors of children’s performance on the NWR task, albeit in surprising ways. That is, earlier and longer exposure to minority L2 Gaelic rather than to the majority English significantly predicted better performance on the PS and LS sets. Although one may assume that more exposure to the majority language and less exposure to minority L2 would lead to better performance in the PS and LS sets, as they contained prosody and/or phonetics typical of the majority L1, we should note that Gaelic is also a stress-timed language with a trochaic pattern similar to English (Morrison, Reference Morrison2019) and with more multisyllabic words than English (Lamb, Reference Lamb2002). The same children tested in this study were also administered the Gaelic version of the CLT, where we established that they know a range of Gaelic words (including multisyllabic ones) in Gaelic (approx. 76% for comprehension and 40% for production). This ability to comprehend and produce multisyllabic words in another language may have reinforced their understanding of the prosodic patterns in the two languages, as well as their performance on multisyllabic words in the NWR task. This may have resulted in improved performance on these prosody-specific tasks with increased and earlier exposure to this particular minority language (Kaushanskaya & Marian, Reference Kaushanskaya and Marian2009). Additional evidence for this also comes from how exposure affected performance on the QU set. Children’s performance on the QU set was also influenced by exposure variables in that children with earlier exposure to the minority L2 Gaelic and less current exposure to the majority L1 English performed better on the QU set. Although we did not expect exposure to affect this component of the CL-NWRT (Antonijevic-Elliott et al., Reference Antonijevic-Elliott, Lyons, O’ Malley, Meir, Haman, Banasik, Carroll, McMenamin, Rodden and Fitzmaurice2020; Chiat & Polišenská, Reference Chiat and Polišenská2016; Engel de Abreu et al., Reference Engel de Abreu, Cruz-Santos, Tourinho, Martin and Bialystok2012; Meir & Armon-Lotem, Reference Meir and Armon-Lotem2017; Sorenson Duncan & Paradis, Reference Sorenson Duncan and Paradis2016), it seems that, in our study, earlier bilingual exposure led to better performance on a non-language-specific task.

Finally, we found that expressive but not receptive vocabulary skills contributed to children’s performance on the LS set of the NWR task, which carries language-specific phonotactics. The lack of contribution of the receptive task to children’s performance on the NWR task reported in other studies (Sorenson Duncan & Paradis, Reference Sorenson Duncan and Paradis2016) may be due to children’s ceiling performance on this component, leading to little variation, and, hence, lack of a significant effect. Given the greater variation observed in the expressive vocabulary task, this measure contributed more to children’s performance on the NWR task. It may also be because both tasks are expressive and hence involve similar processes of lexical or sublexical access and phonological representations, as well as articulation and oromotor movement, whereas the latter two are not implicated in comprehension (S. E. Gathercole et al., Reference Gathercole, Willis, Emslie and Baddeley1992; Levelt, Reference Levelt2001). Our findings are in line with previous studies that show larger vocabulary sizes led to better NWR skills, especially for children of this age (Kan & Kohnert, Reference Kan and Kohnert2012; Roy & Chiat, Reference Roy and Chiat2004; Sorenson Duncan & Paradis, Reference Sorenson Duncan and Paradis2016; Szewczyk et al., Reference Szewczyk, Marecka, Chiat and Wodniecka2018), although NWR merely contributed to the model rather than strongly predicted lexical skills in our study (Verhagen et al., Reference Verhagen, Boom, Mulder, de Bree and Leseman2019).

Conclusions

In this study, we investigated how psycholinguistic, item-level and child-level variables affected performance of emergent bilingual English-Gaelic children attending Gaelic-medium education on vocabulary tasks targeting production and comprehension of single-word vocabulary and on NWR tasks targeting QU and language-specific properties. We found that children’s performance on both receptive and expressive vocabulary tasks was significantly better on nouns compared to verbs and on words acquired at a younger age. Children’s performance was significantly better on nonwords with fewer syllables and with higher Ngram frequency. Later exposure to Gaelic and higher nonverbal IQ contributed to better performance on the vocabulary task. Earlier exposure to Gaelic, less English spoken at home and higher expressive vocabulary size predicted better performance in the NWR task.

Taken together, these results reinforce the assertion that the extended exposure that majority language pupils have to the majority language (English) in an English-dominant community limits potential modulating effects of exposure to a minority language through immersion education. In the present study, this is evidenced in the context of a not previously examined minority language, Gaelic, undergoing revitalization through immersion education. What is more, increased exposure to the minority language at home may lead to benefits in more general phonological skills related to word knowledge and word learning, such as NWR, especially when sublexical properties of the two languages overlap.

Acknowledgments

This study was funded by Bòrd na Gàidhlig (Ref. N: R45026). We would like to thank Euan Dickson for assisting with the data collection. We would also like to thank the schools, children, and their guardians who participated in this study.

Declarations of interest

None.

Appendix

Examples of types of errors in the CLT marked correct and incorrect (Examples are given in the format (target ➜ participant’s response))

Reponses marked as correct

  • Mispronunciation (“paintbrush” ➜ “paintbush”)

  • Unexpected or incorrect inflection (“mouse” ➜ “mice” or “mices”)

  • Derivations within or across word class (“frog” ➜ “froggy”), (“lighting” ➜ “a lighter”)

  • Innovation involving target ( “bed” ➜ “bedroom”)

  • Synonyms (“hugging” ➜ “cuddling”).

Responses marked as incorrect

  • Definitions (“peeling” ➜ “cutting the skin off”)

  • Hypernyms (“boot” ➜ “shoe”)

  • Hyponyms (“bear” ➜ “panda”)

  • Wrong word class without using target root (“needle” ➜ “threading”)

  • Onomatopoeia (“bathing” ➜ “splash splash”)

  • Gesture

  • Semantic confusion (“apple” ➜ “pear”)

  • Associative confusion (“dog” ➜ “bone”)

  • Perceptual confusion (“button” ➜ “plate”)

  • Phonological confusion (“mouse” ➜ “mouth”).

Footnotes

1 We would like to thank one of the anonymous reviewers for pointing out this prediction to us.

2 One reviewer raises the question about the appropriateness of the task with the age group in the present study. Although we acknowledge the limitations of using this task with this group (see Discussion), the emergent bilingual children in our study are different from the children in the Haman et al. (Reference Haman, Łuniewska, Hansen, Simonsen, Chiat, Bjekić, Blažienė, Chyl, Dabašinskienė, Engel de Abreu, Gagarina, Gavarró, Håkansson, Harel, Holm, Kapalková, Kunnari, Levorato, Lindgren and Armon-Lotem2017) study in various ways. First, the children in our study were older than the ones in the Haman et al. (Reference Haman, Łuniewska, Hansen, Simonsen, Chiat, Bjekić, Blažienė, Chyl, Dabašinskienė, Engel de Abreu, Gagarina, Gavarró, Håkansson, Harel, Holm, Kapalková, Kunnari, Levorato, Lindgren and Armon-Lotem2017) study. Despite that, they did not have higher performance than the younger monolingual children described in (Haman, Łuniewska, et al., Reference Haman, Wodniecka, Marecka, Szewczyk, Białecka-Pikul, Otwinowska, Mieszkowska, Łuniewska, Kołak, Miękisz, Kacprzak, Banasik and Foryś-Nogala2017) study. Second, the children in our study are not monolingual; they are emergent bilingual children. And third, they are attending Gaelic-medium immersion education and not English mainstream education. Finally, the children in our study exhibited variability on subparts of the task that allowed us to address our research questions.

3 Age of Exposure to English, Length of Exposure to English, and Level of Maternal Education as a proxy for socioeconomic background (SES) were included in the initial analysis. However, these factors did not correlate or predict any of the outcomes, and, hence, they were not included in the results reported here. Specific to SES, and given that previous studies have reported effects of SES on the tasks examined in the present study (e.g., (Łuniewska et al., Reference Łuniewska, Haman, Armon-Lotem, Etenkowski, Southwood, Anđelković, Blom, Boerma, Chiat, de Abreu, Gagarina, Gavarró, Håkansson, Hickey, de López, Marinis, Popović, Thordardottir, Blažienė and Ünal-Logacev2016; Meir & Armon-Lotem, Reference Meir and Armon-Lotem2017; Roy & Chiat, Reference Roy and Chiat2004), we believe that the lack of SES effect in the present study is due to the rather homogeneously high SES of the children in our sample and the lack of variation (Table 1; mean years of maternal education = 17.17; SD = 2.63). Even the four children with lower SES performed within the range of the entire group and hence were included in the analysis.

References

Altman, C., Goldstein, T., & Armon-Lotem, S. (2017). Quantitative and qualitative differences in the lexical knowledge of monolingual and bilingual children on the LITMUS-CLT task. Clinical Linguistics & Phonetics, 31(11–12), 931954. https://doi.org/10.1080/02699206.2017.1312533 CrossRefGoogle ScholarPubMed
Antonijevic-Elliott, S., Lyons, R., O’ Malley, M. P., Meir, N., Haman, E., Banasik, N., Carroll, C., McMenamin, R., Rodden, M., & Fitzmaurice, Y. (2020). Language assessment of monolingual and multilingual children using non-word and sentence repetition tasks. Clinical Linguistics & Phonetics, 34(4), 293311. https://doi.org/10.1080/02699206.2019.1637458 CrossRefGoogle ScholarPubMed
Archibald, L. M. D., & Gathercole, S. E. (2006). Short-term and working memory in specific language impairment. International Journal of Language & Communication Disorders, 41(6), 675693. https://doi.org/10.1080/13682820500442602 CrossRefGoogle ScholarPubMed
Archibald, L. M. D., & Gathercole, S. E. (2007). Nonword repetition in specific language impairment: More than a phonological short-term memory deficit. Psychonomic Bulletin & Review, 14(5), 919924.CrossRefGoogle ScholarPubMed
Archibald, L. M. D., Gathercole, S. E., & Joanisse, M. F. (2009). Multisyllabic nonwords: More than a string of syllables. The Journal of the Acoustical Society of America, 125(3), 17121722.CrossRefGoogle ScholarPubMed
Armon-Lotem, S., & Ohana, O. (2017). A CDI study of bilingual English-Hebrew children – Frequency of exposure as a major source of variation. International Journal of Bilingual Education and Bilingualism, 20(2), 201217. https://doi.org/10.1080/13670050.2016.1179257 CrossRefGoogle Scholar
Baker, C. (2010). Bilingual education. In Kaplan, R. B. (Ed.), The Oxford handbook of applied linguistics. Oxford University Press. https://doi.org/10.1093/oxfordhb/9780195384253.013.0020 Google Scholar
Bates, D., Mächler, M., Bolker, B., & Walker, S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 1(1). https://doi.org/10.18637/jss.v067.i01 Google Scholar
Bialystok, E. (2018). Bilingual education for young children: Review of the effects and consequences. International Journal of Bilingual Education and Bilingualism, 21(6), 666679. https://doi.org/10.1080/13670050.2016.1203859 CrossRefGoogle ScholarPubMed
Boerma, T., Chiat, S., Leseman, P., Timmermeister, M., Wijnen, F., & Blom, E. (2015). A Quasi-Universal Nonword repetition task as a diagnostic tool for bilingual children learning Dutch as a second language. Journal of Speech, Language, and Hearing Research, 58(6), 17471760. https://doi.org/10.1044/2015_JSLHR-L-15-0058 CrossRefGoogle ScholarPubMed
Boerma, T., Leseman, P., Wijnen, F., & Blom, E. (2017). Language proficiency and sustained attention in monolingual and bilingual children with and without language impairment. Frontiers in Psychology, 8, 1241.CrossRefGoogle ScholarPubMed
Chiat, S. (2015). Non-word repetition. In Armon-Lotem, S., de Jong, J., & Meir, N. (Eds.), Assessing multilingual children (pp. 125150). Multilingual Matters. https://doi.org/doi:10.21832/9781783093137-008 CrossRefGoogle Scholar
Chiat, S., & Polišenská, K. (2016). A framework for Crosslinguistic Nonword repetition tests: Effects of Bilingualism and socioeconomic status on children’s performance. Journal of Speech, Language, and Hearing Research, 59(5), 11791189. https://doi.org/10.1044/2016_JSLHR-L-15-0293 CrossRefGoogle ScholarPubMed
Chiat, S., Polišenská, K., Yanushevskaya, I., & Antonijevic, S. (2020). Language neutral Crosslinguistic Nonword Repetition Test (CL-NWR). https://www.bi-sli.org/cl-nonword-repetiti Google Scholar
Chondrogianni, V. (2018). Child L2 acquisition. In Miller, D., Bayram, F., Rothman, J., & Serratrice, L. (Eds.), Studies in bilingualism (Vol. 54, pp. 103126). John Benjamins Pub Co. https://doi.org/10.1075/sibil.54.06cho Google Scholar
Chondrogianni, V., & Marinis, T. (2011). Differential effects of internal and external factors on the development of vocabulary, tense morphology and morpho-syntax in successive bilingual children. Linguistic Approaches to Bilingualism, 1(3), 318345. https://doi.org/10.1075/lab.1.3.05cho CrossRefGoogle Scholar
Comajoan-Colomé, L., & Coronel-Molina, S. M. (2021). What does language revitalisation in the twenty-first century look like? New trends and frameworks. Journal of Multilingual and Multicultural Development, 42(10), 897904. https://doi.org/10.1080/01434632.2020.1827643 CrossRefGoogle Scholar
Conti-Ramsden, G., Botting, N., & Faragher, B. (2001). Psycholinguistic markers for specific language impairment (SLI). Journal of Child Psychology and Psychiatry, 42(6), 741.CrossRefGoogle ScholarPubMed
Cummins, J. (2000). Language, power, and pedagogy: Bilingual children in the crossfire. Multilingual Matters.CrossRefGoogle Scholar
Cutler, A., & Carter, D. M. (1987). The predominance of strong initial syllables in the English vocabulary. Computer Speech & Language, 2(3), 133142. https://doi.org/https://doi.org/10.1016/0885-2308(87)90004-0 CrossRefGoogle Scholar
Daller, M., & Ongun, Z. (2017). The threshold hypothesis revisited: Bilingual lexical knowledge and non-verbal IQ development. International Journal of Bilingualism, 22(6), 675694. https://doi.org/10.1177/1367006917690835 CrossRefGoogle Scholar
Dijkstra, J., Kuiken, F., Jorna, R. J., & Klinkerberg, E. L. (2016). The role of majority and minority language input in the early development of a bilingual vocabulary. Bilingualism: Language and Cognition, 19(1), 191205. https://doi.org/DOI:10.1017/S1366728915000012 CrossRefGoogle Scholar
Dollaghan, C., Biber, M., & Campbell, T. (1993). Constituent syllable effects in a nonsense-word repetition task. In Journal of Speech & Hearing Research (Vol. 36, Issue 5, pp. 1051–1054). American Speech-Language-Hearing Assn. https://doi.org/10.1044/jshr.3605.1051 Google Scholar
dos Santos, C., & Ferré, S. (2018). A Nonword repetition task to assess bilingual children’s phonology. Language Acquisition, 25(1), 5871. https://doi.org/10.1080/10489223.2016.1243692 CrossRefGoogle Scholar
Duff, F. J., Reen, G., Plunkett, K., & Nation, K. (2015). Do infant vocabulary skills predict school-age language and literacy outcomes? Journal of Child Psychology and Psychiatry, 56(8), 848856. https://doi.org/https://doi.org/10.1111/jcpp.12378 CrossRefGoogle ScholarPubMed
Engel de Abreu, P. M. J., Cruz-Santos, A., Tourinho, C. J., Martin, R., & Bialystok, E. (2012). Bilingualism enriches the poor: Enhanced cognitive control in low-income minority children. Psychological Science, 23(11), 13641371. https://doi.org/10.1177/0956797612443836 CrossRefGoogle ScholarPubMed
Gathercole, S. E., & Baddeley, A. D. (1990). Phonological memory deficits in language disordered children: Is there a causal connection? Journal of Memory and Language, 29(3), 336360. https://doi.org/https://doi.org/10.1016/0749-596X(90)90004-J CrossRefGoogle Scholar
Gathercole, S. E., Frankish, C. R., Pickering, S. J., & Peaker, S. (1999). Phonotactic in- fluences on short-term memory. Journal of Experimental Psychology: Learning, Memory, and Cognition, 25(1), 8495. http://dx.doi.org/10.1037/0278-7393.25.1.84 Google Scholar
Gathercole, S. E., Willis, C. S., Emslie, H., & Baddeley, A. D. (1992). Phonological memory and vocabulary development during the early school years: A longitudinal study. Developmental Psychology, 28(5), 887898. https://doi.org/10.1037/0012-1649.28.5.887 CrossRefGoogle Scholar
Gathercole, V. C., & Thomas, E. (2009). Bilingual first-language development: Dominant language takeover, threatened minority language take-up. Bilingualism: Language and Cognition, 12(2), 213237. https://doi.org/10.1017/S1366728909004015 CrossRefGoogle Scholar
Gathercole, V. C., Thomas, E., & Hughes, E. (2008). Designing a normed receptive vocabulary test for bilingual populations: A model from Welsh. International Journal of Bilingual Education and Bilingualism, 11(6), 678720. https://doi.org/10.1080/13670050802149283 CrossRefGoogle Scholar
Gathercole, V. C., Thomas, E. M., Kennedy, I., Prys, C., Young, N., Viñas Guasch, N., Roberts, E. J., Hughes, E. K., & Jones, L. (2014). Does language dominance affect cognitive performance in bilinguals? Lifespan evidence from preschoolers through older adults on card sorting, Simon, and metalinguistic tasks. Frontiers in Psychology, 5, 11. https://doi.org/10.3389/fpsyg.2014.00011 CrossRefGoogle ScholarPubMed
Gathercole, V. C., Thomas, E. M., Roberts, E., Hughes, C., & Hughes, E. K. (2013). Why assessment needs to take exposure into account: Vocabulary and grammatical abilities in bilingual children. In Gathercole, V. C. M. (Ed.), Issues in the assessment of bilinguals (pp. 2055). Multilingual Matters.CrossRefGoogle Scholar
Genesee, F. (2006). What do we know about bilingual education for majority-language students? The Handbook of Bilingualism, pp. 547576. https://doi.org/https://doi.org/10.1002/9780470756997.ch21 CrossRefGoogle Scholar
Gibson, T. A., Jarmulowicz, L., & Oller, D. K. (2018). Difficulties using standardized tests to identify the receptive expressive gap in bilingual children’s vocabularies. Bilingualism: Language and Cognition, 21(2), 328339. https://doi.org/DOI:10.1017/S1366728917000074 CrossRefGoogle ScholarPubMed
Gibson, T. A., Oller, D. K., Jarmulonowicz, L., & Ethington, C. A. (2012). The receptive–expressive gap in the vocabulary of young second-language learners: Robustness and possible mechanisms. Bilingualism: Language and Cognition, 15(01), 102116. https://doi.org/10.1017/S1366728910000490 CrossRefGoogle ScholarPubMed
Gibson, T. A., Peña, E. D., & Bedore, L. M. (2014). The relation between language experience and receptive-expressive semantic gaps in bilingual children. International Journal of Bilingual Education and Bilingualism, 17(1), 90110. https://doi.org/10.1080/13670050.2012.743960 CrossRefGoogle ScholarPubMed
Gibson, T. A., Peña, E. D., Bedore, L. M., & McCarter, K. S. (2020). A longitudinal investigation of the semantic receptive-expressive gap in Spanish-English bilingual children. International Journal of Bilingual Education and Bilingualism, 115. https://doi.org/10.1080/13670050.2020.1721427 Google ScholarPubMed
Golberg, H., Paradis, J., & Crago, M. (2008). Lexical acquisition over time in minority first language children learning English as a second language. Applied Psycholinguistics, 29(1), 4165.CrossRefGoogle Scholar
Gollan, T. H., Montoya, R. I., Cera, C., & Sandoval, T. C. (2008). More use almost always means a smaller frequency effect: Aging, bilingualism, and the weaker links hypothesis. Journal of Memory and Language, 58(3), 787814. https://doi.org/https://doi.org/10.1016/j.jml.2007.07.001 CrossRefGoogle Scholar
Haman, E., Łuniewska, M., Hansen, P., Simonsen, H. G., Chiat, S., Bjekić, J., Blažienė, A., Chyl, K., Dabašinskienė, I., Engel de Abreu, P., Gagarina, N., Gavarró, A., Håkansson, G., Harel, E., Holm, E., Kapalková, S., Kunnari, S., Levorato, C., Lindgren, J., … Armon-Lotem, S. (2017). Noun and verb knowledge in monolingual preschool children across 17 languages: Data from Cross-linguistic Lexical Tasks (LITMUS-CLT). Clinical Linguistics & Phonetics, 31(11–12), 818843. https://doi.org/10.1080/02699206.2017.1308553 CrossRefGoogle ScholarPubMed
Haman, E., Łuniewska, M., & Pomiechowska, B. (2015). Designing cross-linguistic lexical tasks (CLTs) for bilingual preschool children (Armon-Lotem, S., de Jong, J., & Meir, N. eds., pp. 196240). Multilingual Matters.Google Scholar
Haman, E., Wodniecka, Z., Marecka, M., Szewczyk, J., Białecka-Pikul, M., Otwinowska, A., Mieszkowska, K., Łuniewska, M., Kołak, J., Miękisz, A., Kacprzak, A., Banasik, N., & Foryś-Nogala, M. (2017). How does L1 and L2 exposure impact L1 performance in bilingual children? Evidence from Polish-English migrants to the United Kingdom. Frontiers in Psychology, 8, 1444.CrossRefGoogle ScholarPubMed
Hansen, P., Simonsen, H. G., Łuniewska, M., & Haman, E. (2017). Validating the psycholinguistic aspects of LITMUS-CLT: Evidence from Polish and Norwegian. Clinical Linguistics & Phonetics, 31(11–12), 910930. https://doi.org/10.1080/02699206.2017.1307455 CrossRefGoogle ScholarPubMed
Harrell, F. Jr., & Dupont, Ch. (2020). Hmisc: Harrell Miscellaneous. R Package Version 4.2-0. https://CRAN.R-project.org/package=Hmisc Google Scholar
Hoff, E., Rumiche, R., Burridge, A., Ribot, K. M., & Welsh, S. N. (2014). Expressive vocabulary development in children from bilingual and monolingual homes: A longitudinal study from two to four years. Early Childhood Research Quarterly, 29(4), 433444. https://doi.org/10.1016/j.ecresq.2014.04.012 CrossRefGoogle Scholar
Johnstone, R., Harlen, W., MacNeil, M., Stradling, R., & Thorpe, G. (1999). The attainments of pupils receiving Gaelic medium education in Scotland. University of Stirling, Scottish CILT for the Scottish Executive Education Department.Google Scholar
Jones, G. (2016). The influence of children’s exposure to language from two to six years: The case of nonword repetition. Cognition, 153, 7988. https://doi.org/https://doi.org/10.1016/j.cognition.2016.04.017 CrossRefGoogle ScholarPubMed
Jones, G., Gobet, F., & Pine, J. M. (2007). Linking working memory and long-term memory: A computational model of the learning of new words. Developmental Science, 10(6), 853873. https://doi.org/10.1111/j.1467-7687.2007.00638.x CrossRefGoogle ScholarPubMed
Juhasz, B. J. (2005). Age-of-acquisition effects in word and picture identification. Psychological Bulletin, 131(5), 684712. https://doi.org/10.1037/0033-2909.131.5.684 CrossRefGoogle ScholarPubMed
Kan, P. F., & Kohnert, K. (2012). A growth curve analysis of novel word learning by sequential bilingual preschool children. Bilingualism: Language and Cognition, 15(3), 452469. https://doi.org/DOI:10.1017/S1366728911000356 CrossRefGoogle Scholar
Kaushanskaya, M., & Marian, V. (2009). The bilingual advantage in novel word learning. Psychonomic Bulletin & Review, 16(4), 705710. https://doi.org/10.3758/PBR.16.4.705 CrossRefGoogle ScholarPubMed
Lamb, W. (2001). Scottish Gaelic. Lincom Europa.Google Scholar
Lamb, W. (2002). Scottish Gaelic (2nd ed.). Lincom Europa.Google Scholar
Lervåg, A., Dolean, D., Tincas, I., & Melby-Lervåg, M. (2019). Socioeconomic background, nonverbal IQ and school absence affects the development of vocabulary and reading comprehension in children living in severe poverty. Developmental Science, 22(5), e12858. https://doi.org/https://doi.org/10.1111/desc.12858 CrossRefGoogle ScholarPubMed
Levelt, W. J. M. (2001). Relations between speech production and speech perception: Some behavioral and neurological observations. In E. Dupoux (Ed.), Language, brain and cognitive development: Essays in honour of Jacques Mehler (pp. 241256). MIT Press.Google Scholar
Łuniewska, M., Haman, E., Armon-Lotem, S., Etenkowski, B., Southwood, F., Anđelković, D., Blom, E., Boerma, T., Chiat, S., de Abreu, P. E., Gagarina, N., Gavarró, A., Håkansson, G., Hickey, T., de López, K. J., Marinis, T., Popović, M., Thordardottir, E., Blažienė, A., … Ünal-Logacev, Ö. (2016). Ratings of age of acquisition of 299 words across 25 languages: Is there a cross-linguistic order of words? Behavior Research Methods, 48(3), 11541177. https://doi.org/10.3758/s13428-015-0636-6 CrossRefGoogle Scholar
Łuniewska, M., Wodniecka, Z., Miller, C. A., Smolík, F., Butcher, M., Chondrogianni, V., Hreich, E. K., Messarra, C. A., Razak, R., Treffers-Daller, J., Yap, N. T., Abboud, L., Talebi, A., Gureghian, M., Tuller, L., & Haman, E. (2019). Age of acquisition of 299 words in seven languages: American English, Czech, Gaelic, Lebanese Arabic, Malay, Persian and Western Armenian. PLOS ONE, 14(8), e0220611. https://doi.org/10.1371/journal.pone.0220611 CrossRefGoogle ScholarPubMed
MacNeilage, P. F., & Davis, B. L. (2000). On the origin of internal structure of word forms. Science, 288(5465), 527531. https://doi.org/10.1126/science.288.5465.527 CrossRefGoogle ScholarPubMed
Marshall, C. R., & van der Lely, H. K. J. (2009). Effects of word position and stress on onset cluster production: Evidence from typical development, specific language impairment, and dyslexia. Language, 85(1), 3957. https://doi.org/http://doi.org/10.1353/lan.0.0081 CrossRefGoogle Scholar
Masoura, E. V, & Gathercole, S. E. (1999). Phonological short-term memory and foreign language learning. International Journal of Psychology, 34(5–6), 383388. https://doi.org/10.1080/002075999399738 CrossRefGoogle Scholar
McDonough, C., Song, L., Hirsh-Pasek, K., Golinkoff, R. M., & Lannon, R. (2011). An image is worth a thousand words: Why nouns tend to dominate verbs in early word learning. Developmental Science, 14(2), 181189. https://doi.org/https://doi.org/10.1111/j.1467-7687.2010.00968.x CrossRefGoogle Scholar
Meir, N., & Armon-Lotem, S. (2017). Independent and combined effects of socioeconomic status (SES) and bilingualism on children’s vocabulary and verbal short-term memory. Frontiers in Psychology, 8, 1442. https://doi.org/10.3389/fpsyg.2017.01442 CrossRefGoogle ScholarPubMed
Metsala, J. L. (1999). Young children’s phonological awareness and nonword repetition as a function of vocabulary development. Journal of Educational Psychology, 91(1), 319. https://doi.org/10.1037/0022-0663.91.1.3 CrossRefGoogle Scholar
Montanari, S. (2014). A case study of bi-literacy development among children enrolled in an Italian–English dual language program in Southern California. International Journal of Bilingual Education and Bilingualism, 17(5), 509525. https://doi.org/10.1080/13670050.2013.833892 CrossRefGoogle Scholar
Morales, J., Calvo, A., & Bialystok, E. (2013). Working memory development in monolingual and bilingual children. Journal of Experimental Child Psychology, 114(2), 187202. https://doi.org/https://doi.org/10.1016/j.jecp.2012.09.002 CrossRefGoogle ScholarPubMed
Morrison, D. A. (2019). Metrical structure in Scottish Gaelic: Tonal accent, glottalisation and overlength. Phonology, 36(3), 391432. https://doi.org/DOI:10.1017/S0952675719000204 CrossRefGoogle Scholar
Munson, B., Kurtz, B. A., & Windsor, J. (2005). The influence of vocabulary size, Phonotactic probability, and wordlikeness on nonword repetitions of children with and without specific language impairment. Journal of Speech, Language, and Hearing Research, 48(5), 10331047. https://doi.org/10.1044/1092-4388(2005/072) CrossRefGoogle ScholarPubMed
Nance, C., & Maolalaigh, R. Ó. (2019). Scottish Gaelic. Journal of the International Phonetic Association, 115. https://doi.org/DOI:10.1017/S002510031900015X Google Scholar
National Records of Scotland. (2015). Scotland’s census 2011: Gaelic report (part 1). National Records of Scotland.Google Scholar
Nicolay, A.-C., & Poncelet, M. (2013). Cognitive abilities underlying second-language vocabulary acquisition in an early second-language immersion education context: A longitudinal study. Journal of Experimental Child Psychology, 115(4), 655671. https://doi.org/https://doi.org/10.1016/j.jecp.2013.04.002 CrossRefGoogle Scholar
Ó Muircheartaigh, J., & Hickey, T. (2008). Academic outcome, anxiety and attitudes in early and late immersion in Ireland. International Journal of Bilingual Education and Bilingualism, 11(5), 558576. https://doi.org/10.1080/13670050802149184 CrossRefGoogle Scholar
O’Hanlon, F., Paterson, L., & McLeod, W. (2012). Language models in Gaelic medium pre-school, primary and secondary education: Vol. null (null (ed.)). Soillse.Google Scholar
O’Hanlon, F., Paterson, L., & McLeod, W. (2013). The attainment of pupils in Gaelic-medium primary education in Scotland. International Journal of Bilingual Education and Bilingualism, 16(6), 707729. https://doi.org/10.1080/13670050.2012.711807 CrossRefGoogle Scholar
O’Toole, C., Ní Shíthigh, D., Molamphy, A., & Walsh, E. (2019). Findings from the first phase of developing a receptive vocabulary test for the Irish language. International Journal of Bilingualism, 24(4), 572587. https://doi.org/10.1177/1367006919848142 CrossRefGoogle Scholar
Oller, D. K., Pearson, B. Z., & Cobo-Lewis, A. B. (2007). Profile effects in early bilingual language and literacy. Applied Psycholinguistics, 28(2), 191230. https://doi.org/DOI:10.1017/S0142716407070117 CrossRefGoogle ScholarPubMed
Padilla, A. M., Fan, L., Xu, X., & Silva, D. (2013). A Mandarin/English two-way immersion program: Language proficiency and academic achievement. Foreign Language Annals, 46(4), 661679. https://doi.org/https://doi.org/10.1111/flan.12060 CrossRefGoogle Scholar
Paradis, J. (2011). Individual differences in child English second language acquisition comparing child-internal and child-external factors. Linguistic Approaches to Bilingualism, 1(3), 213237. https://doi.org/10.1075/lab.1.3.01par CrossRefGoogle Scholar
Paradis, J., Genesee, F., & Crago, M. B. (2011). Dual language development and disorders: A handbook on bilingualism and second language learning (2nd ed.). Baltimore, MD: Paul H. Brookes Pub. Co.Google Scholar
Raven, J. C. (2003). Raven’s coloured progressive matrices. Pearson Assessment.CrossRefGoogle Scholar
R Core Team. (2021). R: A language and environment for statistical computing. R Foundation for Statistical Computing.Google Scholar
Rhys, M., & Thomas, E. M. (2013). Bilingual Welsh–English children’s acquisition of vocabulary and reading: implications for bilingual education. International Journal of Bilingual Education and Bilingualism, 16(6), 633656. https://doi.org/10.1080/13670050.2012.706248 CrossRefGoogle Scholar
Rispens, J., & Baker, A. (2012). Nonword repetition: The relative contributions of phonological short-term memory and phonological representations in children with language and reading impairment. Journal of Speech, Language, and Hearing Research, 55(3), 683694. https://doi.org/10.1044/1092-4388(2011/10-0263)CrossRefGoogle ScholarPubMed
Roy, P., & Chiat, S. (2004). A prosodically controlled word and nonword repetition task for 2- to 4-year-olds. Journal of Speech, Language, and Hearing Research, 47(1), 223234. https://doi.org/10.1044/1092-4388(2004/019)CrossRefGoogle ScholarPubMed
Saffran, J. R., & Wilson, D. P. (2003). From syllables to syntax: Multilevel statistical learning by 12-month-old infants. Infancy, 4(2), 273284. https://doi.org/https://doi.org/10.1207/S15327078IN0402_07 CrossRefGoogle Scholar
Schwartz, M., Moin, V., & Leikin, M. (2012). Lexical knowledge development in the first and second languages among language-minority children: The role of bilingual versus monolingual preschool education. International Journal of Bilingual Education and Bilingualism, 15(5), 549571. https://doi.org/10.1080/13670050.2011.650332 CrossRefGoogle Scholar
Semel, E., & Wiig, E. H. (2017). Clinical evaluation for language fundamentals 5 (CELF-5). Pearson Assessment.Google Scholar
Sheng, L., & McGregor, K. K. (2010). Lexical–semantic organization in children with specific language impairment. Journal of Speech, Language, and Hearing Research, 53(1), 146159. https://doi.org/10.1044/1092-4388(2009/08-0160)CrossRefGoogle ScholarPubMed
Smithson, L., Paradis, J., & Nicoladis, E. (2014). Bilingualism and receptive vocabulary achievement: Could sociocultural context make a difference? Bilingualism: Language and Cognition, 17(4), 810821. https://doi.org/DOI:10.1017/S1366728913000813 CrossRefGoogle Scholar
Sorenson Duncan, T., & Paradis, J. (2016). English language learners’ nonword repetition performance: The influence of age, L2 vocabulary size, length of L2 exposure, and L1 phonology. Journal of Speech, Language, and Hearing Research, 59(1), 3948. https://doi.org/10.1044/2015_JSLHR-L-14-0020 CrossRefGoogle Scholar
Stephen, C., McPake, J., McLeod, W., Pollock, I., & Carroll, T. (2010). Review of Gaelic medium early education and childcare. Retrieved form http://strathprints.strath.ac.uk/26270/1/0100403.pdf Google Scholar
Sundara, M., Polka, L., & Molnar, M. (2008). Development of coronal stop perception: Bilingual infants keep pace with their monolingual peers. Cognition, 108(1), 232242. https://doi.org/https://doi.org/10.1016/j.cognition.2007.12.013 CrossRefGoogle ScholarPubMed
Szewczyk, J. M., Marecka, M., Chiat, S., & Wodniecka, Z. (2018). Nonword repetition depends on the frequency of sublexical representations at different grain sizes: Evidence from a multi-factorial analysis. Cognition, 179(June), 2336. https://doi.org/10.1016/j.cognition.2018.06.002 CrossRefGoogle ScholarPubMed
Thiessen, E. D., & Saffran, J. R. (2003). When cues collide: Use of stress and statistical cues to word boundaries by 7- to 9-month-old infants. In Developmental psychology (Vol. 39, Issue 4, pp. 706716). American Psychological Association. https://doi.org/10.1037/0012-1649.39.4.706 Google Scholar
Thordardottir, E., & Brandeker, M. (2013). The effect of bilingual exposure versus language impairment on nonword repetition and sentence imitation scores. Journal of Communication Disorders, 46(1), 1.CrossRefGoogle ScholarPubMed
Tuller, L. (2015). Clinical use of parental questionnaires in multilingual contexts. In Armon-Lotem, S., de Jong, J., & Meir, N. (Eds.), Assessing mutlilingual children (pp. 301330). Multilingual Matters.CrossRefGoogle Scholar
Unsworth, S. (2013). Current issues in multilingual first language acquisition. Annual Review of Applied Linguistics, 33, 2150. https://doi.org/DOI:10.1017/S0267190513000044 CrossRefGoogle Scholar
van Wonderen, E., & Unsworth, S. (2020). Testing the validity of the Cross-Linguistic Lexical Task as a measure of language proficiency in bilingual children. Journal of Child Language, 125. https://doi.org/DOI:10.1017/S030500092000063X Google ScholarPubMed
Verhagen, J., Boom, J., Mulder, H., de Bree, E., & Leseman, P. (2019). Reciprocal relationships between nonword repetition and vocabulary during the preschool years. In Developmental Psychology (Vol. 55, Issue 6, pp. 11251137). American Psychological Association. https://doi.org/10.1037/dev0000702 Google Scholar
Weisleder, A., & Fernald, A. (2013). Talking to children matters: Early language experience strengthens processing and builds vocabulary. Psychological Science, 24(11), 21432152. https://doi.org/10.1177/0956797613488145 CrossRefGoogle ScholarPubMed
Yan, S., & Nicoladis, E. (2009). Finding le mot juste: Differences between bilingual and monolingual children’s lexical access in comprehension and production*. Bilingualism: Language and Cognition, 12(03), 323. https://doi.org/10.1017/S1366728909990101 CrossRefGoogle Scholar
Figure 0

Table 1. Participant background information

Figure 1

Table 2. Overview of the Crosslinguistic nonword repetition task (CL-NWRT) (British English Version) (Chiat, 2015)

Figure 2

Table 3. Participant- and item-related variables for the Crosslinguistic Lexical Task (CLT) and the Crosslinguistic Nonword Repetition task (CL-NWRT)

Figure 3

Table 4. Spearman Correlation Coefficients for participant-related predictors in NWRT and CLT

Figure 4

Table 5. Spearman Correlation Coefficients for item-related predictors in the Crosslinguistic Nonword Repetition task (CL-NWRT)

Figure 5

Table 6. Mean (in %) accuracy in the Crosslinguistic Lexical Task (CLT)

Figure 6

Figure 1. Mean Proportion Accuracy on the CLT-Receptive/Expressive and Overall.

Figure 7

Table 7. Best fitting models for the Crosslinguistic Lexical Task (CLT)

Figure 8

Table 8. Descriptive statistics for the Crosslinguistic Nonword Repetition task (CL-NWRT)

Figure 9

Figure 2. Mean Proportion Accuracy on the CL-NWRT (QU = Quasi-universal, PS = prosody-specific, LS = language-specific, and overall).

Figure 10

Table 9. Best fitting models for the quasi-universal, the prosody-specific, and the language-specific sets of the Crosslinguistic NWR task