The predictors of foreign-accentedness in the home language of Polish–English bilingual children

MAGDALENA WREMBEL; MARTA MARECKA; JAKUB SZEWCZYK; AGNIESZKA OTWINOWSKA

doi:10.1017/S1366728918000044

The predictors of foreign-accentedness in the home language of Polish–English bilingual children

Published online by Cambridge University Press: 22 March 2018

and

MAGDALENA WREMBEL*: Affiliation:
Faculty of English, Adam Mickiewicz University, Poznań, Poland
MARTA MARECKA: Affiliation:
Institute of Psychology, Jagiellonian University, Kraków, Poland
JAKUB SZEWCZYK: Affiliation:
Institute of Psychology, Jagiellonian University, Kraków, Poland
AGNIESZKA OTWINOWSKA: Affiliation:
Institute of English Studies, University of Warsaw, Poland
*: Address for correspondence: Magdalena Wrembel, Faculty of English, Adam Mickiewicz University in Poznań, al. Niepodległości 4, 61–874 Poznań, Polandmagdala@wa.amu.edu.pl

Article contents

Abstract
Introduction
Aims and research questions
Phonetic analysis
Accentedness ratings
Discussion
Conclusions
Supplementary material
Footnotes
References

Rights & Permissions

Abstract

We investigated the speech patterns and accentedness of Polish–English bilingual children raised in Great Britain to verify whether their L1 Polish would be perceived as different from that of monolinguals matched for age and socioeconomic status. To this end, Polish-language speech samples of 32 bilinguals and 10 monolinguals (a 3:1 ratio, MAge = 5.79) were phonetically analysed by trained phoneticians and rated by 55 Polish raters, who assessed the degree of native accent, intelligibility, acceptability and perceived age. The results show significant differences in the phonetic performance of bilingual and monolingual children – both in terms of atypical speech patterns uncovered in the phonetic analysis and in terms of the holistic accentedness ratings. We also explored the socio-linguistic predictors of accent ratings in bilingual speech and found that the amount of L1 Polish input was the main predictor of accentedness in children's L1 Polish speech, while L2 English input was marginally significant. (149)

Keywords

foreign accent bilingual children accentedness ratings speech patterns cross-linguistic influence

Type: Research Article
Information: Bilingualism: Language and Cognition , Volume 22 , Issue 2 , March 2019 , pp. 383 - 400

DOI: https://doi.org/10.1017/S1366728918000044 [Opens in a new window]
Copyright: Copyright © Cambridge University Press 2018

Introduction

According to a popular belief, bilingual children speak both their languages with a native-like accent. Yet, research into bilingual speech production patterns has generated mixed results. Some older studies show that bilingual children are able to keep their two phonological systems apart, and that early bilinguals and monolinguals follow similar patterns of phonological development (e.g., Holm & Dodd, Reference Holm and Dodd1999; Johnson & Wilson, Reference Johnson and Wilson2002). More recent evidence, however, demonstrates differences in speech production between bilingual and monolingual children, as well as cross-linguistic interactions between language systems of bilingual speakers (e.g., En, Brebner & McCormack, Reference En, Brebner and McCormack2014; Mayr, Howells & Lewis, Reference Mayr, Howells and Lewis2014). This is in line with theoretical frameworks put forward in the area of second language acquisition (SLA) and bilingualism such as Flege's (Reference Flege and Strange1995, Reference Flege, Burmeister, Piske and Rhode2002) Speech Learning Model or Dynamic Systems Theory (Herdina & Jessner, Reference Herdina and Jessner2002; de Bot, Lowie & Verspoor, Reference de Bot, Lowie and Verspoor2007). Speech Learning Model assumes that the phonetic categories from both languages occupy the same phonological space in the bilingual mind. Similarly, the Dynamic Systems Theory points to the existence of interactions between the pertinent languages.

If both languages indeed occupy the same phonological space and interactions between languages occur, it is still not clear what the directionality and strength of these cross-linguistic interactions are. For instance, Barlow's (Reference Barlow2014) study on Spanish–English bilinguals in the USA reported influence from Spanish, the home language of bilinguals, to English, the community language. Other studies evidenced interaction in the opposite direction, i.e., from the community language to the home language of bilingual speakers (Mayr et al., Reference Mayr, Howells and Lewis2014) or even cases of first language attrition in bilinguals (Schmid, Reference Schmid2013). Still, research investigating the nature, directionality and conditioning factors of cross-linguistic interactions in early bilinguals is scarce, especially as regards perceived accent in their speech. This study aims to fill this gap by exploring the concept of native accent in the home language of Polish–English bilingual children and investigating the predictors of their performance.

Foreign accentedness ratings

The literature on accent ratings in bilinguals to date focuses primarily on foreign accentedness in their L2 speech, to the exclusion of the L1. The phenomenon of foreign accentedness comprises a range of phonetic features detectable on multiple layers of speech, such as atypical realisation of speech segments, prosodic patterns, as well as speech rate, rhythm, disfluency markers and hesitations (Southwood & Flege, Reference Southwood and Flege1999). In other words, foreign accent refers to a range of segmental and prosodic deviations from the native norms of pronunciation in a given language and thus it is difficult to assess unidimensionally.

Studying this phenomenon is nevertheless important, because any explicit or implicit judgements of people's speaking performance or pronunciation skills inherently involve rating their accentedness (for overviews see Jesney, Reference Jesney2004 and Piske, MacKay & Flege, Reference Piske, MacKay and Flege2001). Therefore, foreign accentedness ratings (FARs) have been often applied in SLA research on adult learners (e.g., Flege, Reference Flege1988; Gallardo del Puerto, Gómez Lacabex & García Lecumberrri, Reference Gallardo del Puerto, Gómez Lacabex and García Lecumberri2007; Piske et al., Reference Piske, MacKay and Flege2001). This paradigm is very popular in SLA studies also because FARs are less time-consuming to obtain than output measurements (analysing acoustic or articulatory features), and they provide a global measure of talker's phonetic performance (Schmid & Hopp, Reference Schmid and Hopp2014).

However, this method is not without problems. FARs are usually performed by native-speakers of a given language (the so-called independent raters) who are asked to evaluate on a scale the degree of a foreign accent in a set of speech samples they are presented with. The rating techniques usually entail a Likert scale or a continuous measure with two extreme categories, such as “heavy foreign accent” vs. “native-like pronunciation” (Southwood & Flege, Reference Southwood and Flege1999). One problem with this paradigm is that there is no established norm regarding the number of raters performing the assessment, and thus studies vary widely in this regard, employing from one (Snow & Hoefnagel-Höhle, Reference Snow and Hoefnagel-Hohle1977) to over two hundred raters (Anderson-Hsieh & Koehler, Reference Anderson-Hsieh and Koehler1988). Employing only few raters can bias the results, since there might be individual differences in the rater's ability to perform the FARs accurately. For instance, lower familiarity with the foreign accent often leads to harsher and more variable ratings, while the linguistic training and expertise result in more consistent ratings (Thompson, Reference Thompson1991). For this reason, employing a large number of phonetically trained raters is crucial for the reliability of FARs.

Another potential problem with the FAR paradigm is that it is influenced by the number of speech samples provided by a control group of native-speakers that are mixed randomly into the set to serve as a benchmark for non-native speakers’ accent ratings. Usually monolingual controls comprise 20% to 40% of the sample in FARs. A larger proportion of monolingual controls may lead to a more severe assessment of the bilingual sample (Flege & Fletcher, Reference Flege and Fletcher1992) and skew the results, which is why it is so important to employ an appropriately sized control group.

Yet another issue is that the techniques of eliciting speech samples are likely to influence the outcomes of the study. Delayed repetition and other controlled techniques are recommended as more reliable measures of accentedness, because such speech samples were sometimes reported as more strongly accented than free speech (e.g., Oyama, Reference Oyama1976; Thompson, Reference Thompson1991). The problem with spontaneous speech samples is that some talkers might produce lexical, morphological and syntactic errors in them and these can introduce variability in the FARs that is not associated with the phonetic performance of the talker. However, it might be argued that using spontaneous speech samples is more ecologically valid and several studies employed such a technique, eliciting samples through picture descriptions or recounting personal experiences (e.g., Elliott, Reference Elliott1995). This trade-off between the ecological validity and reliability is a problem which can be solved by combining the two methods, but studies rarely employ such a paradigm.

The final problem with FAR is that raters in a foreign accentedness study may apply a contraction bias, i.e., overestimate small differences and underestimate large ones (Southwood & Flege, Reference Southwood and Flege1999). This is a problem that is not easily solved within the paradigm, unless the FARs are accompanied by another measure of phonetic performance.

Summing up, accentedness is a complex construct, difficult to assess unidimensionally or assign to specific categories, because it depends on multiple factors and there are no physical units in which accent can be measured. It is also sensitive to biases resulting from an inappropriate number of raters, the choice of raters, poorly balanced sampling of talkers, or improper speech elicitation techniques. Even when a FAR study is carefully constructed, a constriction bias can still skew the results. Consequently, although FARs is an established and widely used measure of global phonetic performance in bilinguals, caution should be taken when drawing firm conclusions about the nativeness or non-nativeness of speech samples based on FARs alone. Ideally, FARs should be conducted on a well-balanced sample of talkers, with a large number of trained raters, and the speech samples should be eliciting using methods that guarantee both reliability and ecological validity. However, to make FARs even more reliable, the method should be supplemented with more precise output measures, such as a phonetic analysis of bilingual speech samples. This is why in this study we employ a unique paradigm that combines a carefully controlled holistic accentedness assessment with a more objective phonetic analysis performed by trained raters.

Predictors of FARs in adults

The degree of foreign accentedness in non-native speech depends on several participant-related characteristics, the most important being the age of L2 acquisition, the amount of language experience (exposure), as well as the L1 background and use (Piske et al., Reference Piske, MacKay and Flege2001). The age of acquisition or the age of arrival to an L2-speaking country (AoA) are the most established and well-researched factors influencing the perceived accentedness in speakers with immigrant background. Research on the impact of AoA on accent generally supports the claim that ‘the earlier, the better’ (e.g., Asher & Garcia, Reference Asher and Garcia1969; Suter, Reference Suter1976; Oyama, Reference Oyama1976; Flege & Fletcher, Reference Flege and Fletcher1992; Flege, Munro Munro, M. J., & MacKay & MacKay, Reference Flege, Munro and MacKay1995; Moyer, Reference Moyer1999).

Other widely explored factors are the ones related to language experience or input. These are often operationalised as the length of residence (LoR) in a particular speech community, which might be assumed to be a proxy to the amount of input received in the L2 (see Haman, Wodniecka, Marecka, Szewczyk, Białecka-Pikul, Otwinowska, Mieszkowska, Łuniewska, Kołak, Miękisz, Kacprzak, Banasik & Foryś-Nogala, under review). LoR has been found significant in some studies on foreign accent in the L2 (e.g., Asher & Garcia, Reference Asher and Garcia1969; Purcell & Suter, Reference Purcell and Suter1980; Flege & Fletcher, Reference Flege and Fletcher1992), but not others (e.g., Oyama, Reference Oyama1976; Thompson, Reference Thompson1991; Moyer, Reference Moyer1999). As indicated in further research (Flege et al., Reference Flege, Munro and MacKay1995; Meador, Flege & MacKay, Reference Meador, Flege and MacKay2000), LoR is of importance only within the first few months of L2 acquisition as L2 speech patterns tend to become fixed later on. A related factor is the amount of L1 use (Guion, Flege & Loftin, Reference Guion, Flege and Loftin2000; Yeni-Komshian, Flege & Liu, Reference Yeni-Komshian, Flege and Liu2000). Speakers using their L1 on a more regular basis display a greater degree of foreign accent in their L2.

As for the predictors of FARs in the L1 of bilingual speakers and language attriters, data are scarce. The few studies conducted on the topic show, interestingly, that LoR did not correlate with the degree of the perceived foreign accent in the speech of language attriters (Hopp & Schmid, Reference Hopp and Schmid2013; Schmid & Dusseldorp, Reference Schmid and Dusseldorp2010), and that FAR's sole predictor turned out to be the amount of L1 contact without code-switching (de Leeuw, Reference de Leeuw2009; de Leeuw, Schmid & Mennen, Reference de Leeuw, Schmid and Mennen2010). Overall, research on factors influencing FARs in the L1 of bilingual speakers seems inconclusive, which is exacerbated by the fact that such studies are relatively infrequent. However, it seems that measures related to language input might be of importance here.

Research on FARs in children

The issue of foreign accent in child bilinguals is largely under-researched. Only a few investigations involved accent ratings performed on children's speech samples. In an early study, Asher and Garcia (Reference Asher and Garcia1969) examined over 70 Spanish–English bilingual children of Cuban immigrants in the USA. The children were aged 7 to 19 and had been living in the USA for about five years. The results of the accent ratings indicated that only children with very early age of arrival (1- 6 years of age) were perceived as near-native in English.

A longitudinal study by Snow and Hoefnagel-Höhle's (Reference Snow and Hoefnagel-Hohle1977) on English acquirers of Dutch also investigated age-related differences. Accent was measured on recordings of participants imitating Dutch sounds in individual words. The first testing session, which took place six weeks after the arrival to the Netherlands, showed a paradoxical effect: late learners (teenagers and adults) outperformed early learners (children) with respect to the nativeness of their accent. Nevertheless, ten months after the arrival to the Netherlands, younger children did better than the older learners in L2 pronunciation.

Summing up, only two studies examined foreign accentedness in bilingual children's L2 and they brought mixed results. However, they point to the AoA and language experience as the possible predictors of FARs that could be further investigated. As for foreign accentedness in bilingual children's L1, to the best of our knowledge there has been no research in the area. This is a serious oversight, because investigating this topic has both theoretical and practical importance. From the theoretical standpoint, detecting foreign accent in the L1 of bilingual children would be an ultimate challenge to the “earlier is better” assumption, which strongly suggests that for children the early start is a guarantee of success in mastering two languages. A study challenging this view would thus greatly further our understanding of factors involved in the development of native-like pronunciation. From the practical standpoint, investigating the FARs in the L1 of bilingual migrant children is important, since some families re-migrate to their home countries. Upon returning to the home country, bilingual children may experience stigmatisation and educational setbacks on account of their inferior knowledge of L1 and foreign accent (H. Grzymała-Moszczyńska, J. Grzymała-Moszczyńska, Durlik & Szydłowska, Reference Grzymała-Moszczyńska, Grzymała-Moszczyńska, Durlik and Szydłowska2015). Thus, a FAR study conducted on the L1 of bilingual children would both provide valuable insights to the field of bilingualism and be of high practical relevance.

Aims and research questions

Our study aims to fill the existing gap in research by investigating the degree of accent, acceptability, intelligibility and the perceived age in bilingual children's L1 speech samples. It also examines speaker- and rater-related predictors of foreign accentedness in the L1. To make our research more practically relevant, the accentedness ratings in our study were conducted by teachers or teacher trainees, since this group is likely to have contact with bilingual migrant children returning to their home countries and their perception of children's speech might influence the treatment of these children in educational settings. Apart from investigating an under-researched topic (the degree of accent in the L1) in an under-studied group (i.e., bilingual children), we also aimed to set a new standard for conducting FARs in a more reliable way. This was done by employing a large number of raters and by combining two data elicitation techniques. In the study, we use both the samples obtained from a repetition task and spontaneous speech samples, in which children narrated a picture story. We assume that the samples elicited in the two ways may be assessed differently due to their length and varying morpho-syntactic complexity. Further, to obtain more generalizable and robust results of our research, we combine an auditory phonetic analysis performed by expert raters (Study 1) with the traditional FARs (Study 2). This approach allows us also to investigate the relationship between atypical speech patterns in the L1 of bilingual children, identified by means of the phonetic analysis, and the global accentedness measures.

The major objective of the current study was to explore whether the speech of Polish–English bilinguals is perceived as different from that of Polish monolinguals matched for age and socioeconomic status and to investigate potential sources of cross-linguistic influence (henceforth CLI), as well as socio-linguistic predictors of the perceived foreign accent in children's Polish. The following research questions were asked to address these issues:

RQ 1: Does the Polish speech of bilingual Polish–English children living in the UK differ from the speech of Polish monolinguals? If so, in what aspects?
RQ 2: How are the degree of native accent, intelligibility, acceptability and perceived age related in the performed ratings?
RQ 3: How do accentedness ratings relate to a more objective measure of phonological performance, i.e., a detailed phonetic analysis of bilingual children's speech?
RQ 4: What background factors contribute to the perceived foreign accent in bilingual participants?

Our predictions about the potential areas of CLI in both studies were based on the existing phonological differences between Polish and EnglishFootnote ¹. Polish is characterised by a relatively large repertoire of consonants and rich phonotactics. It allows for consonant clusters in all word positions that are more complex than those occurring in English. On the other hand, English has a broader vocalic inventory and a phonemic vowel length distinction, which is absent in Polish. There are also cross-linguistic differences concerning the realisation of laryngeal contrasts, vowel reduction in unstressed syllables, predominant word stress patterns and the rhythmical structure of both languages e.g., Jassem (Reference Jassem2003), Dziubalska-Kołaczyk and Walczak (Reference Dziubalska-Kołaczyk, Walczak, Delcourt and van Sterkenburg2011), Roach (Reference Roach2009), or Cruttenden (Reference Cruttenden2014), (see Footnote 1 for a more detailed discussion).

The differences between the children's two languages, as described above, led us to predict that transfer might occur in a number of the areas enumerated, particularly in the production of Polish consonants and consonant clusters. We also assumed that the amount of input received by the children in both languages would affect the direction of the influence, as suggested by previous research on FARs in bilingual speakers.

Participants

In order to avoid terminological confusion, in the following descriptions we use the term “Talkers” with reference to the children who provided the speech samples for the study. The same Talkers were used in Study 1 and Study 2 and they are presented below. The term “Raters” will be used with reference to the adults who assessed the speech samples, both in the auditory phonetic analysis (Study 1) and in the accentedness ratings (Study 2). Since different Raters were used in Study 1 and Study 2, their profiles will be presented in the respective sections describing both studies.

Talkers

The Talkers' speech samples used for this study come from a large-scale Polish project on linguistic and cognitive development of children conducted within the European COST Action IS0804. The database contains speech recordings and data from 173 bilingual children living in the UK who had at least one Polish parent, and 311 Polish monolingual children. A written parental consent was obtained for all the children participating in the research before they completed a large battery of language and cognitive tests. For the purpose of this study, 42 child Talkers were chosen for the analyses, including 32 Polish–English bilinguals and 10 Polish monolinguals, who served as controls. The samples were chosen based on the quality of the recordings, and they were representative for the bilingual and monolingual populations, respectively (for details of the selection procedure, see Marecka, Wrembel, Zembrzuski & Otwinowska-Kasztelanic, Reference Marecka, Wrembel, Zembrzuski and Otwinowska-Kasztelanic2015). The relatively small number of monolingual controls to bilingual participants is typical in FAR studies. As already mentioned, monolingual controls usually constitute 20% to 40% of the sample since a larger proportion might skew the results (see Flege and Fletcher, Reference Flege and Fletcher1992; Jesney, Reference Jesney2004). Following this proportion, we included 24% of monolingual controls in our sample. Since the design assumed having the same Talkers in Study 1 and 2, the number of children and, consequently, the 3:1 bilingual to monolingual ratio was constant across the studies. The bilingual children were recorded in the UK (London and Cambridge), while the monolingual children were recorded in Poland (Warsaw and Kraków). The monolingual controls can be treated as a homogenous group, since they spoke the Polish standard variety and did not exhibit any dialectal features. There is no significant variation in the monolingual Polish regiolects spoken in the major Polish cities due to the convergence of dialects and regional varieties in Polish urban areas (e.g., Kurkowska, Reference Kurkowska and Kurkowska1981; Wilkoń, Reference Wilkoń2000; Dubisz, Reference Dubisz2013). The convergence is motivated by the fact that standard Polish enjoys a high prestige, is spoken by the society at large and taught at schools, and is widely present in the mediaFootnote ².

The background information about the children came from a questionnaire (PABIQ by Tuller, Reference Tuller, Armon-Lotem, De Jong and Meir2015 and its Polish adaptation by Kuś, Otwinowska, Banasik & Kiebzak-Mandera, Reference Kuś, Otwinowska, Banasik and Kiebzak-Mandera2012). The 32 bilinguals (20 females), whose L1 (home language) was Polish and L2 (community language) was English, were children of Polish migrants to the UK. Their mean age was 5.79 (SD = 0.64, range: 4.82 - 6.98), they all had been exposed to English before the age of three and all had at least one Polish parent. As far as their language use is concerned, they spoke Polish at home, but English at kindergarten or school. The socioeconomic status (SES) of children's families was measured based on the parents’ level of education. Their mothers had on average 16.03 years of formal education (SD = 3.09), their fathers 14.25 years (SD = 2.84). The parents of the migrant children originated from different regions of Poland; however, since they were well educated and mostly came from large cities, they were unlikely to display dialectal differences (see Footnote 2). The 10 Polish monolingual children (8 female), who served as controls, lived in Poland and their mean age was 5.98 (SD = 0.47, range: 5.18 – 6.81). Their mothers had on average 18 years of formal education (SD = 2.58), and fathers - 19.75 years (SD = 2.22).

Study 1: Phonetic analysis

In Study 1 the speech of monolinguals and bilinguals was analysed by trained phonetic Raters, who performed a detailed analysis of the recordings, pointing to specific features observed in the speech of bilingual children. The study follows the design of Marecka et al. (Reference Marecka, Wrembel, Zembrzuski and Otwinowska-Kasztelanic2015).

Raters

The study involved 8 native Polish Raters (5 females), university students of English, mean age 22.5 (SD = 1.2). Their proficiency in English was advanced (C1 according to the CEFR – Common European Framework of Reference for Languages, Council of Europe, 2001) and they all had had previous phonetic training. The Raters came from western Poland (the Wielkopolska region) and all spoke standard Polish without any dialectal features (see Footnote 2 for more information about standard Polish and the convergence of the dialects in towns).

Materials and procedure

The speech samples of the Talkers in Study 1 come from a Polish Sentence Repetition (SRep) task (Banasik, Haman & Smoczyńska, Reference Banasik, Haman and Smoczyńska2012). This task originated from a study initially designed to test morpho-syntax, but it was used in this research because it offered comparable phonological output across the Talkers. In the SRep, the Talkers were asked to repeat 68 sentences in Polish pre-recorded by two Polish native speakers. The sentences varied in length and grammatical complexity. Each sentence was presented to the participant through headphones and subsequent repetitions were recorded.

For the purpose of the auditory analysis, 14 sentences were chosen on the basis of three criteria: grammatical simplicity, completeness and a wide range of phonetic contexts. As for the grammatical simplicity and completeness, we excluded the sentences that contained complex grammatical structures such as passive voice, relative clauses, etc., because not all the children had been able to repeat them. We further excluded the sentences that had been omitted or produced incompletely by the Talkers. Out of the remaining set, we chose sentences that offered a wide range of phonetic contexts, including sibilant sounds, complex consonant clusters, nasal glides and plosives in stressed onset positions, i.e., features that are likely to cause problems for English speakers of Polish. This finally led to the selection of 14 sentences for the analysis. The order of the sentences as presented to the Raters remained the same for each child.

Prior to the analysis proper, three expert phoneticians conducted a detailed auditory phoneme-by-phoneme analysis of the speech samples from 5 bilingual children randomly chosen from the Talkers' pool. On the basis of the processes identified by these phoneticians, we created a diagnostic list of atypical speech patterns (see Table 1). It enumerated 12 problem areas found in the speech of Polish–English bilingual children that differed from monolingual production, presumably due to cross-linguistic influences (CLI). The diagnostic list constituted a list of focus areas for the subsequent auditory phonetic analysis.

Table 1. Diagnostic list of atypical speech patterns

For the analysis proper, the speech sample (14 sentences) for each bilingual or monolingual Talker was assigned a code. Then each of the eight Raters recruited for the study received a randomly selected set of bilingual and monolingual recordings to analyse auditorily. Each sample was analysed independently by two Raters, and then cross-checked by a third phonetically-trained expert Rater recruited from the authors of the study. The expert Rater made a final decision in case of discrepancies. All the Raters (including the expert Raters) were blind to the fact of which samples were monolingual and which bilingual.

The Raters’ task was to pinpoint any articulatory alterations in the child's speech samples that were reflected in the diagnostic list. The 14 sentences for each child were treated as one continuous speech sample. Each Rater received a set of transcription cards (one per child), containing the transcriptions of the 14 sentences, and was asked to underline the fragments that were mispronounced by a particular Talker. Then the Raters had to transcribe how the speech sounds were altered and to categorise each speech alternation into one of the 12 categories from the diagnostic list. We then calculated how many instances of speech alterations corresponding to particular categories occurred in the speech sample for each child. Finally, for each sample we collapsed the speech alterations enumerated in the 12 categories into one cumulative number of atypical patterns (i.e., the total of all atypical patterns from our diagnostic list per child's speech sample). All the calculations were conducted per Talker and not per item sentence, since a phonetic analysis of larger speech samples is much more reliable and informative than that pertaining to short fragments of speech. This is because the latter contains a more limited range of phonetic contexts.

Statistical analysis

Based on the Raters’ analysis, within each category from the diagnostic list we compared the number of atypical phonological patterns occurring in the speech of the bilingual Talkers with the speech patterns attested in monolingual samples. We also compared the cumulative number of atypical speech patterns between the two groups. The non-parametric Mann-Whitney U tests were used for the comparisons due to an unequal size of the groups being compared. Bonferroni corrections with the number of comparisons set to 13 (12 for all the diagnostic categories and 1 for the cumulative number of atypical speech patterns) were used to make up for a large number of comparisons. This measure was chosen as it is one of the most conservative ones and thus minimizes the risks of type I errors.

Results

Overall, the study shows that the Polish speech of bilingual children differed significantly from the speech of their monolingual peers. The Raters reported on average 5.2 alterations per speech sample in the monolingual group (SD = 3.16), as opposed to 26.44 alterations per speech sample in the bilingual group (SD = 16.93). The difference was statistically significant, as indicated by the Mann-Whitney test with the Bonferroni corrections (U = 300.5, p < .001). As for the 12 areas of atypical speech patterns enumerated in the diagnostic list, the two groups differed in terms of vowel reduction, the production of non-native consonants, consonant cluster reduction and the application of atypical VOT patterns. The differences in the number of alterations between particular categories from the diagnostic list are presented in Table 2.

Table 2. Results of phonetic analysis for monolingual vs. bilingual groups

* p < .05, ** p < .01, *** p < .001

Study 2: Accentedness ratings

While Study 1 focused on a detailed phonetic analysis, Study 2 aimed to verify whether the Polish speech of Polish–English bilingual children is perceived as different from that of their monolingual peers when rated holistically and impressionistically by teachers or teacher trainees. We also wanted to examine whether bilingual speakers could be perceived as younger or speech-delayed in comparison to monolingual speakers. To this aim, Study 2 reanalysed the SRep speech samples of the Talkers from Study 1 using the holistic FAR paradigm, as opposed to a detailed phonetic analysis. Apart from using the repeated speech samples from Study 1, Study 2 included also a second type of speech samples, i.e., spontaneous speech. The parameters assessed included the degree of Native Accent, Intelligibility, Acceptability and Perceived Age.

Further, we aimed to investigate whether the results of the holistic assessment in Study 2 were related to the more objective measures of atypical speech patterns obtained from Study 1. We also wanted to examine which category of atypical speech patterns was the most predictive of FARs. Finally, we probed the sociolinguistic predictors of foreign accentedness in bilingual speech. To establish this, Study 2 used data from the background questionnaire for each child.

Raters

Study 2 included 55 Polish Raters, in-service teachers or teacher trainees. The majority of the participants were female (52), stemming from the fact the teaching profession in Poland is dominated by women. Their mean age was 22.13 (SD = 4.69). Eight of the Raters were practicing primary-school teachers, 15 were pedagogy students, and 32 were pre-service English language teachers. With regard to their English proficiency, all the Raters but one were advanced in English (CEFR B1 or above). Moreover, 39 Raters were phonetically trained, whereas 16 Raters did not undergo any phonetic training. The Raters’ task in Study 2 was to perform accentedness ratings of L1 Polish speech samples and to assess the children's perceived age. The Raters came from the Wielkopolska and Mazowsze region, spoke Polish standard variety and did not display any dialectal features (see Footnote 2 for details).

Materials

The speech samples of the Talkers in Study 2 come from two tasks, one eliciting repeated speech, and another eliciting structured spontaneous speech. Repeated speech samples were taken from the SRep task (Banasik et al., Reference Banasik, Haman and Smoczyńska2012), as in Study 1. For the purpose of Study 2, we chose only 3 sentences from the SRep, due to required time limitations of the Accent Ratings procedure. The 3 sentences were selected from the 14 sentences that served as a basis of analysis in Study 1. Each of the sentences contained a wide range of phonetic contexts as defined in Study 1. For each child, the SRep sample consisted of the same 3 sentences, presented in exactly the same order and treated as one speech sample to be assessed holistically.

Children's spontaneous speech was elicited with the Polish version of the Multilingual Assessment Instrument for Narratives (MAIN) (Gagarina et al., 2012; Kiebzak-Mandera, Otwinowska & Białecka-Pikul, Reference Kiebzak-Mandera, Otwinowska, Białecka-Pikul, Gagarina, Klop, Kunnari, Tantele, Välimaa, Balciuniene, Bohnacker and Walters2012). In language elicitation with the MAIN, the child is presented with a specially designed, cross-culturally acceptable picture story and is asked to tell the story (narration). Then the child retells a different story after a model presented by the experimenter (re-narration). Samples from the re-narration mode were chosen for the accentedness ratings since re-narration generated more fluent speech than the narration mode. Further, re-narrations allowed us to control better for morphological, syntactic or lexical cues as to the children's native status, since the Talkers were able to base their stories on an adult model provided earlier, which facilitated the task performance. We aimed to choose the most coherent 20-second story fragments for each child to minimize Raters’ bias against children who produced less coherent stories. The 20-second fragments were obtained by removing pauses, fillers, and hesitation markers from the re-narration recordings. All the speech samples selected for the ratings (both SRep and re-narration) were equated for the overall RMS amplitude in the Audacity 2.1.2 software to ensure equal loudness (Audacity Team, 2016).

To establish the sociolinguistic and socio-demographic predictors of the bilingual Talkers' accent, we used the data from the background questionnaire (Kuś et al., Reference Kuś, Otwinowska, Banasik and Kiebzak-Mandera2012), a Polish version of the PABIQ questionnaire (Tuller, Reference Tuller, Armon-Lotem, De Jong and Meir2015). The questionnaire provided detailed information about the child's family background, language acquisition and use, language skills, as well as the quality and quantity of exposure to pertinent languages. On the basis of the questionnaire, for each bilingual child we calculated an index of Input in Polish, an index of Input in English and the Age of First Exposure to English (AoE). While the questionnaire contained information about several other factors that could potentially influence the accent of the children (such as language output etc.), only these three factors were included in our analysis, since variables associated with the age of L2 acquisition (such as Age of Arrival) and with language contact or exposure were most frequently associated with foreign accentedness in previous studies so far (see Introduction). The Input indices were calculated on the basis of the answers to the questions about the frequency with which the child was addressed in a given language by a particular person or in a particular context. For each language, the child could receive a maximum of 40 points for the home input (a maximum of 8 points each for the mother, father and siblings addressing the child exclusively in the assessed language, a maximum of 4 points for the grandparents and the potential babysitter addressing the child exclusively in the respective language, and a maximum of 8 points for the parents addressing each other exclusively in the language in the presence of the child). Additionally, the children could receive the maximum of 51 points for the outside input, depending on how often they were addressed in the language in a range of situations that were enumerated in the questionnaire (for instance, at the nursery, in conversations with friends etc.). Overall, the maximum score for each Input index was 91. The mean score for the Input in Polish for the bilingual participants was 43.17 (SD = 14.05), while for the Input in English it was 34.28 (SD = 14.06). The mean AoE was 14.07 months (SD = 15.01).

Procedure

The rating procedure had the form of an online questionnaire. The Raters' task was to assess each sample on the degree of native accent, intelligibility and acceptability as well as the perceived age of the child. We randomly divided the speech samples into two sets A and B, each consisting of 21 samples (5 monolingual and 16 bilingual). The division into two sets was introduced to optimise the time of the accentedness rating task from an hour to about 30 minutes and to avoid Raters' fatigue effect. The Raters were randomly assigned to one of the sets (A or B). For each set, an identical online questionnaire was created, consisting of two tasks. In Task 1 the Raters had to holistically assess one SRep speech sample (3 sentences) per each of the 21 children. In Task 2, the Raters were to assess 20-seconds-long fragments of the re-narration (spontaneous speech) for the same children. The order of recordings was randomised and the Raters were blind to which samples came from bilingual and which from monolingual children. They also did not know that they rated the same children in both tasks. The order of Tasks 1 and 2 in the online questionnaire was counterbalanced.

Prior to the experiment, the Raters were informed about the general aims of the study and asked to use headphones during the rating procedure. They were instructed to listen to each recording and then assess the degree of native accent in the Polish speech on a 7-point Likert scale (Native Accent: 1 - very strong foreign accent, and 7 - lack of foreign accent/sounds like a native speaker of Polish). Further, the Raters were asked to assess the intelligibility and acceptability of each recording on a 7-point Likert scale (Intelligibility: 1 - speech completely unintelligible, 7 - speech completely intelligible, Acceptability: 1 - speech completely unacceptable, 7 - speech completely acceptable), a methodology commonly used in FARs (see Piske et al., Reference Piske, MacKay and Flege2001 for an overview). Additionally, they were to assess the age of the children in years and months based on their speech performance (Perceived Age). The age assessment was conducted to test whether bilingual children are perceived as younger (and implicitly speech-delayed) when compared to their monolingual peers on the basis of their speech.

Analysis A: Reliability statistics and initial analyses

Before answering the research questions, we conducted a series of reliability statistics to check the quality of our data. First, the inter-rater agreement for the four parameters (Native Accent, Acceptability, Intelligibility and Perceived Age) was established using Cronbach's alpha and Krippendorff's alpha. The former measure is traditionally applied in accentedness rating studies, whereas the latter appears to be more robust and suitable for measuring inter-rater agreement in studies with multiple raters (see Hayes & Krippendorf, Reference Hayes and Krippendorff2007). Both analyses were conducted separately on each set (A and B) and for each Task (Narration and SRep).

Results

According to Cronbach's alpha, the inter-rater agreement was very high for all the parameters (ranging from .91 to .98). However, as indicated by Krippendorff's alpha, the assessments of the Perceived Age (ranging from .12 to .21) were much less reliable than the assessments of the remaining three parameters (ranging from .32 to .60). This result is consistent with the comments made by the Raters, who found it generally hard to assess children's age. Since we asked a research question about the relationship of Perceived Age and other rating parameters, we included Perceived Age in Analysis B conducted to answer this question. However, it was not included in any further analyses.

Analysis B: Relationships between the rating parameters

The next step following the reliability statistics was to check how the four rating parameters (Native Accent, Acceptability, Intelligibility and Perceived Age) related to each other. This was done to answer the second research question posed in the Introduction and also to establish whether the rating parameters could be collapsed into a single measure for the future analyses. To achieve this, we conducted four types of statistical analyses. First, we calculated the descriptive statistics for each parameter for the bilingual and monolingual group. Second, we conducted a series of correlation analyses between each pair of the rating parameters, separately for each group of Talkers (the monolingual group and the bilingual group), to obtain a general picture of the possible interrelations between the factors. Both of these analyses were conducted on the mean Rater assessment for each Talker and on the means between the two tasks. Third, for the most strongly correlated parameters, we created the Tukey mean-difference plots, which show the extent to which two variables might be considered essentially the same factor. Fourth, after finding out that the most important variables were indeed related, we conducted an Exploratory Factor Analysis (EFA) with the default orthogonal rotation (varimax) on the three strongly correlated parameters to obtain a single factor that could be used as the output in all the future analyses. The EFA was conducted using the factanal function in the basic stats R package.

Results

Table 3 presents the mean scores for Native Accent, Acceptability, Intelligibility and Perceived Age for the monolingual and bilingual groups. As can be seen, the scores for the first three parameters are similar within groups, ranging from 4.33 to 4.63 in the bilingual group, and from 5.62 to 6.07 in the monolingual group.

Table 3. Accentedness ratings – descriptive statistics

Table 4 presents the correlations between the rating parameters for the two participant groups. In the bilingual group assessments, there were high correlations between Native Accent, Acceptability and Intelligibility. However, Perceived Age did not correlate significantly with any other parameter. Also, in the monolingual group assessments, there was a high correlation between Native Accent, Acceptability and Intelligibility; however, in this group, also Perceived Age correlated with the three basic parameters. In other words, the three basic rating parameters, Native Accent, Intelligibility and Acceptability were highly correlated for both the monolingual and bilingual group, but Perceived Age was related to the other three parameters only in the monolingual group.

Table 4. Accentedness ratings – correlations between rating parameters

* p < .05, ** p < .01, *** p < .001

This relationship between Native Accent, Intelligibility and Acceptability is also visible in the Tukey mean-difference plots provided in Figures 1 and 2. Figure 1 shows the plots for the bilingual group, while Figure 2 for the monolingual group. The Tukey mean-difference graphs plot the mean of each pair of measurements (x axis) against the differences between the measurement (y axis). The middle horizontal line on the plot represents the mean differences, while the upper and lower lines represent the cut-off point of 2SD below and above the mean. If two variables are related, the pairs should cluster around the mean and stay within the cut-off points. This is the case for all the presented plots, showing, once again that the three parameters are closely related.

Figure 1. Tukey mean-difference plots for the bilingual group.

Figure 2. Tukey mean-difference plots for the monolingual group.

An Exploratory Factor Analysis (EFA) with the orthogonal rotation (varimax) conducted on the three most important parameters (Native Accent, Intelligibility and Acceptability) confirmed that these measures were closely related. The EFA rendered only one factor, which was heavily loaded by all three parameters, and which explained 75.9% of the variation (see Table 5). The resulting factor was called Holistic Accent Assessment (HAA), and it was used in all the subsequent regression models as the outcome variable.

Table 5. Exploratory Factor Analysis on the four rating parameters

Analysis C: Bilingual vs. monolingual Talkers

The aim of this analysis was to compare the bilingual and monolingual groups on Holistic Accent Assessment, while controlling for such factors as Task and Rater. To this end we ran a linear mixed effects model, which allowed us to control the individual variance introduced by each Rater and Talker and thus to minimize the type I error. The analysis was conducted with the lmer function in lme4 package (Bates, Maechler, Bolker & Walker, Reference Bates, Maechler, Bolker and Walker2015) with Satterthwaite approximation for p values implemented in the lmerTest package (Kuznetsova, Brockhoff & Christensen, Reference Kuznetsova, Brockhoff and Christensen2015). The fixed effects entered into the model were: Talker group (bilingual vs. monolingual), Task (re-narration vs. SRep), Rater training (trained vs. untrained), an interaction of Talker group and Task, Talker group and Rater training, Rater training and Task, as well as the three-way interaction of Talker group, Rater training and Task. Rater group was not included in the analyses as it closely correlated with phonetic training. We also did not include Age in the model, as the bilingual and monolingual groups were matched for age. The random effect structure in the model was initially maximal (see Barr, Levy, Scheepers & Tily, Reference Barr, Levy, Scheepers and Tily2013 for the discussion of the advantages of maximal random effect structure in linear mixed effects models). However, in the next step of the analysis we removed the random slopes whose variance equalled zero, as recommended by Bates, Kliegl, Vasisth, and Baayen (Reference Bates, Maechler, Bolker and Walker2015). We compared the models without the zero slopes to the maximal models with the likelihood ratio tests to make sure that we had not discarded any meaningful variables. Apart from the elimination of the zero slopes in the random effects no model selection was conducted. The model met the assumptions of the linear regression such as homoscedasticity and normally distributed residuals and had low collinearity as indicated by the variance inflation factor (VIF), which had a value below 2.5. In this and all the subsequent mixed-effects analyses, all categorical predictors were sum coded, while the continuous predictors were centred on the mean values.

Results

Table 6 presents the fixed effects in the model, while Table 7 presents the random effects. The model shows that the best predictors of the Holistic Accent Assessment was Talker group and (marginally) Task. Children's assessments were higher in the monolingual group and on the SRep task. There were no consistent effects of Rater training and no significant interactions.

Table 6. The model predicting Holistic Accent Assessment with Talker group, Task, Rater training and the interactions: fixed effects.

. p < .1, *p < .05, ** p < .01, *** p < .001

Table 7. The model predicting Holistic Accent Assessment with Talker group, Task, Rater training and the interactions: random effects

Analysis D: Phonetic predictors of accentedness in bilingual children

In this analysis, we aimed to answer the third research question, i.e., to assess whether accentedness ratings relate to the detailed phonetic analysis of bilingual children's speech. More specifically, we asked whether the number of atypical speech patterns identified in the phonetic analysis in Study 1 predicted the Holistic Accent Assessment, and we tested which types of atypical speech patterns were the best predictors of this variable. To this end, we ran a linear mixed effect model with the Holistic Accent Assessment as the outcome variable and the categories of atypical speech patterns from Study 1 as predictors. Because entering all L2 categories into the model would likely result in overfitting, we collapsed them into three groups: 1) Vowels – the number of atypical speech patterns related to vowels, 2) Consonants – the number of atypical speech patterns related to consonants, and 3) Prosody – the number of atypical speech patterns related to prosody (see Table 1 for the exact categories within each group). These three groups were entered into the model as fixed effects. At the beginning, we created a model with three fixed effects (Prosody, Consonants and Vowels) and maximal random effect structure. The model showed a degree of collinearity with VIF greater than 2.5. We therefore ran a stepwise backwards regression, eliminating the weakest fixed effects one by one and comparing them by looking at their Akaike Information Criterion (AIC) and the likelihood ratio tests. The new model, which had the lowest AIC of all the ones tested, had only one fixed effect: Prosody. Obviously, it had no collinearity and met other assumptions of linear regression such as homoscedasticity and normally distributed residuals. Following the regression, we removed random slopes with variance that were close to zero, so the final model is an intercept-only model. The likelihood ratio test did not show significant differences between the model with and without the random slopes. Below we report only the final model.

Results

Table 8 presents the fixed effects of the final model we obtained, while Table 9 presents the random effects. As evident from Table 8, there was a relationship between the atypical speech patterns identified in Study 1 and the holistic assessment conducted in Study 2. Specifically, the patterns related to prosody significantly predicted the Holistic Accent Assessment, i.e., the fewer atypical speech patterns related to prosody, the higher was the Holistic Accent Assessment.

Table 8. The model predicting Holistic Accent Assessment with phonetic variables: fixed effects

p < .1, * p < .05, ** p < .01, *** p < .001

Table 9. The model predicting Holistic Accent Assessment with phonetic variables: random effects

Analysis E: Sociolinguistic and sociodemographic predictors of accentedness

In the final analysis, we aimed to answer the fourth research question concerning the background factors that contribute to the perceived foreign accent in the bilingual participants. Thus, we created a mixed linear regression model with Holistic Accent Assessment as the outcome variable and three measures taken from the background questionnaire, i.e., Input in Polish, Input in English and AoE as the fixed effects. The Age of the child was entered as an additional fixed effect, as this was the most probable confounding variable. The model did not show collinearity (VIF below 2.5) and met other assumptions of the regression. As previously, we removed the random slopes whose variance equalled zero from the model and compared the models with and without the zero slopes with likelihood ratio tests. There was difference between the models, so the model without the zero slopes was selected. As a result of removing the zero slopes, the model contains only random intercepts.

Results

The resulting model is presented in Tables 10 (fixed effects) and 11 (random effects). As can be seen from Table 10, Input in Polish was the strongest predictor of Holistic Accent Assessment. The more Input in Polish our participants received, the better their Holistic Accent Assessment was. There was also a marginal effect of Input in English. Children who received more of such input, performed worse in terms of Holistic Accent Assessment.

Table 10. The model predicting Holistic Accent Assessment with background (sociolinguistic) variables: random effects: fixed effects

p < .1, * p < .05, ** p < .01, *** p < .001

Table 11. The model predicting Holistic Accent Assessment with background (sociolinguistic) variables: random effects: random effects

Discussion

Studies of phonological profiles of bilingual children are scarce (see Genesee & Nicoladis, Reference Genesee, Nicoladis, Hoff and Shatz2009). To date, only two investigations involving Accent Ratings have been performed on children (cf. Asher & Garcia, Reference Asher and Garcia1969; Snow & Hoefnagel-Höhle, Reference Snow and Hoefnagel-Hohle1977): none of them, however, explored accentedness in bilingual children's home language. This study on the Polish–English bilingual children aimed to fill the gap. First, we aimed to explore whether bilingual children's oral production is perceived as different from that of their monolingual peers. Second, we intended to investigate the phonological and sociolinguistic predictors of the perceived foreign accent.

Great care was taken to overcome methodological issues that affect the validity of Accent Ratings (see Piske et al., Reference Piske, MacKay and Flege2001; Schmid & Hopp, Reference Schmid and Hopp2014). First, possible rater effects were minimized, by recruiting a large number of raters who were proficient in English and were familiar with English-accented speech. We also controlled for the effect of raters in the mixed effect models we created. Second, while previous Accent Ratings studies varied considerably with respect to speech sample elicitation (e.g., Bongaerts, van Summeren, Planken & Schils, Reference Bongaerts, van Summeren, Planken and Schils1997; Elliott, Reference Elliott1995; Flege et al., Reference Flege, Munro and MacKay1995), we employed two elicitation techniques, namely sentence repetition (a more controlled and restricted task) and re-narration (a guided elicitation task), to offer a more reliable measure of the speakers’ phonetic performance. Third, we juxtaposed Accent Ratings with a detailed acoustic analysis performed by phonetically-trained raters. In this way, our procedures provided both detailed and holistic assessments of children's pronunciation in all aspects of speech, i.e., segmental, suprasegmental and extralinguistic, which added to the robustness of the results.

The first question we asked concerned the differences in the pronunciation of L1 Polish between the bilingual and the monolingual children. The results obtained in both Study 1 and Study 2 point to such differences. Study 1, where precise auditory analysis was performed, demonstrated that bilingual productions in Polish were characterised by a higher number of speech alterations, driven by CLI from L2 English (community language) to L1 Polish (home language). The alternations included atypical productions of Polish consonants, the reductions of consonant clusters that do not comply with English phonotactics, the productions of VOT values typical for English rather than Polish, and the application of vowel reduction (which is typical for English, but not for Polish speech). Similarly, in Study 2 the holistic assessment ratings performed were still significantly higher, i.e., better for the monolingual control group. This effect was equally strong for both tasks used (SRep and Re-narration), even though Re-narration was generally assessed more severely by the raters. This indicates that even if there were any lexical, syntactic and morphological patterns that differentiated monolingual and bilingual speech, they had no bearing on the assessment of accent. This finding suggests that the bilingual children were perceived by the Polish native raters as less native-like, less intelligible and less acceptable in their L1 than their Polish monolingual peers, despite having acquired Polish from birth. These results run counter to a popular belief that early bilinguals can speak both languages without a foreign accent because they keep the two phonological systems apart. Instead they support recent findings (e.g., En et al., Reference En, Brebner and McCormack2014; Mayr et al., Reference Mayr, Howells and Lewis2014; Marecka et al., Reference Marecka, Wrembel, Zembrzuski and Otwinowska-Kasztelanic2015; Wrembel, Reference Wrembel2015), suggesting that the two languages interact in the children's phonological system. This interaction can be explained by Flege's (Reference Flege and Strange1995, Reference Flege, Burmeister, Piske and Rhode2002) Speech Learning Model, which assumes that the phonetic categories from both languages in the bilingual mind occupy the same phonological space. It also conforms to the Dynamic Systems Theory of language acquisition stating that all the languages in the speaker's mind constantly influence each other (e.g., de Bot et al., Reference de Bot, Lowie and Verspoor2007).

Further, the results demonstrate that early bilingualism and extensive exposure to two languages do not guarantee a fully native-like performance in both language systems in the sphere of phonetics and phonology. Although Polish was chronologically the first language acquired by all the bilingual children in the study, in the migrant context it was influenced by English, the community language, to such an extent that the L1 was perceived as foreign-accented. This calls into question the often quoted ‘the earlier, the better’ assumption (e.g., Asher & Garcia, Reference Asher and Garcia1969; Suter, Reference Suter1976; Oyama, Reference Oyama1976; Flege & Fletcher, Reference Flege and Fletcher1992, Flege et al., Reference Flege, Munro and MacKay1995; Moyer, Reference Moyer1999), and indicates that the age factor is probably not the sole determiner of nativeness in the oral performance.

To answer our second research question, we investigated whether and how the parameters of Native Accent, Intelligibility, Acceptability and Perceived Age were related in the performed ratings. Although the Perceived Age measure had not been used in previous Accent Ratings studies, we conceptualized the assessment of age as an indicator of potential perceived delay in speech development. We hypothesized that bilingual children might be assessed as younger than monolinguals, due to the atypical speech patterns they exhibit. Thus, our raters were to identify the participants’ age based on their speech performance. However, it appears that raters had difficulties in assessing the talkers’ age, as evidenced in the very low interrater agreement with respect to this parameter (cf. Krippendorff's alpha scores). Moreover, Perceived Age was not related to the other three parameters in the bilingual group, which indicates that it was not influenced by the degree of foreign accent or speech intelligibility. Interestingly, the perceived age correlated with the Native Accent, Intelligibility and Acceptability in the case of the monolinguals, which suggests that FARs in this group were more related to the developmental patterns in the children's speech. When it comes to Native Accent, Intelligibility and Acceptability, we have found that these three parameters were tightly intertwined in both the bilingual and monolingual groups of participants. Since it was difficult to tease them apart, we concluded that they reflect the same concept. Thus, we obtained one global category of accentedness, the so-called Holistic Accent Assessment, as a consistent measure of perceived pronunciation performance.

Our third research question enquired how accentedness measures relate to the phonetic analysis of bilingual children's speech. To this end, we tested which groups of phonetic features (i.e., atypical speech patterns connected with the production of vowels, consonants, and prosody) had the greatest influence on the accentedness scores. The atypical prosody was the best predictor of Holistic Accent Assessment. In other words, children who had problems with retaining the syllabic structure of the words or applying accurate stress patterns were more likely to be perceived as non-native, unintelligible and unacceptable. This result is consistent with previous studies, which found that inserting non-native prosodic features into speech samples influences FARs conducted on these samples (de Mareüil & Vieru-Dimulescu, Reference de Mareüil and Vieru-Dimulescu2006; Liu & Lee, Reference Liu and Lee2012). At the same time, previous research indicates that the presence of non-native segmental features in speech samples is a stronger predictor of foreign accentedness than the presence of non-native suprasegmental features (Liu & Lee, Reference Liu and Lee2012; Lee, Reference Lee2014), while in our study this was not the case. This discrepancy might stem from the fact that our study was conducted on naturalistic data, while the studies by Lee and Liu utilised synthesised speech samples, which could have exaggerated the foreignness of segmental features.

Finally, we enquired what sociolinguistic factors contribute to the perceived foreign accent in bilingual children's speech. The results demonstrate that the quality and quantity of input in Polish was the strongest predictor of the Holistic Accent Assessment. In other words, the more Polish input children had received, the better their FARs were. There was also a marginally significant negative effect of input in English on the Holistic Accent Assessment. This means that the more input the bilingual children received in the community language, the lower was their assessment in the home language. At the same time, neither age nor AoE was related to children's FARs. The predictors of FARs in our study correspond to those evidenced by Flege et al. (Reference Flege, Munro and MacKay1995) and Piske et al. (Reference Piske, MacKay and Flege2001), who focused on the impact of the L1/L2 input on the nativeness of L2 accent.

Summing up, in contrast to previous studies, which accounted for the influence of L1 on the performance in L2, we investigated the impact of L1 and L2 exposure on the accent in L1. We have demonstrated that, under specific circumstances of bilingual acquisition, children's L1 (home language) can be phonetically influenced by their L2 (prestigious community language). Thus, our results evidence a different direction of phonetic cross-linguistic influence from that usually presented in SLA studies. Obviously, the impact of language use and status on bilingual children's speech production still requires further investigation. It is necessary to determine more precisely which conditions result in a phonetic drift towards the ambient language norms in bilinguals' speech, because, as demonstrated by Sancier and Fowler (1997), even a temporary change in language use patterns and input conditions may significantly influence bilinguals’ sound production in their languages. Still, we believe that the procedures applied in our study, which provided both detailed and holistic assessment of children's pronunciation skills, allowed us to accurately examine the L1 speech of early bilingual children. Our future research will involve a parallel investigation into the English language of the Polish–English bilingual children to establish the differences between the speech patterns observed in L1 and L2 in this population.

Conclusions

The major objective of the present study was to investigate the phonetic performance in the L1 speech of Polish–English bilingual children living in Great Britain. Crucially, we have demonstrated that the L2 of early bilinguals exerts cross-linguistic influence on the L1 in the area of phonology. Thus, Polish–English bilingual children are perceived as foreign-accented in their L1 Polish and as less intelligible and less acceptable in their L1 than their Polish monolingual peers, despite having acquired Polish from birth. Further, those children who received the smallest amount of Polish input were most severely assessed. This points to the importance of substantial amount and quality of input in the home language if the bilingual child is to develop native-like phonology in the L1.

The novelty of our contribution is threefold. Firstly, to the best of our knowledge, no previous research examined the global perceived accent in the L1 of child bilinguals. Secondly, our study focused on Polish, a language rarely investigated in SLA studies. Thirdly, our design combined accentedness ratings with a detailed phonetic analysis and an examination of the participant background factors and rater effects. Thus, this study bridges the existing gap in the literature and lays foundations for further investigations of accent in young bilinguals.

Supplementary material

To view supplementary material for this article, please visit https://doi.org/10.1017/S1366728918000044.

Footnotes

*The data for this paper come from the Bi-SLI-Poland project Cognitive and language development of Polish bilingual children at the school entrance age – risks and opportunities conducted within the European COST Action IS0804 and carried out at the Faculty of Psychology, University of Warsaw, Poland in collaboration with Institute of Psychology, Jagiellonian University, Poland. The project was supported by the Polish Ministry of Science and Higher Education /National Science Centre (Decision 809/N-COST/2010/0). Data collection and coding were also partly supported by Foundation for Polish Science subsidy to Zofia Wodniecka.

The current project was supported by the Polish Ministry of Science and Higher Education grant (Decision 0094/NPRH3/H12/82/2014) Phonological and Morpho-syntactic Features of Language and Discourse of Polish Children Raised Bilingually in Migrant Communities in Great Britain carried out at the Faculty of Modern Languages, University of Warsaw, Poland. The project was related to the European COST Action IS1306.

Supplementary material can be found online at https://doi.org/10.1017/S1366728918000044

¹ Polish is characterised by a relatively large system of consonants, with a series of sibilants including post-dental /t͡s/ /d͡z/, alveolar-palatal /ɕ/, /ʑ/, /t͡ɕ/, /d͡ʑ/, and post-alveolar retroflex /ʂ/, /ʐ/, /t͡ʂ/ or /d͡ʐ/. The respective consonantal repertoire in English is more limited in this respect, yet on the other hand, it features interdental fricatives that Polish lacks. Further, stressed onset plosives are realised in English with long-lag voice onset time (English being an aspirating language), whereas Polish (the so-called voicing language) has shorter VOT values for voiceless plosives and pre-voicing in the voiced ones. The voiced vs. voiceless opposition in Polish obstruents is neutralised in word-final positions, unlike in English. Polish has a very rich phonotactics, as complex clusters of consonants are allowed in all word positions, while the English phonotactics is more restricted. In turn, Polish has a more limited inventory of vowels (a prototypical system of 6 oral vowels plus 2 nasal vowels), as compared to a more complex English vowel system (with 12 oral vowels and 8 diphthongs) and a phonemic vowel length distinction, which is absent in Polish. In English, there is a reduction to schwa in unstressed vowels, whereas Polish has only vowel elision in casual speech. In terms of the rhythmical structure, English is classified as a typical stress-timed language while Polish escapes the dichotomous categorization and exhibits properties of both syllable- and stress-timing. The predominant stress pattern in Polish is fixed and falls on the penultimate syllable, whereas in English the lexical stress is free. For a more detailed discussion of the respective phonological systems of Polish and English see e.g., Jassem (Reference Jassem2003), Dziubalska-Kołaczyk and Walczak (Reference Dziubalska-Kołaczyk, Walczak, Delcourt and van Sterkenburg2011), Roach (Reference Roach2009), or Cruttenden (Reference Cruttenden2014).

² We can confidently state that the monolingual Polish children from the control group, who came from two major cities: Warsaw (the centre of Poland) and Kraków (the south of Poland), speak one standard Polish variety, rather than two phonologically and grammatically different dialects. The same can be stated about the raters from Warsaw and Poznań, where the differences barely exist among the educated population of Polish speakers. Although distinct dialectal features can be found in eastern borderland, upper Silesia, the Masurian region, and in the Podhale, south of Kraków, there is no significant variation in the monolingual Polish regiolects spoken in the major Polish cities. This is due to the convergence of dialects and regional varieties in Polish urban areas (see Kurkowska, Reference Kurkowska and Kurkowska1981; Wilkoń, Reference Wilkoń2000; Dubisz, Reference Dubisz2013), where the dialectal features are no longer used (Karpowicz, Reference Karpowicz2009). Nowadays, the urban varieties of Polish lack regular phonological oppositions and only exhibit slight lexicalised phonetic features and some potential differences in the intonation patterns (Karaś, Reference Karaś2008; Wilkoń, Reference Wilkoń2000). There are two reasons for the present situation. First, throughout the history, the Polish-speaking area has been much more homogenous and uniform than other European countries such as Germany, France or Great Britain (see e.g., Jahr & Janicki, Reference Jahr and Janicki2009, Dubisz, Reference Dubisz2013). Second, in the post-war history, migration and urbanization motivated the linguistic integration of Polish, particularly visible in large cities such as Warsaw, Poznań, Kraków. This led to the disappearance of earlier regional dialects (mazowiecki - Masovian, małopolski – Smaller Polish, wielkopolski - Greater Polish) and a convergence towards the standard Polish variety. The completion of the process and the widespread use of standard Polish took place in the 1970s and 1980s (Dubisz, Reference Dubisz2013). The convergence of dialects further is motivated by the fact that standard Polish, spoken by the society at large, taught at schools and present in the media, enjoys an extremely high social status. Polish is thus a prototypically focused language, as opposed to some Germanic languages, where dialects enjoy an equally high status (Jahr & Janicki, Reference Jahr and Janicki2009).

References

Audacity Team (2016) Audacity®. Version 2.1.2. Available from: http://audacityteam.org/Google Scholar

Anderson-Hsieh, J., & Koehler, K. (1988). The effect of foreign accent and speaking rate on native speaker comprehension. Language Learning, 38, 561–613.Google Scholar

Asher, J. J., & Garcia, R. (1969). The optimal age to learn a foreign language. The Modern Language Journal, 53, 334–341.Google Scholar

Banasik, N., Haman, E., & Smoczyńska, M. (2012). Sentence Repetition Task. Unpublished material, University of Warsaw.Google Scholar

Barlow, J. A. (2014). Age of acquisition and allophony in Spanish–English bilinguals. Frontiers in Psychology, 5, 288.Google Scholar

Barr, D. J., Levy, R., Scheepers, C., & Tily, H. J. (2013). Random effects structure for confirmatory hypothesis testing: Keep it maximal. Journal of Memory and Language, 68 (3), 255–278.Google Scholar

Bates, D., Maechler, M., Bolker, B., & Walker, S. (2015). Fitting Linear Mixed-Effects Models Using lme4. Journal of Statistical Software, 67 (1), 1–48.Google Scholar

Bates, D., Kliegl, R., Vasishth, S., & Baayen, H. (2015). Parsimonious Mixed Models. ArXiv:1506.04967 [Stat]. Retrieved from http://arxiv.org/abs/1506.04967 Google Scholar

Bongaerts, T., van Summeren, C., Planken, B., & Schils, E. (1997). Age and ultimate attainment in the pronunciation of a foreign language. Studies in Second Language Acquisition, 19, 447–465.Google Scholar

Council of Europe. (2001). Common European Framework of Reference for Languages: Learning, teaching, assessment. Cambridge: Cambridge University Press.Google Scholar

Cruttenden, A. (2014). Gimson's Pronunciation of English (8^th edition). New York: Routledge.Google Scholar

de Bot, K., Lowie, W., & Verspoor, M. (2007). A Dynamic Systems Theory approach to second language acquisition. Bilingualism: Language and Cognition, 10, 7–21.Google Scholar

de Leeuw, E. (2009). When your native language sounds foreign: A phonetic investigation into first language attrition. Unpublished Ph.D. thesis. Queen Margaret University, Edinburgh.Google Scholar

de Leeuw, E., Schmid, M., & Mennen, I. (2010). The effects of contact on native language pronunciation in an L2 migrant setting. Bilingualism: Language and Cognition, 13, 33–40.Google Scholar

de Mareüil, P. B., & Vieru-Dimulescu, B. (2006). The Contribution of Prosody to the Perception of Foreign Accent. Phonetica, 63 (4), 247–267.Google Scholar

Dubisz, S. (2013). Integracja językowa w dziejach polszczyzny. Poznańskie Studia Polonistyczne. Seria Językoznawcza, 20, 85–97.Google Scholar

Dziubalska-Kołaczyk, K., & Walczak, B. (2011). Polish. In: Delcourt, C.; van Sterkenburg, P. (eds.) The languages of the 27. Bruxelles: Fondation universitaire de Belgique, 817–840.Google Scholar

Elliott, A. R. (1995). Field independence/dependence, hemispheric specialization, and attitude in relation to pronunciation accuracy in Spanish as a foreign language. The Modern Language Journal, 79, 351–371.Google Scholar

En, L. G. W., Brebner, C., & McCormack, P. (2014). A preliminary report on the English phonology of typically developing English-Mandarin bilingual preschool Singaporean children. International Journal of Language & Communication Disorders, 49, 317–332.Google Scholar

Flege, J. E. (1988). Factors affecting degree of perceived foreign accent in English sentences. Journal of the Acoustical Society of America, 84, 70–79.Google Scholar

Flege, J. E. (1995). Second-language Speech Learning: Theory, Findings, and Problems. In Strange, W. (ed.) Speech Perception and Linguistic Experience: Issues in Cross-language research, pp. 229–273. Timonium, MD: York Press.Google Scholar

Flege, J. E. (2002). Interactions between the native and second-language phonetic systems. In Burmeister, P., Piske, T., & Rhode, A. (eds.), An Integrated View of Language Development Papers in Honor of Henning Wode, pp. 217–244. Trier: Wissenshaftlicher Verlag.Google Scholar

Flege, J.E., & Fletcher, K. L. (1992). Talker and listener effects on degree of perceived foreign accent. Journal of the Acoustical Society of America, 91, 370–389.Google Scholar

Flege, J. E., & MacKay, I. R. A. (2011). What accounts for “age” effects on overall degree of foreign accent? In Wrembel, M., Kul, M. and Dziubalska-Kołaczyk, K., (eds.) Achievements and perspectives in the acquisition of second language speech: New Sounds 2010, pp. 65–82, Vol. 2. Bern, Switzerland: Peter Lang.Google Scholar

Flege, J. E., Munro, M. J., & MacKay, I. R. A. (1995). Factors affecting strength of perceived foreign accent in a second language. Journal of the Acoustical Society of America, 97, 3125–3134.Google Scholar

Gallardo del Puerto, F., Gómez Lacabex, E., & García Lecumberri, M. L. (2007). The assessment of foreign accent by native and non-native judges. PTLC Proceedings, London, CD-ROM.Google Scholar

Genesee, F., & Nicoladis, E. (2009). Bilingual first language acquisition. In: Hoff, E., Shatz, M. (eds) Blackwell Handbook of Language Development, Oxford: Blackwell Publishing, 324–342.Google Scholar

Grzymała-Moszczyńska, H., Grzymała-Moszczyńska, J., Durlik, J., & Szydłowska, P. (2015). (Nie)łatwe powroty do domu? Funkcjonowanie dzieci i młodzieży powracających z emigracji. Warsaw, Poland: Fundacja Centrum im. prof. Bronisława Geremka.Google Scholar

Guion, S. G., Flege, J. E., & Loftin, J. D. (2000). The effect of L1 use on pronunciation in Quichua-Spanish bilinguals. Journal of Phonetics, 28, 27–42.Google Scholar

Haman, E., Wodniecka, Z., Marecka, M., Szewczyk, J., Białecka-Pikul, M., Otwinowska, A., Mieszkowska, K., Łuniewska, M., Kołak, J., Miękisz, A., Kacprzak, A., Banasik, N., & Foryś-Nogala, M. (under review). How does L1 and L2 exposure impact L1 performance in bilingual children? Evidence from Polish–English migrants to the UK. Frontiers in Psychology.Google Scholar

Hayes, A. F., & Krippendorff, K. (2007). Answering the call for a standard reliability measure for coding data. Communication Methods and Measures, 1, 77–89.Google Scholar

Herdina, P., & Jessner, U. (2002). A Dynamic Model of Multilingualism: Perspectives of Change in Psycholinguistics. Clevedon: Multilingual Matters Ltd.Google Scholar

Holm, A., & Dodd, B. (1999). A longitudinal study of the phonological development of two Cantonese–English bilingual children. Applied Psycholinguistics, 20, 349–376.Google Scholar

Hopp, H., & Schmid, M.S. (2013). Perceived foreign accent in first language attrition and second language acquisition: The impact of age of acquisition and bilingualism. Applied Psycholinguistics, 34, 361–394.Google Scholar

Jahr, E. H., & Janicki, K. (2009). The function of the Standard variety: a contrastive study of Norwegian and Polish. International Journal of the Sociology of Language, 115, 25–45.Google Scholar

Jassem, W. (2003). Polish. Journal of the International Phonetic Association, 33, 103–108.Google Scholar

Jesney, K. (2004). The use of global foreign accent rating in studies of L2 acquisition. Calgary, AB: Language Research Centre, University of Calgary.Google Scholar

Johnson, C. E., & Wilson, I. L. (2002). Phonetic evidence for early language differentiation: Research issues and some preliminary data. International Journal of Bilingualism, 6, 271–289.Google Scholar

Karaś, H. (ed.) (2008). Gwary polskie. Przewodnik multimedialny. (2008). http://www.gwarypolskie.uw.edu.pl Google Scholar

Karpowicz, T. (2009). Kultura Języka Polskiego: wymowa, ortografia, interpunkcja. Warszawa: Wydawnictwo Naukowe PWN.Google Scholar

Kiebzak-Mandera, D., Otwinowska, A., & Białecka-Pikul, M. (2012). MAIN Multilingual Assessment Instrument for Narratives: Polish Version. In: Gagarina, N., Klop, D., Kunnari, S., Tantele, K., Välimaa, T., Balciuniene, I., Bohnacker, U., & Walters, J.. (eds.), ZAS Papers in Linguistics 56.Google Scholar

Kurkowska, H. (1981). Próba charakterystyki socjolingwistycznej współczesnego języka polskiego. In Kurkowska, H. (Ed.): Współczesna polszczyzna. Wybór zagadnień;. Warszawa, 30–31.Google Scholar

Kuś, K., Otwinowska, A. Banasik, N., & Kiebzak-Mandera, D. (2012). Kwestionariusz Rozwoju Językowego (Language Development Questionnaire). University of Warsaw, unpublished material.Google Scholar

Kuznetsova, A., Brockhoff, P. B., & Christensen, R. H. B. (2015). lmerTest. R package version 2.0.Google Scholar

Lee, J. K. (2014). The role of prosody in the perception of foreign accent and comprehensibility: prosody-corrected-L2 speech vs. prosody-distorted-L1 speech. Korean Journal of Linguistics.Google Scholar

Liu, X., & Lee, J.-K. (2012). The Contribution of Prosody to the Foreign Accent of Chinese Talkers‘ English Speech. Phonetics and Speech Sciences, 4 (3), 59–73.Google Scholar

Marecka, M., Wrembel, M., Zembrzuski, D., & Otwinowska-Kasztelanic, A. (2015). Do early bilinguals speak differently than their monolingual peers? Predictors of phonological performance of Polish–English bilingual children. In E. Babatsouli & D. Ingram (eds.), Proceedings of the International Symposium on Monolingual and Bilingual Speech 2015.Google Scholar

Mayr, R., Howells, G., & Lewis, R. (2014). Asymmetries in phonological development: the case of word-final cluster acquisition in Welsh-English bilingual children. Journal of Child Language, 42, 146–179.Google Scholar

Meador, D., Flege, J., & MacKay, I. (2000). Factors affecting the recognition of words in a second language. Bilingualism: Language and Cognition, 3, 55–67.Google Scholar

Moyer, A. (1999). Ultimate attainment in L2 phonology: The critical factors of age, motivation, and instruction. Studies in Second Language Acquisition, 21, 81–108.Google Scholar

Oyama, S. (1976). A sensitive period for the acquisition of a non-native phonological system. Journal of Psycholinguistic Research, 5, 261–285.Google Scholar

Piske, T., MacKay, I.R.A., & Flege, J. E. (2001). Factors affecting degree of foreign accent in an L2. A review. Journal of Phonetics 29, 191–215.Google Scholar

Purcell, E., & Suter, R. (1980). Predictors of pronunciation accuracy: A reexamination. Language Learning, 30, 271–287.Google Scholar

Roach, P. (2009). English Phonetics and Phonology (4^th edition). Cambridge: Cambridge University Press.Google Scholar

Schmid, M. S. (2013). First language attrition. Wiley Interdisciplinary Reviews: Cognitive Science, 4, 117–123.Google Scholar

Schmid, M. S., & Dusseldorp, E. (2010). Quantitative analyses in a multivariate study of language attrition: The impact of extralinguistic factors. Second Language Research, 26, 125–160.Google Scholar

Schmid, M. S., & Hopp, H. (2014). Comparing foreign accent in L1 attrition and L2 acquisition: Range and rater effects. Language Testing, 31, 367–388.Google Scholar

Snow, C. E., & Hoefnagel-Hohle, M. (1977). Age differences in the pronunciation of foreign sounds. Language and Speech, 20, 357–365.Google Scholar

Southwood, M. H., & Flege, J. E. (1999). Scaling foreign accent: direct magnitude estimation versus interval scaling. Clinical Linguistics & Phonetics, 13, 335–349.Google Scholar

Suter, R. (1976); Predictors of pronunciation accuracy in second language learning, Language Learning, 26, 233–253.Google Scholar

Thompson, I. (1991). Foreign accents revisited: The English pronunciation of Russian immigrants. Language Learning, 41, 177–204.Google Scholar

Tuller, L. (2015). Clinical Use of Parental Questionnaires in Multilingual Contexts. In: Armon-Lotem, S., De Jong, J. and Meir, N. (eds.), Assessing multilingual children: Disentangling bilingualism from language impairment pp. 301–330. Bristol: Multilingual Matters.Google Scholar

Wilkoń, A. (2000). Typologia odmian językowych współczesnej polszczyzny. Katowice: Wydawnictwo Uniwersytetu Śląskiego.Google Scholar

Wrembel, M. (2015). In search of a new perspective: Cross-linguistic influence in the acquisition of third language phonology. Poznań: Wydawnictwo Naukowe UAM.Google Scholar

Yeni-Komshian, G. H., Flege, J. E., & Liu, S. (2000). Pronunciation proficiency in the first and second languages of Korean-English bilinguals. Bilingualism: Language and Cognition, 3, 131–149.Google Scholar