1. Introduction
Heritage language (HL) has only recently become a valued source of data for research in theoretical linguistics due to its unique properties. Heritage speakers (HSs) are early bilinguals who were raised speaking the minority language but switched to the majority language in later childhood (Valdés, Reference Valdés, Kagan and Rifkin2000). In many aspects of language production and comprehension, they differ from other bilinguals such as second language learners. Despite the early exposure to the HL, most HSs have very limited literacy skills, especially when the HL orthography is different from the majority language, as in the case of Arabic, Chinese, Korean, and Russian, with English being a majority language in the USA (Benmamoun, Montrul & Polinsky, Reference Benmamoun, Montrul and Polinsky2013; Carreira & Kagan, Reference Carreira and Kagan2011; Koda, Zhang & Yang, Reference Koda, Zhang, Yang, He and Xiao2008; Polinsky, Reference Polinsky2018; Polinsky & Kagan, Reference Polinsky and Kagan2007; Xiao, Reference Xiao2006). In this eye-tracking study, we investigate reading skills of bilingual adult and adolescent speakers of Russian, non-Roman-based HL, and explore a connection between reading and the proficiency level in HL.
We start with a brief description of universal eye-movement measures in reading and how they are affected by lexical properties of the words. Then we present the novel Bilingual Russian Sentence Corpus (BiRSC) that focuses on reading skills in two groups of bilingual heritage Russian-English HSs and compare them to the previously studied two comparison groups: namely, Russian-speaking monolingual skilled readers (RSC; Laurinavichyute, Sekerina, Alexeeva, Bagdasaryan & Kliegl, Reference Laurinavichyute, Sekerina, Alexeeva, Bagdasaryan and Kliegl2019) and Russian-speaking monolingual children learning to read (Korneev, Akhutina & Matveeva, Reference Korneev, Akhutina and Matveeva2017). We add another comparison group, L2 learners of Russian, and demonstrate that the proficiency in HL only weakly influences eye-movement characteristics in reading as HL reading patterns regardless of their proficiency, were more similar to L2 learners and children than monolingual adults, a finding consistent with current bilingual language processing models and theories of HL acquisition.
1.1. Eye-movement benchmarks in monolingual reading
The start of the basic eye-movement research in reading goes back to the early 20th century (Huey, Reference Huey1908) and since then language researchers have been using eye-movement measures to test various psycholinguistic theories (Radach & Kennedy, Reference Radach and Kennedy2004; Rayner, Reference Rayner2009) as well as the models of eye-movement control (Reichle, Pollatsek, Fisher & Rayner, Reference Reichle, Pollatsek, Fisher and Rayner1998; Engbert, Nuthmann, Richter & Kliegl, Reference Engbert, Nuthmann, Richter and Kliegl2005). These measures are traditionally classified either as early or late (see Table 1 for the list of measures and their description), depending on what stage of language processing they reflect (Clifton, Staub & Rayner, Reference Clifton, Staub, Rayner, van Gompel, Fischer, Murray and Hill2007; Roberts & Siyanova-Chanturia, Reference Roberts and Siyanova–Chanturia2013). Early measures are generally sensitive to lexical access, early information integration, and early morphological decomposition (but see Vasishth, von der Malsburg & Engelmann, Reference Vasishth, von der Malsburg and Engelmann2013, for discussion). Late measures are indicative of post-lexical processing wherein they reflect reanalysis and recovery from difficulties in morphosyntactic processing as well as semantic integration (Boston, Hale, Kliegl, Patil & Vasishth, Reference Boston, Hale, Kliegl, Patil and Vasishth2008).
Table 1. Early and late eye-movement measures in reading.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210115121525086-0689:S136672892000019X:S136672892000019X_tab1.png?pub-status=live)
Lexical properties of the word that influence eye movements, i.e., frequency, predictability, and length, are also well-established (for review, see Staub & Rayner, Reference Staub, Rayner and Gaskell2007). Typically, high-frequency words are recognized faster compared to the low-frequency words, the phenomenon known as the word frequency effect (FE, e.g., Inhoff & Rayner, Reference Inhoff and Rayner1986). In terms of eye-movement measures, the effect leads to shorter mean fixation durations and higher skipping probabilities on the high-frequency words. The same findings are observed for words with high predictability. Conversely, reading times increase as the length of the word increases whereas the skipping likelihood decreases (Rayner, Reference Rayner1998).
The effects of lexical properties on eye movements are universal across languages despite differences in grammar and script, supporting the theory of the “universal science of reading” (Share, Reference Share2008). The basic eye-movement characteristics, on the other hand, vary dependent on the writing script (e.g., alphabetic vs. logographic), visual and informational density (Liversedge, Drieghe, Li, Yan, Bai & Hyönä, Reference Liversedge, Drieghe, Li, Yan, Bai and Hyönä2016). In English, the mean fixation durations in monolingual adults typically range between 220–250 ms as a function of the word's length, frequency and predictability as well as its fit in semantic and syntactic context of the sentence. Approximately one-third of all words are skipped, with skipping probabilities contingent on the word length (i.e., 3-letter words are skipped 70% of the time, 7-letter ones only 20%; function words are skipped more often than content words). The probability of regressions is about 20%, but it varies depending on the syntactic and semantic complexity of the text.
The saccade landing position is also dependent on the writing system. This measure provides information about the preferred viewing position within the word and serves as the basis of comparison with the optimal viewing position (OVP). O'Regan and Jacobs (Reference O'Regan and Jacobs1992) demonstrated that in cases where the first eye fixation is in OVP, the word is recognized faster and requires fewer re-fixations. In alphabetic languages with Latin script, such as English, German or French, OVP is located closer to the center of the word (Hyönä & Bertram, Reference Hyönä and Bertram2011; Nuthmann, Engbert & Kliegl, Reference Nuthmann, Engbert and Kliegl2005; Vitu, McConkie, Kerr & O'Regan, Reference Vitu, McConkie, Kerr and O'Regan2001). In logographic non-Roman-based languages, such as Chinese, it varies from the center of the word that was fixated only once to the beginning of the word that has multiple fixations (Li, Liu & Rayner, Reference Li, Liu and Rayner2011). In Arabic, the OVP depends on the word length, wherein short words attract center-based saccade landing and long words receive the first saccade at the beginning of the word (Paterson, Almabruk, McGowan, White & Jordan, Reference Paterson, Almabruk, McGowan, White and Jordan2015).
In monolingual children (findings from English and Finnish), eye-movement benchmarks reflect their developmental nature, with longer mean fixation durations (280–300 ms), lower skipping rate (ranging from 9% to 39% of all words), and more regressions (30%). In comparison to adults, children are slower with grapheme-to-phoneme decoding process compared to adults due to less experience with written materials and smaller visual perceptual span. Similar to adults, children as young as 7 years old tend to land the first saccade closer to the word center (Barnes & Kim, Reference Barnes and Kim2016; Blythe & Joseph, Reference Blythe, Joseph, Liversedge, Gilchrist and Everling2011; Joseph, Liversedge, Blythe, White & Rayner, Reference Joseph, Liversedge, Blythe, White and Rayner2009; Vitu et al., Reference Vitu, McConkie, Kerr and O'Regan2001). The differences in reading skills gradually disappear with age (Blythe, Häikiö, Bertam, Liversedge & Hyönä, Reference Blythe, Häikiö, Bertam, Liversedge and Hyönä2011; Häikiö, Bertram, Hyönä & Niemi, Reference Häikiö, Bertram, Hyönä and Niemi2009; Mancheva, Reichle, Lemaire, Valdois, Ecalle & Guérin-Dugué, Reference Mancheva, Reichle, Lemaire, Valdois, Ecalle and Guérin-Dugué2015).
1.2. Eye-movement benchmarks in bilingual L2 reading
Only recently have eye movements in reading become a more common object in bilingualism research. The primary goal of these studies is to use the eye-movement characteristics in L2 reading to tap into key debates of bilingual language processing, such as lexical access, organization of bilingual mental lexicon, and grammatical and discourse processing (for review, see Roberts & Siyanova-Chanturia, Reference Roberts and Siyanova–Chanturia2013).
Early eye-movement measures (see Table 1) reflect lexical access – therefore, they could be used to empirically test theories of word representations in the bilingual lexicon, such as the weaker links account (Gollan, Montoya, Cera & Sandoval, Reference Gollan, Montoya, Cera and Sandoval2008), according to which the reduced exposure and less accumulated practice of both of the bilingual's languages lead to the weakened links between word forms and their mental representations. As a result, lexical access in non-dominant language is slowed down as reflected in larger frequency effects and longer reading times. Proficiency, however, should modulate these effects because high-proficient L2 bilinguals have higher amounts of non-dominant language exposure.
Indeed, in recent L2 corpus eye-movement studies with unbalanced L1 Dutch-L2 English readers, Cop and colleagues (Cop, Drieghe & Duyck; Reference Cop, Drieghe and Duyck2015; Cop, Dirix, Drieghe & Duyck, Reference Cop, Dirix, Drieghe and Duyck2017) found longer total reading times (1523 ms), longer average fixation durations (239 ms), more fixation counts (8.3 per sentence), and decreased skipping probability (48% vs. 52% in L1) in reading in a non-dominant language. In that sense, L2 readers resemble young monolingual children who just started to learn to read (Blythe & Joseph, Reference Blythe, Joseph, Liversedge, Gilchrist and Everling2011; Blythe et al., Reference Blythe, Häikiö, Bertam, Liversedge and Hyönä2011) or low-literate monolingual adults (Barnes & Kim, Reference Barnes and Kim2016; Kuperman & Van Dyke, Reference Kuperman and Van Dyke2011). However, the influence of L2 proficiency on sentence level reading parameters was small (only the fixation count was affected) suggesting that L2 reading can be close to L1 but quite variable. There was no difference in bilinguals’ L1 and monolingual reading which implies that, for late bilinguals, the strength of links between lexical representations and word forms for L1 words is comparable to that of monolingual readers.
Whitford and Titone (Reference Whitford and Titone2012, Reference Whitford and Titone2016, Reference Whitford and Titone2017) who recorded eye movements of late L2 bilinguals of various ages (18–86 years), different language dominance (English or French), and proficiency found that the amount of exposure to the weaker language determined the magnitude of the frequency effect, with lower levels of exposure leading to larger frequency effects in early and late eye-movement measures in L2. Berzak, Katz and Levy (Reference Berzak, Katz, Levy, Walker, Ji and Stent2018) recently demonstrated that the proficiency level in L2 could affect eye movements even more directly. They found that first fixation duration and total reading times not only correlated with standardized tests of English language proficiency (MET: r = .5 and TOEFL: r = .54) but were also effective in predicting the outcomes of these tests.
To summarize: monolinguals and high-proficient bilingual adult readers do not show significant qualitative differences in eye-movement measures in L1 reading (cf. Whitford & Joanisse, Reference Whitford and Joanisse2018, for differences in bilingual children). However, eye movements in low-proficient L2 readers are characterized by longer fixation durations, lower likelihood of skipping the words, higher probabilities of regressive saccades, and larger frequency effects making them similar to monolingual children or monolingual adults with poor reading skills. Such empirical findings support the weaker links account (Gollan et al., Reference Gollan, Montoya, Cera and Sandoval2008) that puts exposure to non-dominant language as the bottleneck factor affecting various aspects of non-dominant language processing, including acquisition and fluency of reading skills.
1.3. Reading in a heritage language
Little is known about literacy skills of young adult HSs whose HL development was interrupted by the start of school in the dominant language. With respect to reading, previous research suggests that there are no benefits of the early exposure to literacy in HL. Earlier studies that compared Chinese L2 learners and Chinese HSs reported no difference between these two groups in Chinese character recognition, reading comprehension or vocabulary knowledge (Ke, Reference Ke1998; Xiao, Reference Xiao2006). More recent HL research revealed only the facilitatory effect of vocabulary on reading comprehension in HL (Zhang & Koda, Reference Zhang and Koda2018).
Literacy remains the weakest domain in HSs in comparison to L2 learners who outperform them on a variety of written tasks, even when the dominant and HL languages share the same script, such as English and Spanish (Keating, VanPatten & Jegerski, Reference Keating, VanPatten and Jegerski2011; Potowski, Jegerski & Morgan-Short, Reference Potowski, Jegerski and Morgan-Short2009; Tse, Reference Tse2001). While the reasons for L2 advantage in written tasks is a topic for separate research, one can think that the possible explanation lies in the formal setting of the L2 acquisition. HSs acquire language at home and, in most cases, have very limited experience with its written form, whereas L2 speakers receive formal classroom instruction aimed to speed up literacy acquisition.
In contrast to literacy, oral competency in HL is consistently linked to better performance of HSs in auditory tasks compared to L2 learners. For example, Gor, Cook, Pandza and Chrabaszcz (Reference Gor, Cook, Pandza and Chrabaszcz2018, February) compared the performance of 28 Russian HSs and 31 L2 learners in a grammaticality judgment task (GJT) in visual and auditory modalities. The results showed that, while there was no difference in accuracy scores among groups on the visual GJT, HSs outperformed proficiency-matched L2 learners in the auditory modality. Considerably better auditory language processing skills are also reported for Chinese HSs (Xiao, Reference Xiao2006) and Spanish HSs (Potowski et al., Reference Potowski, Jegerski and Morgan-Short2009). From a pedagogical perspective, it is important to identify how literacy skills differ between HSs and L2 learners so that they can be targeted for training in heritage or mixed language learning classrooms.
As of yet, eye-movement benchmarks in reading in HL remain unknown, and their investigation is the primary goal of the present study: This is the first eye-tracking study of reading in HL of non-Roman-based orthography that compares HSs to monolingual children learning to read and L2 learners and explores a connection between reading in HL and the proficiency level. We seek answers to research questions in (1)-(3):
(1) How does proficiency (high vs. low) in HL defined by simple reading tests affect eye movements in reading isolated sentences?
(2) Whose eye-movement patterns do HSs’ patterns resemble the most: those of adults, L2 learners or monolingual Russian children learning to read?
(3) Will canonical effects of word length and frequency (both foveal and parafoveal) hold for reading in HL?
2. Eye-movement benchmarks in reading in Russian
The non-Roman-based Cyrillic alphabet, complex polysyllabic structure of the words, and morphological richness distinguish Russian from many other European languages (for overview of Russian orthography, see Rakhlin, Kornilov & Grigorenko, Reference Rakhlin, Kornilov, Grigorenko, Verhoeven and Perfetti2017) and make it an ideal target for comparative reading research in alphabetic languages. Despite the fact that this method of eye-movement recordings was tested in the Soviet Union as early as in the 1960s by Alfred Yarbus (Reference Yarbus1967), until recently there were no studies that investigated eye-movement benchmarks in Cyrillic in monolingual Russian adults or acquisition of reading in Russian children. To the best of our knowledge, our study that established the Russian Sentence Corpus (RSC , Laurinavichyute et al., Reference Laurinavichyute, Sekerina, Alexeeva, Bagdasaryan and Kliegl2019) was the first one to fill in this gap. Eye movements of monolingual Russian 8-year-old children were also only recently investigated, using a design parallel to the RSC (Korneev, Akhutina & Matveeva, Reference Korneev, Akhutina and Matveeva2017). In order to test our predictions regarding eye-movement benchmarks and literacy skills of HL Russian young adults, we will use the previously collected data from skilled monolingual readers and monolingual children and compare our novel data form HSs and L2 learners to them.
2.1. Russian Sentence Corpus (RSC) and its child version
The RSC design follows the cross-linguistic protocol of the Potsdam Sentence Corpus for German (Kliegl, Grabner, Rolfs & Engbert, Reference Kliegl, Grabner, Rolfs and Engbert2004) and includes basic eye-movement characteristics of 96 skilled monolingual Russian readers (66 women, M Age = 24, range 18–80). The RSC is based on 144 sentences randomly selected and modified from the Russian National Corpus that represent various types of grammatical structures typical of the Russian language. Each sentence contains a target word orthogonally manipulated in a 3 × 3 × 2 design: the part-of-speech (adjectives, nouns, verbs), length (short, medium and long), and frequency (either high:> 50 ipm, or low: <10 ipm). The dependent measures for the RSC are listed in Table 1. Although there were some language-specific differences (see in Laurinavichyute et al., Reference Laurinavichyute, Sekerina, Alexeeva, Bagdasaryan and Kliegl2019 for discussion), in general the eye-movement benchmarks in reading in Russian presented in Table 4 (the first column) were consistent with findings for other alphabetic languages and showed the universal lexical effects of length, frequency, and predictability (see Table 2 in Laurinavichyute et al., Reference Laurinavichyute, Sekerina, Alexeeva, Bagdasaryan and Kliegl2019).
Korneev and colleagues (Reference Korneev, Akhutina and Matveeva2017) developed the child version of the RSC for 37 monolingual Russian children (17 girls; M Age = 8.6). The children were second graders in a Moscow public school, and, after one full year of formal literacy instruction, were the youngest group that was able to read in whole words with the average speed of 70 words per minute and comprehend the reading material making them the first age group appropriate for the comparison to adult HSs. The children of this age also represent a particularly interesting group for the comparison with our HS participants with respect to one of the hypotheses in HL acquisition, the divergent attainment hypothesis (Benmamoun et al., Reference Benmamoun, Montrul and Polinsky2013; Montrul, Reference Montrul2008; Scontras, Fuchs & Polinsky, Reference Scontras, Fuchs and Polinsky2015). It suggests that HL developmental delays, which result from the abrupt switch to the majority language in childhood (typically between the ages of 5 to 9) and inevitable reduced HL exposure, continue into adulthood. Consequently, HSs’ language skills including literacy cease to develop beyond the switch point and often resemble those of young monolingual children.
While it was not possible to use the exact same 144 sentences from the RSC, 30 new similar sentences were constructed around the same 30 target nouns from the RSC that were deemed appropriate for 8-year-old children. The child corpus followed 3 × 2 design, in which authors manipulated the length and frequency of the target nouns. Table 4 (third column) reveals that, as expected, Russian children produced multiple fixations per word with longer mean durations in early and late measures, skipped fewer words, and made more regressive saccades to the currently fixated words. The average mean fixation durations, skipping, and regression rates as well as the effects of word length, word frequency, and their interaction were comparable to what was found for children learning to read in Roman-based alphabets (for review, see Blythe & Joseph, Reference Blythe, Joseph, Liversedge, Gilchrist and Everling2011). The effects of length and frequency of the parafoveal words were not investigated.
2.2. The present study: Bilingual Russian Sentence Corpus (BiRSC)
Following the design and analysis in Laurinavichyute et al. (Reference Laurinavichyute, Sekerina, Alexeeva, Bagdasaryan and Kliegl2019), the current study investigates literacy skills of young adult HSs of Russian. First, we establish eye-movement benchmarks in corpus, i.e., reading of the isolated sentences for two proficiency levels (high vs. low). In lieu of the absence of the standardized HL proficiency test which would take into account heritage advantage in some of the linguistic domains (phonology and semantics; oral production and comprehension) and potential disadvantage in others (syntax and morphology; written production and comprehension), we operationally defined proficiency in HL reading as a set of scores in reading speed, quality of reading, and comprehension based on their performance on two reading assessment tests used for monolingual Russian second-graders (Fotekova & Akhutina, Reference Fotekova and Akhutina2002). These scores allowed us to classify HSs participants into high-proficient and low-proficient readers (see Method section for details).
Second, we compare HS eye-movement benchmarks to the baseline (i.e., monolingual adults from RSC), and then two comparison groups (i.e., monolingual children from the child RSC and L2 learners tested in the present study). Based on the available research in L2 and HL reading as well as predictions of the weaker links account, we expect eye movements of both groups of HSs to be different from skilled monolingual adults as both groups have reduced HL input, although to a different extent. Nevertheless, we predict that if high-proficient HSs perform well on two reading assessment tests, they should also perform better (e.g., read faster, skip more words, have shorter fixation durations and regress less) than low-proficient readers or monolingual children and be closer to monolingual adults in their eye-movement characteristics. Accordingly, we expect low-proficient HSs to perform on par with L2 learners, providing support for the weaker links account, and with children, confirming predictions of the divergent attainment hypothesis in HL (Benmamoun et al., Reference Benmamoun, Montrul and Polinsky2013; Montrul, Reference Montrul2008).
Finally, in line with the previous findings, we expect to confirm universal lexical effects of length and frequency on the eye movements in reading in HL. However, we hypothesize that low-proficient HSs would show reduced sensitivity to the length and frequency of the parafoveal words focusing more on the currently fixated word (for review of foveal-on parafoveal processing see Drieghe, Reference Drieghe, Liversedge, Gilchrist and Everling2011; for findings in bilingual parafoveal processing, see Whitford & Titone, Reference Whitford and Titone2015; Reference Whitford and Titone2016).
3. Method
3.1. Participants
Fifty adult English–Russian HSs (23 women, MAge = 18.4, range 13–29, MAgeofArrival = 3.5) and 27 L2 learners (17 women, MAge = 21.2, range 16–43) participated in the study. We recruited participants from two sites, a large urban university and a specialized public high school, both in New York City. The study was approved by the university's Institutional Review Board (IRB) and the New York City Department of Education IRB. Before the start of the study, all participants (over 18 years) or parents/guardians of the participants (under 18 years) signed the informed consent form (minor participants also provided their assent) and filled out the language background questionnaire, administered in English. Bilinguals were matched on the dominant language (English) and age (see Table 2 for other background information).
Table 2. Participant characteristics and average scores for performance on 4 reading assessment tasks.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210115121525086-0689:S136672892000019X:S136672892000019X_tab2.png?pub-status=live)
3.2. Establishing proficiency in Russian HL reading
We wanted to include every HS participant in the study and establish his or her proficiency in Russian using our reading tests as proxy because we lack the standardized assessment. However, recall that many HSs cannot read in Russian at all. To deal with this problem, we included 48 out of total 50 Russian HSs as they minimally matched reading skills of the youngest monolingual group of Russian 8-year-old second graders (Korneev et al., Reference Korneev, Akhutina and Matveeva2017) on the average reading speed (words per minute), quality of reading (syllables vs. words), and text comprehension. To do so, we used two simple reading assessments to approximate their proficiency in Russian HL reading. They were (1) Russian Word Identification (Word ID-Rus) task and (2) Russian Oral Reading Fluency (ORF-Rus) task.
Russian Word Identification (Word ID-Rus) task
This task was used to screen participants’ ability to read the Cyrillic letters because many Russian HSs either cannot read at all or forgot how to read. We adapted the Word Identification task from Fotekova and Akhutina's neuropsychological assessment for elementary school Russian-speaking children (Fotekova & Akhutina, Reference Fotekova and Akhutina2002). Participants were asked to read out loud 24 single words in Russian (the complete list of words is presented in Fotekova & Akhutina, Reference Fotekova and Akhutina2002, p. 21). The words tested the reader's mastery of the pronunciation of all phonemes as well as of the major phonetic processes in Russian, such as palatalization (SHMEl’ /ʃmʲˈel’// ‘bee’), word final devoicing (FLAG /flak/ ‘flag’), syllabic stress shift (SAPOGÍ´PL - SAPÓGSG ‘boots-boot’), and vowel reduction (SOBÁKA /sabáka/ ‘dog’). The reading fluency was assessed based on three criteria with a maximum of 15 points for each (45 in total): reading speed (words per minute), method of reading (whole words/syllables/sounds), and number of errors (pronunciation). Oral productions were scored by the native Russian speaker, the first author. The two participants who scored fewer than 15 points in total were classified as not eligible and did not proceed any further in the study.
Russian Oral Reading Fluency (ORF-Rus) task
The remaining 48 participants were asked to read out aloud a short text in Russian Kak ja lovil rakov “How I was Catching Crayfish” (202 words; Fotekova & Akhutina, Reference Fotekova and Akhutina2002, p. 21). The text utilizes various grammatical constructions (e.g., relative clauses, passives, null object, subject drop, zero copula, impersonal verbs, double negative), tenses (including historic present), different word orders (SVO, VSO, and OVS) embedded into declarative and exclamatory sentences that contained lexical items of different frequencies, illustrated in Example (4).
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210115121525086-0689:S136672892000019X:S136672892000019X_tabU1.png?pub-status=live)
The maximum score for the task was 45 points. We used three criteria for reading fluency: reading speed (words per minute), comprehension score (3 comprehension questions), and number of reading errors (stress, pronunciation, omissions, repetitions). As a result, HSs (and L2) participants were classified into two subgroups, high- and low-proficient readers.
High-proficient HSs (n = 21, 13 women, MAge = 17.5, range 13–24; MAoA = 5.1) scored ≥ 30 pts whereas the remaining 27 HSs were low-proficient (22 women, MAge = 19, range 15–29, MAoA = 2.3), scoring < 30 pts. All 27 L2 participants were classified as low-proficient readers.
3.3. Design and materials
Having classified HSs according to their proficiency in reading with the help of Word ID-Rus and ORF-Rus tasks, we also administered two parallel tasks in English: namely, English Word Identification (Word ID-Eng) and English Oral Reading Fluency (ORF-Eng), to rule out general reading difficulties in their dominant (English) language.
English Word Identification (Word ID-Eng) Subtest
English Word Identification (Word ID-Eng) Subtest of the Woodcock Reading Mastery Tests (WRMT 3rd edition, Woodcock, Reference Woodcock2011). Word ID-Eng is a part of the standardized assessment of reading in English. It serves as a test of a decoding skill and requires participants to read out loud English words in the set of 5–6 items of increasing difficulty (e.g., plausible, abdominal in the initial set and ennui, dossier in the final set). The task included 17 trials, with a maximum score of 46 points (scoring starts with the baseline of 30 pts). Testing is discontinued after three consecutive errors in pronunciation. All 48 HS participants passed the baseline revealing reading fluency in the dominant language.
English Oral Reading Fluency (ORF-Eng) Subtest
English Oral Reading Fluency (ORF-Eng) Subtest of the Woodcock Reading Mastery Tests (WRMT 3rd edition, Woodcock, Reference Woodcock2011). ORF-Eng measures the participant's ability to fluently read connected text. The task is to read out loud sentences that gradually increase in difficulty (e.g., the last sentence was: Since then, it has been easier with the introduction with the box camera and flexible film, and most recently, the “point-and-shoot” process that requires no specialized knowledge at all.) Performance was scored for both accuracy and fluency of expression. The experimenter noted errors and omissions, including mispronunciations, word substitutions, hesitations, repetitions, and transpositions. The final score was calculated based on the formula which takes into consideration the total reading time, number of words in the passage, and number of errors. Table 2 provides the oral fluency scores both in Russian and English.
Bilingual Russian Sentence Corpus (BiRSC): Reading experiment
The design and materials for the present study follow the design and materials from RSC (Laurinavichyute et al., Reference Laurinavichyute, Sekerina, Alexeeva, Bagdasaryan and Kliegl2019) and its child version (Korneev et al., Reference Korneev, Akhutina and Matveeva2017). Low-proficient HSs and L2 learners read the 30 sentences from the child version of RSC whereas high-proficient HSs read one-half of the 144 sentences from RSC (version A the first 72 sentences, version B the second 72 sentences), in order to accommodate time constraints of the study. We refer to A and B versions of RSC as the advanced BiRSC and to its child version as the beginner BiRSC. Table 3 presents the descriptive characteristics of all corpus words and sentences from the beginner and the advanced BiRSC. The sentences and the script used for the analyses reported below are available at the Open Science Framework project page doi:10.17605/OSF.IO/TCRBA.
Table 3. Descriptive characteristics of the beginner and advanced versions of BiRSC.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210115121525086-0689:S136672892000019X:S136672892000019X_tab3.png?pub-status=live)
a First and last words in each sentence were not analyzed.
All words in the BiRSC were annotated for length and frequency. Frequency information was taken from Lyashevskaya and Sharov (Reference Lyashevskaya and Sharov2009). Examples (5) and (6) illustrate representative sentences from advanced and beginner versions, respectively. (The morphological markers are omitted for ease of exposition). To make sure that the participants read for content, all sentences in BiRSC were followed by a multiple-choice comprehension question. High-proficient readers had three alternatives per question (5) and low-proficient readers had two (6).
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210115121525086-0689:S136672892000019X:S136672892000019X_tabU2.png?pub-status=live)
3.4. Procedure
All sentences were presented in Ubuntu Mono Normal black font, size 22 pt, on a light grey background on the BenQ XL2411Z 144 Hz monitor (resolution: 1920 × 1080 pix) controlled by a ThinkStation computer. Presentation was programmed in Experiment Builder (SR Research Ltd.). The eye movements were recorded by the Eyelink 1000+ desktop mount eye-tracker using a chin rest. Participants were seated 55 cm from the camera and 92 cm from the monitor. One letter subtended 0.34° visual angle. Only the right eye was tracked, at 1000 Hz rate.
The experiment began with a 9-point calibration which was repeated after every 15 sentences. Each trial started with the fixation point at the position of the first letter in the sentence presented for 500 ms. If the fixation detection was successful, the experiment automatically proceeded to the presentation of the sentences; otherwise, calibration was repeated. The experiment started with three practice trials. To indicate that they finished reading the sentence, participants fixated the red dot at the lower right-hand corner of the screen; when the fixation was detected by the eye-tracker, the trial proceeded to the comprehension question which the participants answered by clicking on one alternative with a mouse. After 1-s delay, the program proceeded to the next sentence and comprehension question trial. The experimental sentences (72 or 30) appeared on the screen in randomized order.
4. Results
4.1. Descriptive statistics: Reading assessment tasks 1–4
Table 2 (last 4 rows) presents means and standard deviations for reading assessments in Russian and English. The performance in the ORF-Rus was scored based on 1) the pre-defined criteria established by Fotekova and Akhutina (Reference Fotekova and Akhutina2002), and 2) the Woodcock Reading Mastery Tests formula; the two set of scores were highly correlated, r = .80, p < .001. We used the latter score in the statistical analyses to ensure the comparability with the score of the ORF-Eng.
The significantly higher scores on both Russian tasks for 21 HSs (Word ID-Rus [t (30)= 5.11, p = .012]; ORF-Rus [t (46) = 7.12, p < .001]) as compared to 27 remaining HSs and all L2 (Word ID-Rus [t (27) = 4.72, p < .001]; ORF-Rus [t (30) = 9.68, p < .001] supported the predefined classification of participants as high-proficient and the latter as low-proficient readers. In ORF-Eng, there was no difference among the three groups in the performance, (ts < 1). In Word ID-Eng, high- and low-proficient HSs did not differ (t (46) = −1.90, p = .204), but L2 readers obtained significantly higher scores than high-proficient HSs (t (46) = −2.88, p = .018), suggesting that L2 learners are faster with grapheme-to-phoneme decoding process in English than high-proficient HSs. Low-proficient HSs and L2 readers did not differ on any of the four tasks. (See Table S5 for exact t-values and p-values for all measures, Supplementary Materials).
Performance on the ORF-Rus task was also correlated with the self-reported amount of Russian language exposure per day (r = .54, p = < .001), age of arrival to the USA (r = .52, p = < .001), age of reading start in Russian (r − .51, p = <.001), and self-reported comprehension ability in Russian (r = .58, p < .001). To make sure that variability in the age range for HSs does not affect the results, we conducted independent t-tests for reading assessments and other background information by splitting HSs into two age groups: adolescents younger than 18 (n = 28) and young adults 18 or older (n = 47). The only differences (t (73) = 2.08, p = .041) was in self-reported age of reading start in Russian (Madolescent = 9.1, SD = 4.7 vs. Madult = 12.6, SD = 8.2) and objective measurement for oral reading fluency in English (t (73) = −2.77, p = .007) where adolescents scored higher than adults (Madolescent = 31.1, SD = 6.4 vs. Madult = 27.1, SD = 5.8).
4.2. Descriptive statistics: eye-movement benchmarks in reading in HL
Table 4 presents means and standard deviations for nine dependent measures from Table 1 for the two groups of high-proficient (second column) and low-proficient HSs (fourth column) and compares them to those of the monolingual Russian adults (n = 96, Laurinavichyute et al., Reference Laurinavichyute, Sekerina, Alexeeva, Bagdasaryan and Kliegl2019), 8-year-old children (n = 37, Korneev et al., Reference Korneev, Akhutina and Matveeva2017), and L2 learners (n = 27, this study). Additionally, we included the mean number of fixations per word (x) and the saccade landing position (xi). Sentences with incorrect comprehension question responses were excluded from the analysis (low-proficient HSs M accuracy = 81%; high-proficient HSs M accuracy = 92.7%; L2 learners M accuracy = 85.1%; children M accuracy = 98%; monolingual adults M accuracy = 99%). Descriptive statistics were calculated for all words in the corpora, between-group differences were calculated using series of independent t-tests with Bonferroni correction for multiple comparisons at α-level .005 (see Table S6 for t-values and corresponding p-values, Supplementary Materials).
Table 4. Comparison of basic parameters of eye movements ((i) time duration measures, (ii) probabilities of skipping or fixating the word, (iii) probability of regressions, saccade landing sites and number of fixations per word) in reading in Russian (SD in parentheses).
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210115121525086-0689:S136672892000019X:S136672892000019X_tab4.png?pub-status=live)
† All the differences (RO difference is marginal) are significant between HSs and monolingual adults.
* Significant differences between HSs and monolingual children. Significant differences between HSs and L2learners are in bold.
High-proficient HSs
On the one hand, as Table 4 reveals, high-proficient HSs’ eye movements were significantly different from eye movements of monolingual adults as assessed with a series of independent t-tests between groups (Table S6). On the other hand, they were strikingly similar to monolingual children in all measures except gaze duration (GD), total reading time (TT) and the probability of regression from the fixated word (RO) (Table 5).
Table 5. Conceptual comparison of high-proficient HSs and low-proficient HSs to other groups (to match the statistical analysis in Table 4.) All the differences (RO difference is marginal) are significant between HSs and monolingual adults. Empty cells designate no difference between groups.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210115121525086-0689:S136672892000019X:S136672892000019X_tab5.png?pub-status=live)
When compared to two low-proficiency groups, low-proficiency HSs and L2, high-proficient HSs were significantly faster in GD and TT measures, skipped more words, re-fixated the words less and, with respect to L2 learners, high-proficient HSs produced lower rates of regressive saccades (RO).
Low-proficient HSs
All low-proficient HSs’ eye-movement characteristics were significantly different from those of monolingual adults (RO is marginal). Low-proficient HSs matched children in first fixation duration (FFD) and single fixation duration (SFD) measures (Figure 1A), as well as the probability of regression from the fixated word (RO; Figure 1B) and saccade landing position. However, they fixated more than half of the words at least twice, for longer times (GD and TT), and skipped (P0) fewer words as compared to children. Low-proficient HSs did not differ from L2 learners in any of the measures except fixation probability and regression originating from the fixated word (Table S6).
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210115121525086-0689:S136672892000019X:S136672892000019X_fig1.png?pub-status=live)
Fig. 1. Means for (A) time durations measures and (B) probabilities of skipping (P0), fixating (P1, P2+) and making regressions (RO, RG) by each group of speakers.
Next, we compared the differences in frequency effects (FEs) between low-proficient and high-proficient HSs because of the sensitivity to the amount of language exposure in L2 reading (Whitford & Titone, Reference Whitford and Titone2012). Words were divided into high frequency (HF) and low frequency (LF) using a median split in log-transformed frequency for high-proficient HSs (HF: n = 521, ≥ 1.49; LF: n = 523, < 1.49) and the low-proficient HS group (n = 95, ≥ 1.89; LF: n = 96, < 1.89). The median split in advanced BiRSC roughly corresponded to the commonly used thresholds for low (medium)- and high-frequency words (low-frequency range: 1–32 instances per million; high-frequency range: 32–38107 ipm). In the beginner BiRSC, however, the lower frequency range also included words with considerably high ipm count due to the nature of the corpus, in which the sentences were constructed for reading by children (low-frequency range: 1–75 instances per million; high-frequency range: 82–38107 ipm).
Table 6 shows that there are only a few differences in the magnitude of the FEs between the two groups. Low-proficient HSs showed significantly larger FEs in GD, TT, and the probability of making one fixation (P1). The difference between FEs in skipping probability was also significant, but the pattern was reversed: high-proficient HSs skipped frequent words more often than low-proficient HSs. Note that frequency was used as a continuous variable for all subsequent analyses.
Table 6. Means and standard deviations for all eye-movement measures across word log frequencies. Frequency effects (FEs; i.e., differences between LF and HF words) and corresponding p-values are presented in italics. First four measures (FFD, SFD, GD, TT) are in ms, the rest are percentages.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210115121525086-0689:S136672892000019X:S136672892000019X_tab6.png?pub-status=live)
4.3. Modeling: Relationships between frequency, length, reading assessments and eye movements in BiRSC
We ran (generalized) linear mixed models using R (R Core Team, 2016) that included previous, current and next words’ length and frequency as well as length of incoming saccade to the current word, relative position of the word in the sentence, and saccade landing position on the current word as fixed predictors (see details for the models in Tables S7-S12, Supplementary Materials). Each eye-movement measure was fit to the same set of predictors. Random factors were random intercepts for participants, sentences, and words. No random slopes were added to the final models as such addition resulted in over-parametrization. We also removed random intercepts for sentences in models that resulted in singular fits (the variances across sentences were estimated as zero). Significant effects are adjusted for multiple comparisons using Bonferroni correction at α-level of .005.
To establish the relationship between fixation duration measures in BiRSC and performance for the four reading assessments, we also run a separate set of models that include scores from the reading assessments along with baseline predictors (see details for the models in Tables S13 – S15, Supplementary Materials). Random structure remained the same as in previous set of models. Significant effects are adjusted for multiple comparisons using Bonferroni correction at α-level of .012.
For all analyses, the first and last words of every sentence were excluded and only sentences with correct answers to the comprehension questions were analyzed. Fixations and saccades were extracted from eye-movement data following the algorithm from the Data Viewer package (SR Research Ltd). No cut-off limits were applied to fixations because fixations shorter than 100 ms constituted only 1.9% of all data (the maximum fixation duration was 3779 ms for low-proficient HS that was elicited by low-frequency word of .85 ipm). The predictor of word length was centered and scaled; the frequency was log-transformed (to base 10). Eye-movement duration measures (FFD, SFD, GD, and TT) were log-transformed to ensure normal distribution of models’ residuals. To exclude the possibility of multicollinearity of model predictors, we ensured that the variance inflation factor (VIF) was less than 3 for all predictors.
For binary outcome variables (fixation, skipping and regression probability), we used mixed-effects linear logistic regression. Both linear mixed-effects and generalized linear mixed-effects models were fit with the function (g)lmer from the R-package lme4 (Bates, Mächler, Bolker & Walker, Reference Bates, Mächler, Bolker and Walker2015). The comparison tables (S7-S14, Supplementary Materials) for (G)LMM outcomes were created with the sjPlot package (Lüdecke, Reference Lüdecke2017).
High-proficient HSs
Confirming canonical effects in reading, the lexical factors of word length and frequency reliably affected all measures of interest (see Tables S7-S8, Supplementary Materials). High-proficient HSs fixated longer words for longer time and frequent words for shorter time, and they regressed to longer words more often and rarely skipped them (see Figure 2).
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210115121525086-0689:S136672892000019X:S136672892000019X_fig2.png?pub-status=live)
Fig. 2. High-proficient HSs. All corpus words: Means for four time durations measures as a function of word length (A) and logarithmic word frequency (С); probabilities of skipping or fixating the word as a function of word length (B) and frequency (D).
The effect of the previous (n−1) and upcoming words (n+1)
When the upcoming word was longer, the time spent fixating the current word (GD, TT) decreased. Similarly, higher frequency of the upcoming word decreased total time (TT) spent reading the current word. In regards to the preceding word, longer preceding words led to decreased total reading times (TT) on the current word. When the preceding word had low frequency, high-proficient HSs fixated the current word (TT) longer. In addition, the regression probability decreases when preceding word length and frequency increase (see Tables S7-S8, Supplementary Materials). As effects appeared mostly in total reading time or regression rate measures, we cannot draw any decisive conclusions concerning parafoveal processing, as the word in parafovea could have been fixated before these effects occurred.
Reading assessments
Higher scores in the ORF-Rus were associated with faster reading times for all fixation duration measures (FFD, SFD, GD, and TT). Reading assessments in English did not predict any time duration measures (see Table S13, Supplementary Materials).
Low-proficient HSs
Low-proficient HSs were also sensitive to word length and frequency (see Tables S9-S10, Supplementary Materials) except that the first (FFD) and single fixation duration (SFD) as well as regression rates were not affected by the word length (same finding for SFD for Russian monolingual adults and children). All measures of interest were affected by word frequency (see Figure 3). We did not analyze factors affecting skipping probability as it was very low in general (10%).
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210115121525086-0689:S136672892000019X:S136672892000019X_fig3.png?pub-status=live)
Fig. 3. Low-proficient HSs. All corpus words: Means for four time durations measures as a function of word length (A) and logarithmic word frequency (С); probabilities of skipping or fixating the word as a function of word length (B) and frequency (D).
The effect of the previous (n−1) and upcoming words (n+1)
Only the regression probability (RG) was affected by the upcoming word frequency wherein more frequent words led to a decreased chance of regression. Longer upcoming words led to reduced first fixation duration (FFD). With respect to the preceding word, longer preceding words led to longer reading times in single fixation duration (SFD). Longer and more frequent preceding words were also associated with lower probability of regressing to previous words (RO) (see Tables S9-S10, Supplementary Materials). Thus, low-proficient were similar to high-proficient HSs: they did process some information parafoveally (e.g., length of the parafoveal word), but most of the effects appeared in the late eye-tracking measures (i.e., regression rates).
Reading assessments
None of the reading assessments in Russian or English predicted any of the time duration measures (see Table S14, Supplementary Materials).
L2 learners
L2 learners were sensitive to word frequency in all duration measures (see Figure 4) except first fixation duration (FFD). Longer words led to increased gaze duration (GD), total reading times (TT), and higher probability to multiple fixations on words (P1, P2+) (see Tables S11-S12, Supplementary Materials).
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210115121525086-0689:S136672892000019X:S136672892000019X_fig4.png?pub-status=live)
Fig. 4. L2 learners. All corpus words: Means for four time durations measures as a function of word length (A) and logarithmic word frequency (С); probabilities of skipping or fixating the word as a function of word length (B) and frequency (D).
The effect of the previous (n−1) and upcoming words (n+1)
None of the measures were affected by upcoming or preceding word length or frequency with exception of regression probability (RG), wherein upcoming longer words led to decreased regression rates (see Tables S11-S12, Supplementary Materials).
Reading assessments
None of the reading assessments in Russian or English predicted any of the time duration measures (see Table S15, Supplementary Materials).
5. Discussion
In this eye-tracking study, we presented the Bilingual Russian Sentence Corpus (BiRSC) available at the OSF (doi:10.17605/OSF.IO/TCRBA) that characterizes literacy skills by adult and adolescent HSs of non-Roman-based HL, i.e., Russian. BiRSC contains basic eye-movement characteristics in HL reading as a factor of the HL proficiency level. Then we compared the HSs’ eye-movement benchmarks to those of skilled monolingual Russian readers (baseline), children learning to read, and L2 learners. In what follows, we separately discuss the HSs’ eye-movement benchmarks and the effects of the lexical characteristics of the words, i.e., length and frequency.
5.1. Proficiency and eye-movement benchmarks in reading in HL
High-proficient HSs were classified as such in our study as those who had high scores on reading assessments tests; these scores were also positively correlated with self-reported amount of Russian language exposure per day, self-reported comprehension ability in Russian, and age of arrival to the USA. Surprisingly, our high-proficient HSs were quantitatively different from the skilled monolingual readers in all of the eye-movement measures. If anything, high-proficient HSs more resembled monolingual children in early duration fixations (except gaze duration), probability of skipping, mean number of fixations, regression rate (probability of making a regression from fixated word), and the saccade landing position. Thus, basic eye-movement benchmarks of high-proficient HSs characterize them as bilingual readers with child-like eye-movement patterns in reading isolates sentences.
Proficiency, however, turned out to have a weak effect on HL reading. It affected only some eye-movement benchmarks: namely, the high-proficient HSs read faster (gaze duration and total reading time), skipped more words, and had fewer fixations than the low-proficiency HSs. However, there were no differences between the two groups in many other measures, i.e., in the earliest (first fixation duration, single fixation duration, saccade landing position) and some of the late eye-movement measures (both saccade rates). We conclude that even proficient HSs experience difficulties in HL during both lexical (early) and post-lexical (late) processing.
One possible limitation of our study is that skilled monolingual adults and high-proficient HSs read one, more difficult, set of the sentences whereas children, low-proficient HSs, and L2 learners read another, simpler set of sentences. It was our intention to have the high-proficiency HSs face more complex sentences so that we can compare their eye-movement characteristics to those of adult native speakers and estimate the extent of the gap in reading abilities between these two groups during natural uninterrupted comprehension. The sentences that low-proficiency HSs and L2 learners read were deliberately simplified and were identical to the materials read by the children to allow the direct comparison between children and bilingual readers (recall that children were not able to read the sentences from the RSC). The obtained results of the L2 learners and low-proficient HSs’ eye-movement characteristics, therefore, represent a liberal overestimation of their potential performance on the advanced version of BiRSC. The key differences that we found comparing low- to high-proficient group would also hold for more difficult reading materials but with greater dissimilarities between the proficiency levels. As for similarities (in first and single fixation durations, regression rates and saccade landing position), we speculate that while high-proficient HSs would be likely to outperform low-proficient readers, we are now confident that they still do not reach the reading fluency of monolingual adult counterparts. Therefore, we can speak of a continuum of literacy skills in high-proficient HSs that range between “L2 learner” and “monolingual baseline” stages.
So why is it that the reading abilities in even high-proficient HSs in our study differ so much from skilled monolingual readers while resembling monolingual children learning to read more than any other group? We suggest that these differences follow from two theories, one of bilingualism and another of HL. First, as expected, our results support the weaker links account (Gollan et al., Reference Gollan, Montoya, Cera and Sandoval2008) of lexical access delays in bilinguals. Second, the similarities between HSs and monolingual children are consistent with the divergent attainment hypothesis in HL theory (Benmamoun et al., Reference Benmamoun, Montrul and Polinsky2013; Montrul, Reference Montrul2008; Polinsky, Reference Polinsky2006; Polinsky & Kagan, Reference Polinsky and Kagan2007; Scontras et al., Reference Scontras, Fuchs and Polinsky2015). When HSs switch to the dominant language in childhood, their competence in HL often slows down or even ceases to develop beyond this point. Reduced input in HL and varying exposure to literacy that are difficult to control leave many HSs at the ‘child’ state of language development even as they reach adulthood.
Turning now to the low-proficient HSs, we found that they were on par with the L2 learners in the majority of the eye-tracking measures but less so with the children. They lagged behind the children (as evident from findings of this study and previous research) in mean gaze duration, total reading times, fixation counts, and skipping probability (Blythe et al., Reference Blythe, Häikiö, Bertam, Liversedge and Hyönä2011; Blythe & Joseph, Reference Blythe, Joseph, Liversedge, Gilchrist and Everling2011). Thus, despite some overlap in early fixation duration measures, the low-proficient HSs group resembles more ‘typical’ unbalanced L2 learners.
Finally, we would like to point out one important finding that concerns the saccade landing position. Bilingual readers differed from the skilled adult readers in where they first landed the gaze in the word: their saccade landing position was shifted significantly towards the beginning of the word (i.e., 44% for monolingual adults vs. 36% and 38% for L2 learners and HSs, respectively) compared to the expected word-centered OVP (O'Reagan & Jacobs, Reference O'Regan and Jacobs1992). In that sense, they were similar to the children (36% into the word). This is a pattern reported for less proficient readers, readers with dyslexia or children who read texts too difficult for their age (e.g., Barnes & Kim, Reference Barnes and Kim2016; Hawelka, Gagl & Wimmer, Reference Hawelka, Gagl and Wimmer2010; Kuperman & Van Dyke, Reference Kuperman and Van Dyke2011). The shift to the beginning of the word signals difficulties in grapheme-phoneme conversion, the process that is automatized in skilled readers. Struggling readers process words in a sequential manner, starting with the beginning and slowly progressing along the word, which leads to multiple refixations. For HSs reading in HL Russian, this grapheme-phoneme conversion could be exacerbated by differences between Cyrillic and Roman scripts, in which only 16 letters out of 33 are shared. We hypothesize that grapheme-phoneme conversion of the weaker HL is inhibited by the dominant language (i.e., English).
5.2. Lexical effects on the eye movements in HL
Our study has confirmed the universal effect of lexical characteristics of the words on eye movements, such as length and frequency (e.g., Inhoff & Rayner, Reference Inhoff and Rayner1986; Staub, White, Drieghe, Hollway & Rayner, Reference Staub, White, Drieghe, Hollway and Rayner2010) for reading in HL. Regardless of the proficiency level, both groups of HSs showed sensitivity to frequency (in all measures) and length (in all measures in high-proficient HSs, some measures in low-proficient HSs) of the currently fixated words.
One exception is the lack of length effect on first fixation duration and single fixation duration in low-proficient HSs, but it also lacked for single fixation duration in the monolingual adult (Laurinavichyute et al., Reference Laurinavichyute, Sekerina, Alexeeva, Bagdasaryan and Kliegl2019) and child data (Korneev et al., Reference Korneev, Akhutina and Matveeva2017). Laurinavichyute and colleagues attribute this finding to a specific reading strategy in Russian: namely, that single fixation duration only serves as a quick check for the predictions made for the word before it was fixated, and, therefore, the fixation does not trigger the start of lexical processing and does not depend on the length of the current word. However, we doubt that the same explanation can apply for the low-proficient HSs considering their low skipping probability and high number of fixations per word. The exact same pattern was found in the L2 learners. The reduced lexical access for low-proficient readers might be a more appropriate explanation that led to the chain reaction in the form of low skipping rates and high regression probabilities resulting in multiple and longer fixations (including first fixation) on most words regardless of their length.
The length and frequency of parafoveally presented words had very limited effects on the eye-movement measures in HSs and L2 learners in our study. With a few exceptions, we found that the characteristics of parafoveally presented words had significant impact only on the late eye-movement measures. Specifically, longer and more frequent words in the parafovea decreased the total reading times in high-proficient HSs and reduced regression rates on the currently fixated word in all bilingual groups. However, these findings do not allow us to make strong conclusions concerning parafoveal processing in HSs as the effects occurred mostly beyond the first pass reading (i.e., parafoveal words can receive fixations in second, third etc. passes). This pattern of results suggests that while HSs and L2 learners show some sensitivity to the lexical characteristics of parafoveal words, parafoveal processing has little impact on the initial stages of lexical access of the currently fixated word.
Finally, the frequency effects for HL reading were partially confirmed in our data. According to the weaker links account (Gollan et al., Reference Gollan, Montoya, Cera and Sandoval2008), the connection between word forms and their mental representation is weakened due to the reduced exposure to the bilinguals’ languages. Accordingly, bilinguals show larger frequency effects in their weaker language relative to monolinguals or relative to reading in their dominant language (Gollan, Slattery, Goldenberg, Van Assche, Duyck & Rayner, Reference Gollan, Slattery, Goldenberg, Van Assche, Duyck and Rayner2011; Whitford & Titone, Reference Whitford and Titone2012; cf. Cop et al., Reference Cop, Drieghe and Duyck2015; Duyck, Vanderelst, Desmet & Hartsuiker, Reference Duyck, Vanderelst, Desmet and Hartsuiker2008). The weaker links account also suggests that less proficient bilinguals should show larger frequency effects in the non-dominant language compared to more proficient bilinguals. Our findings are only partially consistent with this prediction; although the low-proficient HSs showed numerically larger frequency effects, significant differences were found only for gaze duration, total reading time, and the probability of fixating the word only once. It is plausible, therefore, that at the earliest stages of lexical access (reflected in single fixation and first fixation durations, skipping probability), greater proficiency in HL does not lead to more efficiency although it plays some role during later stages of HL processing.
In conclusion, this is the first study that has investigated and described basic eye-movement benchmarks in reading in heritage language and compared them to those of monolingual skilled readers, 8-year-old children learning to read, and L2 learners. Our findings suggest that although the proficiency level in HL reading had some effect on the early and late eye-movement measures in reading isolated sentences, its effect was limited. Both high- and low-proficient HSs were more similar to monolingual children learning to read than to skilled readers. Low-proficient HSs were at a particular disadvantage and resembled unbalanced L2 learners more than any other group, suggesting that early exposure to spoken HL does not seem to facilitate literacy and reading fluency in heritage language. We hope the findings reported here will serve as the first step for future research on reading in the heritage language field. Taken into account high variability in spoken HL skills of heritage speakers, an investigation of HSs’ individual differences in reading constitutes the next logical step in systematic study of heritage languages.
Supplementary Material
For supplementary material accompanying this paper, visit http://dx.doi.org/10.1017/S136672892000019X
List of supplementary materials
Table S5
Table S6
Tables S7
Table S8
Table S9
Table S10
Table S11
Table S12
Table S13
Table S14
Table S15
Acknowledgements
The contribution of Anna K. Laurinavichyute and Irina A. Sekerina was funded within the framework of a subsidy by the Russian Academic Excellence Project ‘5–100’. Irina A. Sekerina was also supported by a PSC-CUNY award TRADB-48–172. We are grateful to Aleksei Korneev for providing the data for monolingual children and Svetlana Alekseeva for the help with corpus annotation process. All errors are our responsibility.