How do readers process written words? Reading is a complicated skill involving the orchestration of a number of lexical representations in terms of orthography, phonology, and semantics. In recent decades, there has been a dramatic increase the investigation of the orthographic processing in the mental lexicon. It is widely reported that there are two different orthographic processes of visual word recognition: letter identity and letter position. Successful visual word recognition not only requires actual identification of the letter within a word but also encoding of the letter position. The flexible letter position coding is well documented in current literature. Previous masked priming studies on alphabetic languages have consistently shown that transposed-letter nonwords (e.g., “jugde”−JUDGE) facilitate target word recognition relative to orthographic controls in which two different letters are replaced (e.g., “jupre”−JUDGE; Perea & Lupker, Reference Perea and Lupker2003, Reference Perea and Lupker2004; Perea et al., Reference Perea, Rosa and Gómez2005). The transposed-letter (TL) facilitation effect suggests that skilled readers are tolerant to disruptions of letter positions, also showing that their whole-word orthographic representations are automatically activated even when internal letters have been transposed in a word.
The location uncertainty in letter position coding has also been extended to the nonalphabetic languages such as Chinese. The flexibility of Chinese character position coding of compound word recognition has been repeatedly observed in adults (Bai et al., Reference Bai, Cai and Schumacher2011; Cheng, Reference Cheng1981; Mattingly & Xu, Reference Mattingly, Xu, Chang, Huang, Hue and Tzeng1994; Peng et al., Reference Peng, Ding, Wang, Taft and Zhu1999; Taft et al., Reference Taft, Zhu and Peng1999). However, remarkably little research has been conducted with children to determine whether the character transposition effect could emerge in the early Chinese reading development. This information gap prevents us from fully understanding the developmental trajectory of character position coding and mechanisms for the promotion of effective visual word recognition skills that are unique to Chinese readers. Liu et al. (Reference Liu, Chung, Zhang and Lu2014) found that Chinese elementary school students (second graders and third graders) and high school students responded to transposed-character nonwords (e.g., ‘‘麗美”) more slowly and less accurately than they did to replaced-character nonwords (e.g., “麗新”) and real words (“圖書”) in a lexical-decision task. Similar results were reported by Liu et al. (Reference Liu, Chung, McBride-Chang and Tong2010) of fourth graders using a similar lexical-decision task. Furthermore, Liu et al. (Reference Liu, Chung, Zhang and Lu2014) demonstrated that the magnitudes of transposed-character effects are similar among second and third grade students, suggesting that whole-word orthographic representation emerges at an early stage of reading acquisition. Although previous studies reveal some uncertainty surrounding character position coding of Chinese reading development, they do not address to what extent stimuli and subjects modulate the magnitude of the transposed-character effect of Chinese compound words. To bridge the knowledge gap, this study examines the impacts of word frequency and vocabulary size on the character position coding of Chinese compound words by focusing on the developmental trajectories of developing readers, which are both insufficiently understood and understudied.
The effect of whole-word frequency
Substantial evidence on visual word recognition in alphabetic languages has identified several lexical variables, including the orthographic, phonological, and semantic properties such as whole-word frequency, phonological feature, and semantic transparency, that significantly affect the processing efficiency of visual word recognition (see Giraudo & Dal Maso, Reference Giraudo and Dal Maso2016, for a review). Of these lexical variables, word frequency (i.e., the number of times a reader has encountered a word) is considered one of the strongest predictors of processing efficiency for visual word recognition among readers. Understanding the influence of word frequency on lexical access can help us infer the organization, structure, and processing of the mental lexicon (Joseph et al., Reference Joseph, Nation and Liversedge2013).
One of the main sources of evidence for the influence of word frequency on the transposed-letter effect comes from unmasked lexical-decision and text reading studies (see Andrews, Reference Andrew1996; O’Connor & Forster, Reference O’Connor and Forster1981; Perea et al., Reference Perea, Rosa and Gómez2005; White et al., Reference White, Johnson, Liversedge and Rayner2008) that have examined the important role of whole-word frequency in the letter position coding of visual word recognition. For example, O’Connor and Forster (Reference O’Connor and Forster1981) found that English-speaking adults exhibited more errors in responding to TL nonwords derived from high-frequency words than in responding to those derived from low-frequency words in a lexical-decision task. Likewise, Andrews (Reference Andrew1996) found a larger delay in the response time (around 34 ms) to TL nonwords than to orthographic controls (replaced-letter nonwords) when TL nonwords were created from high-frequency words. However, the TL effect was not observed in the lexical-decision task when TL nonwords were created from low-frequency words. The aforementioned results indicate that high-frequency TL nonwords are more effective at eliciting orthographic representations of their base words than low-frequency TL nonwords are.
A recent event-related potential (ERP) study has also lent further support to the idea that the size of the transposition effect varies as a function of whole-word frequency. For example, in both semantic categorization and lexical-decision tasks, Vergara-Martínez et al. (Reference Vergara-Martinez, Perea, Gómez and Swaab2013) demonstrated that words that are easy to identify (e.g., high-frequency words) can induce a larger transposition effect than difficult-to-identify words (e.g., low-frequency words) can. Vergara-Martínez et al. (Reference Vergara-Martinez, Perea, Gómez and Swaab2013) also found that TL nonwords created from high-frequency words exhibited N400 amplitudes similar to those of their base words. In contrast, replaced-letter nonwords created from low-frequency words triggered larger N400 amplitudes than their base words did. The results suggested higher activation of the whole-word representation of the TL nonwords derived from high-frequency words than that of TL nonwords derived from low-frequency words. The ERP results clearly showed that the whole-word access route takes priority in the accessing of high-frequency words, whereas the decompositional route plays a central role in the identification of low-frequency words. Although previous studies have revealed the influence of whole-word frequency on the magnitude of the transposition effect in adults who read morphologically simple words, the question of whether whole-word frequency modulates the transposition effect of morphologically complex words such as compound words remains unanswered in the current literature. In particular, no research has examined the influence of whole-word frequency on the transposed-character effect of Chinese compound words in developing readers during the elementary school years.
The effects of vocabulary knowledge and age
In addition to word frequency, previous studies (e.g., De Zeeuw et al., Reference De Zeeuw, Schreuder and Verhoeven2015; Hasenäcker & Schroeder, Reference Hasenäcker and Schroeder2019; Hasenäcker et al., Reference Hasenäcker, Schröter and Schroeder2017; Häikiö et al., Reference Häikiö, Bertram and Hyönä2011) have demonstrated that morphological processing is strongly tied to vocabulary knowledge (indexed by vocabulary size) in reading development. Developmental studies using the masked priming paradigm have demonstrated that transposed-letter priming decreases with age, indicating that letter position coding becomes stricter as development progresses (Acha & Perea, Reference Acha and Perea2008; Castles et al., Reference Castles, Davis, Cavalot and Forster2007). This finding has been explained by a theoretical account called the lexical tuning hypothesis (Castles et al., Reference Castles, Davis and Letcher1999, Reference Castles, Davis, Forster, Kinoshita and Lupker2003, Reference Castles, Davis, Cavalot and Forster2007). The hypothesis assumes that the children’s age might affect the magnitude of the transposition effect during visual world word recognition. Specifically, the hypothesis predicts that a modification in the lexical tuning of the word recognition system responds to a child’s vocabulary size. For old children, high-vocabulary sizes precisely tuned to match the criterion of the visual word recognition leads to a strict letter position coding (a smaller magnitude of transposition effect). For younger children, the matching criterion of word recognition might be less finely tuned by their limited vocabulary sizes in the mental lexicon. Younger children therefore are highly tolerant of the confusability of letter position information and exhibit a larger magnitude of transposed-letter effect in word recognition. Prior alphabetic research has suggested that vocabulary size and age might affect the flexible letter position coding in visual word recognition. Nonetheless, conclusions on nonalphabetic scripts such as Chinese have not been reached yet. The reason for this is that cross-linguistic generalizations might be problematic because the morphological structures are very different between English and Chinese. Morphological systems vary considerably across English and Chinese. For example, Chinese has highly productive lexical compounding, so it contains a relatively large number of compound words but few derived or inflected words. In contrast, lexical compounding in English is less prevalent than inflection or derivation. To address this differentiation, the present study tackled the question of the effects of vocabulary size and age (indexed by grade level) on the magnitude of transposed-character effect in Chinese compound word recognition.
The mouse-tracking lexical-decision paradigm
In the past few years, previous studies (Anderson et al., Reference Anderson, Farmer, Goldstein, Schwade, Spivey, Arnon and Clark2011; Cargill et al., Reference Cargill, Farmer, Schwade, Goldstein and Spivey2007; Hermens, Reference Hermens2018; Krueger et al., Reference Krueger, Storkel and Minai2018; Schroeder & Verrel, Reference Schroeder and Verrel2014) have shown that computer mouse tracking and other motor-tracking techniques (e.g., finger tracking or three-dimensional reach tracking) are feasible for studying language processing in both younger and older children (for a more comprehensive overview of using the mouse-tracking technique with child participants, see Erb, Reference Erb2018). In particular, the mouse-tracking lexical-decision paradigm has proven to be a powerful and popular research tool for assessing the cognitive processing that underlies visual word recognition (e.g., Barca & Pezzulo, Reference Barca and Pezzulo2012, Reference Barca and Pezzulo2015). In a mouse-tracking lexical-decision task, Lin and Lin (Reference Lin and Lin2016) demonstrated that the letter transposition effect could be revealed by not only key press responses (e.g., total response times and error rates) but also mouse movement curvatures, such as the area under the curve (AUC) and maximum deviation, during visual word recognition. Schroeder and Verrel (Reference Schroeder and Verrel2014) have shown that hand movement trajectories complement the traditional behavior measures and provide wholly new insights into lexical access in younger children (elementary school Grades 2 and 3), older children (secondary school Grades 5 and 6), and adults. In the present study, the kinematics of hand movements (e.g., computer cursor movement trajectories) were considered as a reliable indicator of the relative lexical activation level of “word” and “nonword” response options underlying the transposed-character effect in developing readers from different grades.
Previous studies (e.g., Bai et al., Reference Bai, Cai and Schumacher2011; Liu et al., Reference Liu, Chung, McBride-Chang and Tong2010, Reference Liu, Chung, Zhang and Lu2014; Peng et al., Reference Peng, Ding, Wang, Taft and Zhu1999) on the transposed-character effect in Chinese compound word recognition mainly used outcome-based lexical-decision measures such as response time and errors. This response-focused method has an obvious limitation in that it only reflects the end point of lexical processing, not the real-time processing underlying visual word recognition. Studies using the novel computer mouse-tracking method (e.g., Barca & Pezzulo, Reference Barca and Pezzulo2015; Dale & Duran, Reference Dale and Duran2011; Lin et al., Reference Lin, Bangert and Schwartz2015; Lin & Lin, Reference Lin and Lin2016; Tomlinson et al., Reference Tomlinson, Bailey and Bott2013) instead of traditional button-press measures have shown that the continuous nature of mouse movements presents a more detailed view of the real-time mental processing of lexical processing. Furthermore, compared with other time-sensitive measures such as ERPs, mouse tracking can quantitatively distinguish different decision-making processes from multiple response alternatives. Although ERPs provide an index of the specific timing of neural activity associated with the presentation of a single event or response option, it is difficult to track the parallel and continuous nature of lexical activation from multiple response alternatives. In contrast, mouse-tracking provides a more direct measure of multiple response activations and how the participant accumulates evidence to drive the lexical decision over a span of hundreds of milliseconds.
The unique contribution of the present mouse-tracking study is that it targeted two distinct inhibition processes underlying the transposed-character effect across reading development: a response threshold adjustment process versus a conflict resolution process. According to a reach-tracking study (Erb et al., Reference Erb, Moher, Sobel and Song2016), these two inhibition processes can be identified by initial time (the time until reach movement onset) and reach movement curvature (the degree to which a movement deviates from a direct path to the selected target). By employing these two key measures, the present study revealed how the response threshold adjustment and conflict resolution processes contribute to the transposed-character effect during compound word recognition. Therefore, one can expect that the presence or absence of transposition effects in the mouse movement parameters (initiation time and AUC) will reveal how a Chinese reader uses different inhibition processes to resolve competing interpretations of a transposed-character stimulus in visual word recognition.
The current study
The present study examined two research questions with regard to the character transposition effect with Chinese compound words. First, we investigated whether participants’ hand movement trajectories can be used to index flexible character position coding during visual word recognition. If this is the case, we would expect that developing readers’ behavioral responses would be affected by recognizing transposed-character nonwords in a mouse-tracking lexical-decision task. We expected longer movement times for transposed-character nonwords than for replaced-character nonwords. Furthermore, regarding mouse movement trajectories (for example, AUC), we anticipated that the transposed-character nonwords would trigger a greater overall spatial attraction to the “YES” response option indicating the base word rather than the replaced-character nonwords on the mouse movement trajectories. One expect that the mean of the AUC value of a trajectory can serve as a reliable index of spatial attraction toward the unselected alternative (Freeman & Ambady, Reference Freeman and Ambady2010). A larger AUC indicates a greater deviation in movement trajectory toward the unselected alternative.
Second, we examined whether the item (whole-word frequency) and subject (vocabulary size and grade) affect the magnitude of the transposition effect. In the present study, we manipulated the whole-word frequency of a base word (high frequency vs. low frequency) to investigate its influence on transposition effects during visual word recognition. As suggested by the earlier studies in alphabetic languages, high-frequency transposed-letter nonwords should be more effective in activating the whole-word orthographic representation of their base words, as they are perceptually more similar to their base words than to low-frequency nonwords. In a similar vein, we expected that a higher degree of word similarities could be elicited by high-frequency transposed-character nonwords and that they might lead to a larger-magnitude transposition effect in Chinese compound word recognition. The whole-word frequency could thus play a central role in modulating the activation level of whole-word orthographic representation of Chinese compound words. In addition to the word-frequency, we examined the impact of vocabulary size and grade level on the character position processing of Chinese compound word recognition over the course of reading development. This prediction would expand upon previous evidence on compound word processing research in children from a developmental perspective. Based on a recent study by Hasenäcker and Schroeder (2018), we hypothesized that vocabulary size has an independent role in influencing the overall processing and accuracy of compound word reading, unrelated to whole-word frequency. In regard to the effect of grade levels, we predicted that the magnitude of transposed-character effects should be similar across different grades (the second grade vs. the fourth grade, particularly) in elementary school children. This prediction is consistent with a finding from Liu et al. (Reference Liu, Chung, Zhang and Lu2014) suggesting that whole-word representation of Chinese compound words can emerge at an early stage of reading development.
Method
Participants
A sample of 82 Taiwanese children separated into two groups by grade completed the mouse-tracking lexical-decision task. The sample consisted of 32 second graders (mean age 9.8 years, SD = 0.4, range 9.0–10.0; 12 males and 20 females) and 50 fourth graders (mean age 11.4 years, SD = 0.5, range 11.0–12.0; 26 males and 24 females). All participants were native speakers of Mandarin Chinese and Taiwan Southern Min. All participating children had received exposure to Mandarin Chinese and Taiwan Southern Min from birth and literacy instruction at home and/or in after-school programs. They also studied English as a foreign language. The participants were recruited from two public elementary schools in Taiwan. Their homeroom teachers reported no reading difficulties or dyslexia in these participants. Children in the second grade should have developed good knowledge of lexical compounding and are expected to make significant progress beyond the fourth grade, so we included children in the second and fourth grades in our study to provide credible evidence of age-related differences in the effect of transposed characters.
Vocabulary knowledge measures
The Mandarin Chinese version of the Peabody Picture Vocabulary Test–Revised (PPVT; Lu & Liu, Reference Lu and Liu1998) was used to test the children’s vocabulary sizes. The PPVT was used to test the children’s receptive vocabulary by asking them to point to one of four pictures that represented a spoken word. The maximum score was 60. The measure of word reading reflected larger vocabulary sizes in the fourth graders (M = 55.1, SD = 3.3) than in the second graders (M = 50.5, SD = 4.9), with a significant difference between these two groups, t = −22.0, p < .001. The split-half reliability of the PPVT–R ranges from .90 to .97, with alternate-forms reliability ranging from .60 to .91 and test–retest reliability ranging from .84 to .90. In addition, the children’s Chinese character reading ability was assessed with the Chinese Character Recognition Test for School–Age Children (Huang, Reference Huang2001). Both the internal consistency reliability and split-half reliability of this measure is .99, with test–retest reliability ranging from .81 to .95. In the test, the children were asked to write down the characters one by one. There was a significant difference in character reading between the fourth graders (M = 124.7, SD = 24.2) and the second graders (M = 86.8, SD = 22.8), t = −33.9, p < .001. The maximum score was 200.
Materials
The full lists of the experimental stimuli for second and fourth graders are described in more detail in Appendix A. The stimuli were adapted from the stimulus sets from Experiment 1 in Lau’s study (Reference Lau2012), which were chosen from the Hong Kong Corpus of Primary School Chinese (Leung & Lee, Reference Leung and Lee2002). For the second grade stimuli list, a total number of 54 experimental items were selected, as follows: 18 Chinese two-character compound base words, 18 transposed-character nonwords, and 18 replaced-character nonwords. For the fourth grade stimuli list, 78 experimental items (i.e., 26 base words, 26 transposed-character nonwords, and 26 replaced-character nonwords) were selected. Each compound base word was matched with two non-words: (a) a transposed-character (TC) nonword, created by transposing two characters of the base words (e.g., “學習” [learning]–“習學”), and (b) a replaced-character (RC) nonword, created by replacing both characters of the base words with two other unrelated characters (e.g., “學習”–“以修”), which served as an orthographic control for the TC condition.
Example stimuli are provided in Table 1. Four running lists were created such that the experimental items and the positions (top-right and top-left corners) of the response box were counterbalanced across conditions. Half of the base words were presented in the TC condition, and the other half in the RC condition. Each participant received only one of the lists to ensure that no participant saw the same target more than once. That is, each second grade participant saw a total of 36 trials (18 base words, 9 TC nonwords, and 9 RC nonwords), and each fourth grade participant saw a total of 52 trials (26 base word, 13 TC nonwords, and 13 RC nonwords) in the mouse-tracking lexical-decision task.
Table 1. Example stimuli
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20201023013441112-0449:S0142716420000181:S0142716420000181_tab1.png?pub-status=live)
Note: HF, high frequency. LF, low frequency. TC, transposed character. RC, replaced character.
To reduce the variations in word usage between Hong Kong and Taiwan, we excluded some words that are not used in Taiwan and made sure that the critical words were suitable for Taiwanese elementary students by using the database of the List of Commonly Used Characters and Words for Taiwanese Elementary Students (Ministry of Education, 2002). For example, some high-frequency (e.g., “課室”) and low-frequency (e.g., “齊備”) words could not be used because those items exist in Hong Kong Mandarin vocabulary repositories but not in Taiwan Mandarin.
Half of the base words were high-frequency whole-word compounds, and half were low-frequency whole-word compounds. The high-frequency and low-frequency base words were matched for morpheme/character frequency, constituent morphological family size, number of strokes, and imageability of the first and second constituent characters. Details of the selected stimuli are listed in Table 2. A small number of stimuli remained after the lexical variables were matched. In addition, to avoid the possible confounding variable of semantic transparency, we selected only semantically transparent words for this study. This word selection criterion limited the number of stimuli that could be used in the present study.
Table 2. Means and standard deviations of whole-word frequencies, constituent morpheme frequencies, and constituent family sizes of the HF words and LF words selected for different grade levels
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20201023013441112-0449:S0142716420000181:S0142716420000181_tab2.png?pub-status=live)
Note: HF, high frequency. LF, low frequency.
According to Lau’s argument (Reference Lau2012), “a lack of developmental perspective in choosing stimuli might lead to severe errors because a high-frequency word at the grade four levels could have relatively low frequency at the grade two levels” (see p. 73). In our pilot study, we did find that it was difficult for either group to read the same set of stimuli when they were not age-inappropriate materials for some students. Accordingly, the properties of the stimuli in our materials were carefully controlled from a developmental perspective according to the children’s grade levels. For example, for the second grade stimuli, the average whole-word frequency count of the high-frequency words was 4.2% (SD = 0.4%), and for the low-frequency words, it was 1% (SD = 0%). Similarly, for the fourth grade stimuli, the average whole-word frequency count of the high-frequency words was 4.2% (SD = 0.4%); for the low-frequency words, it was 1% (SD = 0%). In the present study, each word item was labeled as either a high- or low-frequency word, following from a Hong Kong Word Corpus of Primary School Chinese consisting of 8,142 words coded for word frequency (Lau, Reference Lau2012). High- and low-frequency words were operationalized as having a count of 4 or more occurrences and 1 or fewer occurrences per 8,142 words, respectively. Furthermore, the Cronbach’s α internal consistency coefficients were .85 for the second grade stimuli, .86 for the fourth grade stimuli, and .86 for the entire task.
Procedure
Participants sat in a quiet testing room in front of a computer screen. They first performed 10 practice trials to familiarize themselves with the procedure; the experimental trials followed. Stimulus presentation was controlled by MouseTracker software (sampling rate of mouse pointer, 70 Hz; Freeman & Ambady, Reference Freeman and Ambady2010). As shown in Figure 1, participants initiated each trial by clicking a “開始/START” button centrally located at the bottom of the screen. This button was then replaced by two-character compound words or nonwords in Chinese. The character string remained on the screen for 3000 ms. Participants were asked to quickly and accurately determine whether or not the word was a real word by moving to and clicking a “是/YES” (indicating it is a word) or “否/NO” (indicating it is a nonword) box in the top right and left corners of the screen, respectively. The stimulus order was randomized.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20201023013441112-0449:S0142716420000181:S0142716420000181_fig1.png?pub-status=live)
Figure 1. Schematic of a single trial for the mouse-tracking lexical-decision task.
If a participant’s total response time for a lexical decision on a particular trial exceeded 3000 ms, “TIME OUT” appeared on the screen. We excluded trials with total response times slower than 3000 ms and trials where the participant did not respond correctly. For the remaining correct trials, we eliminated values above and below the ±3 SD limits around the grand mean of the participants. Participants were given feedback (i.e., whether their responses were correct or incorrect) at the end of each trial to help them self-monitor their performance. The temporal characteristics and x-, - coordinates (in pixels) of mouse trajectories were recorded by the MouseTracker software.
Results
All analyses were performed in R (R Core Team, 2015) and the R package “lme4” (Bates et al., Reference Bates, Mächler, Bolker and Walker2015). We used linear mixed models (LMMs) analysisFootnote 1 to evaluate the impacts of transposed-character similarity, grade, and word frequency on total response time (TRT), initiation time (IT), movement time (MT), and AUC. TRT was the time elapsed between clicking of the START button and the response completion, and it was similar to the response time of discrete responses (e.g., key presses or verbal responses) in a traditional lexical-decision task. IT referred to the time in macroseconds at which the first mouse movement was initiated (the time elapsed between stimulus presentation and mouse movement initiation). If the TC effect was observed in IT, we would expect the TC effect to precede mouse movement execution. In contrast, MT (MT = TRT – IT) was defined as the interval between the onset of the initial mouse movement and response completion. If the TC effect affected MT, then one would assume that the TC effect would continue to influence actions even after a mouse movement had been initiated.
Separate LMMs were created for each dependent measure. Log transformations were conducted on TRTs, ITs, and MTs to correct for distribution skews. We also performed a generalized linear mixed-effects (GLMMs) analysis on the accuracy. As fixed effects, we entered similarity type, grade, word frequency, and their interactions into the models. As random effects, we had intercepts for subjects and items. Each of the categorical variables was effect (sum) coded.
Additional analyses paralleled the original analysis and were conducted to examine whether the magnitude of transposed-character effects was related to the participants’ vocabulary size and single-character reading. Previous studies (e.g., Liu et al., Reference Liu, Chung, McBride-Chang and Tong2010; Wang & McBride, Reference Wang and McBride2016) found that character and word reading might constitute somewhat different processes in Chinese children. To investigate their effects on the similarity type, we separately entered them following two continuous measures, such as vocabulary size and character reading, as fixed factors and their interactions with similarity type in the LMMs and GLMMs models. These two continuous measures were centered around their respective means.
The 82 participants completed a total of 1,876 trials. Of these, 499 trials (27%) resulted in incorrect responses, and they were excluded in data analysis. For the time analysis, outliers were removed by excluding TRT, INIT, MT, and AUC values that exceeded three standard deviations (TRT: 9 trials, 0.5%; IT: 24 trials, 1.3%; MT: 10 trials, 0.6%; AUC: 3 trials, 0.2%) from the mean for each participant across the correct trials. We also excluded an additional 41 trials (2.3%) for which response times exceeded the deadline (3000 ms). All subsequent analyses of RTs (1,327 trials), ITs (1,312 trials), MTs (1,326 trials), and AUCs (1,333 trials) were conducted on the remaining trials from the participants. The descriptive statistics of the TRTs, ITs, MTs, AUCs, and accuracy rates in the mouse-tracking lexical-decision task are provided in Tables 3 and 4.
Table 3. Mean and standard errors for total response times, initiation times, and movement times by word frequency, similarity type, and grade
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20201023013441112-0449:S0142716420000181:S0142716420000181_tab3.png?pub-status=live)
Note: HF, high frequency. LF, low frequency. TC, transposed character. RC, replaced character. Table 3 shows untransformed but trimmed reaction times, initiation time, and movement times for correct responses.
Table 4. Mean and standard errors for area under the curve values and accuracy rates by word frequency, similarity type, and grade
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20201023013441112-0449:S0142716420000181:S0142716420000181_tab4.png?pub-status=live)
Note: HF, high frequency. LF, low frequency. TC, transposed character. RC, replaced character. aUntransformed but trimmed area under the curve values for correct responses.
Total response times
To determine whether the main effects of similarity type (transposed character vs. replaced character), grade (second grade vs. fourth grade), compound word frequency (low- vs. high- whole-word frequency), and their interactions were significant, the Kenward–Roger approximationFootnote 2 (Kenward & Roger, Reference Kenward and Roger1997) was implemented using the mixed function from the R package (Singmann et al., Reference Singmann, Bolker and Westfall2015) to estimate the p values in our models. The LMM model of the total response time revealed significant main effects of similarity type (B = 0.04, t = 3.53, p < .001), grade (B = −0.24, t = −6.45, p < .0001), and frequency (B = −0.04, t = −3.13, p < .01). We found that our participants’ response times with the transposed-character nonwords were 84 ms slower than response times with the RC nonwords (1578 vs. 1493 ms), indicating TC effects in both second and fourth graders. In addition, the second graders spent significantly more time recognizing the nonwords (across TC and RC nonwords) than the fourth graders did, as indicated by the negative estimate (1743 vs. 1375 ms). Furthermore, the lexical decision times were longer for low-frequency words (1562 ms) than for high-frequency words (1509 ms). However, the three-way interaction among similarity type, grade, and word frequency was not significant (B = −0.03, t = −0.63, p = .53). In addition, neither the interaction of Similarity Type × Grade (B =0 .02, t = 0.82, p = .41) nor that of Similarity Type × Frequency (B = −0.02, t = −0.76, p = .44) was significant, suggesting that grade level and frequency did not affect the magnitude of the TC effect.
Similar to the initial data analysis, we also found a main effect of similarity type when both vocabulary size and character reading (B = 0.0347, t = 2.57, p < .05) were added to the model. There were also main effects of vocabulary size and character reading (B = −0.0207, t = −3.24, p < .01, and B = −0.0025, t = −3.13, p < .01), indicating that both a larger vocabulary size and a greater character reading ability resulted in faster response times. However, there were no interactions between similarity type and vocabulary size (B = −0.0039, t = −0.934, p = .3504) or similarity type and character reading (B = 0.0007, t = 1.367, p = .1720).
Initiation times
As before, using the Kenward–Roger approximation, we did not find any significant main effects of similarity type, grade, frequency, or their interactions. Our results revealed neither main effects of similarity type (B = −0.014, t = −0.275, p = .783) nor interactions between similarity type and vocabulary size (B = 0.018, t = 1.08, p = .283) or similarity type and character reading (B = −0.0005, t = −0.238, p = .812). There were no significant main effects of vocabulary size (B = −0.021, t = −1.050, p = .297) or character reading (B = 0.002, t = 0.956, p = .342).
Movement times
Movement time results indicated significant effects of similarity type (B = 0.051, t = 3.46, p < .001), grade (B = −.258, t = −5.79, p < .0001), and frequency (B = −0.038, t = –2.56, p < .011). The participants’ movement times for the TC nonwords were 93 ms slower than those for the RC nonwords (1378 vs. 1285 ms), indicating TC effects in both second and fourth graders. Moreover, the second graders spent significantly more time recognizing the nonwords across TC and RC nonwords than did the fourth graders (1519 vs. 1187 ms). Furthermore, lexical decision times were longer for low-frequency words (1353 ms) than for high-frequency words (1309 ms). None of the other effects were significant.
After entering both vocabulary size and character reading into the final data analysis, we also found significant main effects of similarity type (B = 0.047, t = 2.741, p < .01), vocabulary size (B = −0.020, t = −2.723, p = .0079), and character reading (B = −0.004, t = −3.822, p = .0003). Neither the interaction between similarity type and vocabulary size (B = −0.005, t = −1.004, p = .3155) nor that between similarity type and character reading (B = 0.0006, t = 0.827, p = .4082) was significant.
Accuracy rates
In the GLMM with accuracy rates as the dependent outcome, our results indicated a three-way interaction among similarity type, grade, and word frequency (B = 1.17, t = 2.35, p < .05). The results also indicated significant effects of word frequency (B = 0.47, t = 3.79, p < .001) and similarity type (B = −0.31, t = −2.46, p < .05). There was no significant interaction between similarity type and word frequency (B = 0.32, t = −1.27, p = .2053).
However, we did find a significant interaction between similarity type and grade (B = −0.72, t = −2.88, p < .01). An examination of simple main effects revealed that the effect of similarity type was significant for fourth graders (B = −0.67, t = −4.29, p < .0001), but not for second graders (B = 0.06, t = 0.31, p = .753). The results showed that the fourth graders had lower accuracy rates on the TC nonwords than on the RC nonwords (70% vs. 81%), whereas there was no difference in accuracy rates between TC nonwords and RC nonwords for the second graders (78% vs. 77%). This suggests that, unlike the second graders, the fourth graders exhibited a reliable transposed-character effect. Neither the interaction between similarity type and vocabulary size (B = 0.01, t = 0.172, p = .8636) nor the interaction between similarity type and character reading (B = −0.004, t = −0.93, p = .353) was significant.
AUC
To evaluate the overall spatial attraction of the mouse trajectory, AUC values were also introduced into the LMM analysis. As shown in Figure 2, the AUC is the geometric area enclosed by the observed mouse-trajectory and the idealized straight-line trajectory connecting the starting position and the end position (Freeman & Ambady, Reference Freeman and Ambady2010). The main effect of similarity type was significant (B = 0.521, t = 6.22, p < .0001); the results demonstrated that the TC nonwords exhibited a greater overall spatial attraction to the “YES” response option than did the RC nonwords across grades (see Figure 3). As we expected, this result indicated that the TC nonwords elicited a larger spatial attraction to the base word response option. Nonetheless, the main effects of grade (B = 0.227, t = 1.14, p = .2576) and word frequency were not significant (B = −0.153, t = –1.823, p = .0686), nor were the other two-way or three-way interactions significant. Most important, our results indicated a significant interaction between similarity type and word frequency (B = 0.37, t = 2.22, p < .05). The interaction between similarity type and frequency was explored employing the R package (De Rosario-Martínez, Reference De Rosario-Martínez2015). An examination of simple main effects revealed that the effect of similarity type was significant for the nonwords derived from both high-frequency words (χ2 = 39.40, df =1, p < .0001) and low-frequency words (χ2 = 7.58, df = 1, p < .01). Of interest, the AUC data further showed that the magnitude of the effect of similarity type was greater for high-frequency words than for low-frequency words (χ2 = 5.25, df = 1, p < .05). As for the effects of vocabulary size and character reading, we only found significant main effects of similarity type (B = 0.5243, t = 5.329, p < .000), but no interactions between similarity type and either vocabulary size (B = 0.008, t = 0.273, p = .785) or character reading (B = −0.005, t = −1.224, p = .221). None of the other effects were significant.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20201023013441112-0449:S0142716420000181:S0142716420000181_fig2.png?pub-status=live)
Figure 2. Illustration of the area under the curve (AUC) in our mouse-tracking lexical-decision task.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20201023013441112-0449:S0142716420000181:S0142716420000181_fig3.png?pub-status=live)
Figure 3. Schematic illustrations of mean movement trajectories for transposed-character nonwords (TC nonwords, triangles) and replaced-character nonwords (RC nonwords, circles) in the (a) second graders and (b) fourth graders. In both panels, it is clear that the trajectories for the TC nonwords were strongly attracted to the “YES” word response for a significant portion of the movement response as compared to RC nonword trajectories.
Discussion
In the present study, we used a mouse-tracking lexical-decision task to examine the emergence of whole-word processing of compound words via the TC effect in early Chinese reading development. Mouse tracking was used to record the overall spatial attraction of computer mouse movement (AUC) toward the alternative response option, suggesting that lexical processing cascades into motor execution. We found that the TC effect can be observed in the kinematics of mouse movements in second and fourth graders. As predicted, TC nonwords triggered a greater overall spatial attraction to the real-word response option (the base word) relative to the RC nonwords, as exhibited by the mouse movement trajectories over time during lexical decision. That is, TC effects were observed in the AUC values, in both second and fourth graders. Based on this finding, movement trajectories of the computer mouse cursor can serve as a reliable index of the TC effect in Chinese reading development and mouse tracking presents a new tool to study the links among perception, cognition, and action across reading development, which is a major strength of the current study. Our finding also suggests that the mouse trajectory and other conventional dependent variables (e.g., accuracy and reaction time) are complementary and they can be tapped into similar TC effects. Similar to the mouse trajectory results, the total response times and movement times of all the children were higher in the TC condition than in the RC condition. Our results on total response times and movement times are similar to those of previous studies in Chinese adults and children (e.g., Bai et al., Reference Bai, Cai and Schumacher2011; Liu et al., Reference Liu, Chung, McBride-Chang and Tong2010, Reference Liu, Chung, Zhang and Lu2014; Peng et al., Reference Peng, Ding, Wang, Taft and Zhu1999; Taft et al., Reference Taft, Zhu and Peng1999) performing lexical-decision tasks, indicating that Chinese readers tend to spend more time and make more errors in the TC condition than in the RC condition. For example, Liu et al. (Reference Liu, Chung, Zhang and Lu2014) demonstrated that both second- and third-grade elementary students showed very similar sizes of the TC effect on both response times and errors in a lexical-decision task.
Combining the results from mouse movement trajectories, total response times, movement times, and accuracy rates, we can conclude that there was a robust TC effect, showing that developing Chinese readers exhibited stronger spatial attraction to the base words and more motor response conflict for TC nonwords than for RC nonwords. The results of the present study lead to an assumption that the whole-word access route of Chinese compound word reading could emerge in the early reading acquisition of children.
Unlike a previous study of Liu et al. (Reference Liu, Chung, Zhang and Lu2014) with a conventional lexical-decision task, the unique contribution of the present mouse-tracking study not only informs us whether a lexical-decision process is fast or slow but also provides a more direct real-time and data-rich window into the dynamics of inhibition processes underlying compound word recognition. The present study has identified two distinct types of inhibition processes during the lexical-decision task: the response threshold adjustment and conflict resolution processes underlying the TC effect of Chinese compound word recognition. These two motor movement measures (i.e., initiation time and AUC) were documented in previous literature (e.g., Erb et al., Reference Erb, Moher, Sobel and Song2016) to index the response threshold adjustment process and the conflict resolution process, respectively. In the current study, we did not find a significant TC effect on initiation time (i.e., the time elapsed between the beginning of a trial and the initiation of mouse movement), which suggests that there was no difference between TC and RC nonwords when participants initiated their movement responses. Based on this result, it is likely that TC and RC nonwords might trigger comparable degrees of motoric stopping before a movement is started, which might be due to their similar response thresholds. The result shows us the TC effect might not arise at an early representational stage, which implies that the activation of whole-word orthographic representation did not occur before motor response selection stage of lexical access. Still, the stimuli used and other confounding factors may contribute to the nonsignificant effect; they have yet to be fully explored using real-time metrics of conflict, and we look forward to future research in testing the assumption.
In contrast, we detected the transposition effect in the mouse movement trajectories and movement times (the time elapsed between movement onset and response completion) throughout ongoing motor responses. It is more likely that the observed transposition effect occurs at a lexical-decision stage in which two orthographic codes compete (i.e., the competition between real-word and nonword orthographic nodes) and the competition feeds forward into the later motor response stage. The character transposition effect arises as response competition during the preparation and execution of the manual response stage. A similar explanation was elaborately explored in other mouse-tracking studies using a lexical-decision task (see Barca & Pezzulo, Reference Barca and Pezzulo2012; Barca et al., Reference Barca, Pezzulo, Ouellet and Ferrand2017), which reported that the interference arises at the manual response/late stage, in which two orthographic codes (i.e., the real-word versus nonword) are activated in functionally independent routes, and then these codes compete at a motor decision stage during lexical access. Such a conflict resolution process at the motor response stage also has been explained by the late interaction model (see Santens & Verguts, Reference Santens and Verguts2011, for a detailed discussion about the shared-decision account) in other cognition domains (e.g., numerical cognition). Another important finding of the present study is that the magnitude of the transposition effect did not vary as a function of grade level and vocabulary knowledge. Consistent with one finding of a past study by Liu et al. (Reference Liu, Chung, Zhang and Lu2014), our results indicate that the Taiwanese children exhibited the TC effect regardless of their different grades. The second graders (mean age = 9.8 years) showed a magnitude of the TC effect similar to that of the fourth graders (mean age 11.4 years). Liu et al. (Reference Liu, Chung, Zhang and Lu2014) found a nonsignificant interaction effect between grade level and similarity type, which also demonstrated that the magnitude of the TC effect was similar in young readers of Chinese in the second (mean age = 7.3 years) and third (mean age = 8.3 years) grades. These results imply flexibility in character position encoding during visual word recognition throughout development and that developing readers of Chinese are less sensitive to constituent morphemes during compound word recognition in early stages of reading acquisition.
Moreover, we found that individual differences in vocabulary knowledge did not affect whole-word processing in children’s reading acquisition and development in elementary school. Although previous studies reported that the developmental trajectories of morphological processing are moderated by individual differences in vocabulary knowledge (Hasenäcker et al., Reference Hasenäcker, Schröter and Schroeder2017), our empirical results are not in line with the previous findings. Our finding suggests a weaker effect of vocabulary knowledge on orthographic or morphological processing mechanisms in developing readers of Chinese, though vocabulary knowledge of the quality lexical representations plays an important role in the crucial building blocks of reading development. There are two plausible explanations for the finding. First, we reason that written vocabulary knowledge can instead be expected to feed more directly into the semantic process, and this might not reinforce the character transposition in orthographic or morphological processes in early reading development. Second, it is also possible that individual differences in vocabulary knowledge are associated with differences in the manner and/or extent of the decompositional route instead of the whole-word route, as Andrews and Lo (Reference Andrews and Lo2013) suggested may be true in skilled readers. As a result, a more comprehensive understanding of direct comparison of the whole-word frequency and constituent frequency of compound words in reading development across the elementary school years will be needed. It is important to concurrently manipulate constituent frequency and whole-word frequency in children within the scope of developmental studies on Chinese compound word recognition. Doing so would provide further insight into the character position processing mechanisms underlying the emerging ability to recruit the whole and decompositional access routes in visual word recognition.
Furthermore, we found that whole-word frequency significantly modulated the TC effect in accuracy and hand movement trajectory (i.e., AUC) rather than response durations (i.e., total response time, initiation time, and movement time). Our results suggest that word frequency variation mainly affected the participants’ accuracy and hand trajectories when they categorized lexical stimuli but did not impact their processing speed. The previous literature on the character position processing is predominantly based on accuracy and latency measurements of key-press responses. The current study expands on existing evidence by revealing the influence of lexical frequency on character position processing during Chinese compound word recognition through the measurement of computer mouse-movement trajectories. Our data clearly indicate that a larger magnitude TC effect exists for nonwords derived from high-frequency words than for nonwords derived from low-frequency words, suggesting that high-frequency TC nonwords are more effective in activating the whole-word orthographic representation of their base words and that they are perceptually closer to their base words than low-frequency nonwords are. Similar findings were demonstrated in previous research in English-speaking adults with lexical-decision, naming, and semantic categorization tasks (see Andrews, Reference Andrew1996; O’Connor & Forster, Reference O’Connor and Forster1981; Perea et al., Reference Perea, Rosa and Gómez2005; Vergara-Martínez et al., Reference Vergara-Martinez, Perea, Gómez and Swaab2013). Based on this compelling evidence, we can infer that lexical frequency might be an influential factor affecting the magnitude of TC effects in developing children.
An important theoretical implication of the present study is the introduction of a nonsensitive character position encoding assumption to the premise of previous models in Chinese compound word recognition (see Li et al., Reference Li, Rayner and Cave2009, for the word segmentation and recognition model; Taft, Reference Taft2004 for the multilevel interactive model; Zhou & Marslen-Wilson, Reference Zhou and Marslen-Wilson1995, for the multilevel cluster representation model). Contrary to previous models, our result demonstrated the flexibility of character position encoding in Chinese compound word reading. The present study not only delineates the flexibility of positional information in learning to read but also reveals the developmental trend of accessing compound words at the whole-word level, providing novel insights about the significant effect of word frequency on whole-word representation. Furthermore, this point is of interest because it can advance current word reading interventions and reading ability assessments of Chinese compound word recognition with respect to whole-word level processing across reading development.
A potential limitation in our study is the relatively small number of stimuli, which might limit the generalizability of our findings. Similarly to previous Chinese compound word studies, we had difficulty in selecting sufficient numbers of paired sets of compound words that differed in a categorical variable (e.g., high- vs. low-frequency words) with good matches on other lexical variables such as the morpheme frequency, morphological family size, number of strokes, and imageability of the first and second constituent characters (see Tse et al., Reference Tse, Yap, Chan, Sze, Shaoul and Lin2017, for a comprehensive discussion). In addition, it is essential to note that all the compound words used in the stimuli sets were semantically transparent instead of semantically opaque. This criterion for stimulus selection prevented the present study from using a larger number of stimuli. In future studies, we can address this issue by including semantically opaque compounds.
Conclusion and future direction
This research aimed to investigate a developmental trend of the TC effect in Chinese compound word processing by examining the kinematics of mouse movements in developing readers. Our findings indicate that developing readers of Chinese exhibit flexibility in character positional encoding and accessing the base word representation while recognizing two-character compound words. Our results are in line with the findings reported by Liu et al. (Reference Liu, Chung, Zhang and Lu2014) in their testing of second and third graders. Liu et al. (Reference Liu, Chung, Zhang and Lu2014) reported similar results for Chinese elementary school students. Their results indicated that second- and third-grade students show very similar sizes of TC effects in the response times and errors in a lexical-decision task. Unlike their study, the current study did not include proficient readers of Chinese such as high school students or adults. To obtain a clearer developmental trend of the transposition effect in Chinese reading development, the inclusion of proficient readers of Chinese should be further considered in future research.
To date, it should be noted that relatively few studies have investigated how the size of the TC effect changes across grades in the development of Taiwanese children’s Chinese reading. Unlike response-based outcomes, such as the lexical-decision task adopted by Liu et al. (Reference Liu, Chung, Zhang and Lu2014), we employed a novel experimental paradigm that also produced a similar result for response-time data. Nonetheless, it is difficult, using only outcome-based measures like response-time and accuracy rate, to infer anything about lexical and cognitive processes (e.g., inhibitory control processes) of the TC effect in Chinese children. The response data from previous studies cannot be used to examine whether the TC effect arises in an early representational stage or in a late motor decision stage. In contrast, our mouse-tracking task was a time-sensitive and process-tracking methodology designed to detect the TC effect that occurred before the response was executed or during the time course of the mouse movement. The mouse-tracking lexical-decision paradigm required participants to reject the unselected alternative response from the base word (i.e., the “YES” response option). Therefore, this paradigm not only added to the lexical processing demand but also increased the probability of detecting the inhibition processes underlying the transposition effects via mouse initiation time and movement trajectories.
In summary, the current study, by using mouse movement trajectories, showed that the magnitude of the TC effect does not change across grades in elementary school. A similar finding was obtained by Liu et al. (Reference Liu, Chung, Zhang and Lu2014). Our mouse-movement data demonstrated that the TC nonwords had strong effects on the mouse-movement trajectories at the late motor response decision stage in Chinese reading development. Specifically, the results showed that the size of the TC effect varies with the lexical frequency, with stronger TC effects for high-frequency words. The influence of a higher degree of word-likeness of the high-frequency word on the transposition effect is not specific to alphabetic writing systems (e.g., English), as these effects were also observed in a nonalphabetic writing system (e.g., Chinese). This finding contributes to a deeper understanding of the important role played by lexical item in the character position encoding in Chinese reading development. Such a novel finding demonstrates how critical it is to consider issues of lexical properties, such as word frequency, when researchers are evaluating the developmental trajectories of TC effects in Chinese compound word recognition.
Acknowledgments
We sincerely thank Chang-Shou Lin (林彰收) and Chin-Yun Wu (吳錦雲) for their generous and invaluable assistance with the data collection. They are our heroes and inspiration. Without their efforts, this study could never have been conducted. Therefore, the article is dedicated to them.
Appendix A. Full stimulus sets for the transposed-character and replaced-character conditions
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20201023013441112-0449:S0142716420000181:S0142716420000181_tabu1.png?pub-status=live)