Frost's main argument rests on the somewhat tenuous assumption that the concept of word recognition is constant across scripts. His and other theories in alphabetic languages attempting to model visual word recognition have been based on single-syllable words, though there have been recent attempts to explain how longer words, with prefixes and suffixes, for example, are read (Grainger & Ziegler Reference Grainger and Ziegler2011; Perry et al. Reference Perry, Ziegler and Zorzi2010). Most such models of visual word recognition assume that the concept of a word as the unit of analysis in reading across scripts is always clear.
Frost's description of Chinese is particularly problematic in this respect. To be sure, Chinese characters are monosyllabic (he argues that almost all Chinese words are monosyllabic), and most characters can also serve as words. However, the majority of words in Chinese consist of two or more morphemes. Moreover, recognition of both two-morpheme words and characters can be affected uniquely by character as well as word frequency, and for both children and adults. The structures of characters and words are also different, as indicated by experimental manipulations (Liu et al. Reference Liu, Chung, McBride-Chang and Tong2010) that highlight the importance of semantic/morphological information in Chinese (Chen & Shu Reference Chen and Shu2001; Tsang & Chen Reference Tsang and Chen2010; Wong & Chen, in press). For example, whereas parents tend to teach children to write words based primarily on phonological information in alphabetic languages such as Hebrew (Aram & Levin Reference Aram and Levin2001; Reference Aram and Levin2004), they almost never focus on phonological information in teaching words or characters in Chinese (Lin et al. Reference Lin, McBride-Chang, Aram, Levin, Cheung and Chow2009; Reference Lin, McBride-Chang, Aram, Shu, Levin and Cho2012); rather, they point out either visual-orthographic or morphological information as they talk about print.
The actual unit of reading as word versus character continues to be debated in Chinese (Chen et al. Reference Chen, Song, Lau, Wong, Tang, McBride-Chang and Chen2003). A crude analogy in English might be that wallpaper and paperweight or toenail and tiptoe are all common words consisting of common morphemes (e.g., paper; toe). Yet not all English speakers analyze these words as made up of separate morphemes or acquire each morpheme comprising compound words before learning these words themselves; some compound words are learned holistically first. Chinese multimorphemic words are often learned as single entities, with the individual characters comprising them becoming salient only when explicitly analyzed. The way in which Chinese appears on a page, with each character equally spaced and no differences in spacing distinguishing what are conceptual words, may also make the concept of a word confusing. This issue is not a problem for the alphabetic languages highlighted by Frost, because words are spatially distinguished in these orthographies.
However, this is not the case in Thai, another alphabetic script. Thai has no spaces between words, so words cannot be defined by using spaces. Importantly, whereas spacing between words in Chinese does not change the speed of reading for adults, in Thai (artificial, experimental) spacing between words actually facilitates word reading, particularly for poor readers (Kasisopa et al. Reference Kasisopa, Reilly, Burnham, Shen, Bai, Yan and Rayner2010). In Thai, word-position frequencies (i.e., statistical regularities) of particular graphemes within words are used to help identify where a word is likely to begin and end. For example, some graphemes that have a high probability of occurring as the initial or the final grapheme may assist readers operationally in defining word boundaries: They effectively direct eye movements to the optimal viewing position in a word (Kasisopa et al. Reference Kasisopa, Reilly, Burnham, Shen, Bai, Yan and Rayner2010). Nevertheless, there is relatively high ambiguity in defining words in Thai since word segmentation relies heavily on sentential context (Aroonmanakun Reference Aroonmanakun, Kawtrakul and Zock2007). Thus, what is segmented as a single word in one sentence may be a phrase or even a sentence in another context. For example, the string “คนขับรถ” can be a single word or a whole sentence, depending on the context in which it occurs: It is a word in the sentence คนขับรถเป็นผู้ชาย – that is, คนขับรถ (driver) เป็น (is) ผู้ชาย (man) (“the driver is a man”) – but a three-word phrase in the sentence คนขับรถเร็ว – that is, คน (man) ขับ (drive) รถ (car) เร็ว (fast) (i.e., “The man drives the car fast”). In addition, many Thai word strings are ambiguous. For example, ตากลม can be read either as ตาก + ลม (exposed + wind = “exposed to wind”) or ตา + กลม (eye[s] + round = “round eyes”). Such ambiguities and the aforementioned contextual effects make it difficult to design automatic segmentation strategies because the number of decisions that cannot be made by machine is surprisingly high (Aroonmanakun Reference Aroonmanakun, Theeramunkong and Sornlertlamvanich2002). Extensive top-down processing is required for resolving these ambiguities. Thus, in both Chinese and Thai, the reading process involves a certain amount of flexibility in order for word recognition to be a clear and salient concept.
For Finnish, an alphabetic orthography with word boundaries marked by spaces, defining the nature of a visual word is also potentially important. Because Finnish has highly inflectional morphology with 15 cases, plural markers, and different clitics, each noun can have more than 2,000 orthographic variants (Niemi et al. Reference Niemi, Laine and Tuominen1994). The root also often changes together with inflections. Moreover, it is possible to express a complicated concept that in other languages might require multiple words by using a single word, especially for verbs, because Finnish allows so much compounding. For example, “lukea” is the basic form of “to read,” whereas “(vielä) luettuammekin…” means “(even) after we had also read…”. In the latter example, the stem change (“luettua”) denotes the past event, “mme” is an inflectional form for “we,” and “kin” is a common ending meaning “also.” These characteristics have prompted Hirsimäki et al. (Reference Hirsimäki, Creutz, Siivola, Kurimo, Virpioja and Pylkkönen2006) to suggest that, for Finnish, “traditional models based on full words are not very effective” (p. 539). Rather, these researchers advocate a word fragment model. Thus, expert Finnish readers efficiently use frequently occurring sub-units of words for which there are well-formed/strong representations and which often do not obey classical word or syllable boundaries.
From these three examples, it is not entirely clear that there can or should be a universal model of visual word recognition. Indeed, what constitutes a word across scripts may be somewhat ambiguous, though dimensions of orthography, phonology, morphology, syntax, and semantics are clearly universal components of reading.
Frost's main argument rests on the somewhat tenuous assumption that the concept of word recognition is constant across scripts. His and other theories in alphabetic languages attempting to model visual word recognition have been based on single-syllable words, though there have been recent attempts to explain how longer words, with prefixes and suffixes, for example, are read (Grainger & Ziegler Reference Grainger and Ziegler2011; Perry et al. Reference Perry, Ziegler and Zorzi2010). Most such models of visual word recognition assume that the concept of a word as the unit of analysis in reading across scripts is always clear.
Frost's description of Chinese is particularly problematic in this respect. To be sure, Chinese characters are monosyllabic (he argues that almost all Chinese words are monosyllabic), and most characters can also serve as words. However, the majority of words in Chinese consist of two or more morphemes. Moreover, recognition of both two-morpheme words and characters can be affected uniquely by character as well as word frequency, and for both children and adults. The structures of characters and words are also different, as indicated by experimental manipulations (Liu et al. Reference Liu, Chung, McBride-Chang and Tong2010) that highlight the importance of semantic/morphological information in Chinese (Chen & Shu Reference Chen and Shu2001; Tsang & Chen Reference Tsang and Chen2010; Wong & Chen, in press). For example, whereas parents tend to teach children to write words based primarily on phonological information in alphabetic languages such as Hebrew (Aram & Levin Reference Aram and Levin2001; Reference Aram and Levin2004), they almost never focus on phonological information in teaching words or characters in Chinese (Lin et al. Reference Lin, McBride-Chang, Aram, Levin, Cheung and Chow2009; Reference Lin, McBride-Chang, Aram, Shu, Levin and Cho2012); rather, they point out either visual-orthographic or morphological information as they talk about print.
The actual unit of reading as word versus character continues to be debated in Chinese (Chen et al. Reference Chen, Song, Lau, Wong, Tang, McBride-Chang and Chen2003). A crude analogy in English might be that wallpaper and paperweight or toenail and tiptoe are all common words consisting of common morphemes (e.g., paper; toe). Yet not all English speakers analyze these words as made up of separate morphemes or acquire each morpheme comprising compound words before learning these words themselves; some compound words are learned holistically first. Chinese multimorphemic words are often learned as single entities, with the individual characters comprising them becoming salient only when explicitly analyzed. The way in which Chinese appears on a page, with each character equally spaced and no differences in spacing distinguishing what are conceptual words, may also make the concept of a word confusing. This issue is not a problem for the alphabetic languages highlighted by Frost, because words are spatially distinguished in these orthographies.
However, this is not the case in Thai, another alphabetic script. Thai has no spaces between words, so words cannot be defined by using spaces. Importantly, whereas spacing between words in Chinese does not change the speed of reading for adults, in Thai (artificial, experimental) spacing between words actually facilitates word reading, particularly for poor readers (Kasisopa et al. Reference Kasisopa, Reilly, Burnham, Shen, Bai, Yan and Rayner2010). In Thai, word-position frequencies (i.e., statistical regularities) of particular graphemes within words are used to help identify where a word is likely to begin and end. For example, some graphemes that have a high probability of occurring as the initial or the final grapheme may assist readers operationally in defining word boundaries: They effectively direct eye movements to the optimal viewing position in a word (Kasisopa et al. Reference Kasisopa, Reilly, Burnham, Shen, Bai, Yan and Rayner2010). Nevertheless, there is relatively high ambiguity in defining words in Thai since word segmentation relies heavily on sentential context (Aroonmanakun Reference Aroonmanakun, Kawtrakul and Zock2007). Thus, what is segmented as a single word in one sentence may be a phrase or even a sentence in another context. For example, the string “คนขับรถ” can be a single word or a whole sentence, depending on the context in which it occurs: It is a word in the sentence คนขับรถเป็นผู้ชาย – that is, คนขับรถ (driver) เป็น (is) ผู้ชาย (man) (“the driver is a man”) – but a three-word phrase in the sentence คนขับรถเร็ว – that is, คน (man) ขับ (drive) รถ (car) เร็ว (fast) (i.e., “The man drives the car fast”). In addition, many Thai word strings are ambiguous. For example, ตากลม can be read either as ตาก + ลม (exposed + wind = “exposed to wind”) or ตา + กลม (eye[s] + round = “round eyes”). Such ambiguities and the aforementioned contextual effects make it difficult to design automatic segmentation strategies because the number of decisions that cannot be made by machine is surprisingly high (Aroonmanakun Reference Aroonmanakun, Theeramunkong and Sornlertlamvanich2002). Extensive top-down processing is required for resolving these ambiguities. Thus, in both Chinese and Thai, the reading process involves a certain amount of flexibility in order for word recognition to be a clear and salient concept.
For Finnish, an alphabetic orthography with word boundaries marked by spaces, defining the nature of a visual word is also potentially important. Because Finnish has highly inflectional morphology with 15 cases, plural markers, and different clitics, each noun can have more than 2,000 orthographic variants (Niemi et al. Reference Niemi, Laine and Tuominen1994). The root also often changes together with inflections. Moreover, it is possible to express a complicated concept that in other languages might require multiple words by using a single word, especially for verbs, because Finnish allows so much compounding. For example, “lukea” is the basic form of “to read,” whereas “(vielä) luettuammekin…” means “(even) after we had also read…”. In the latter example, the stem change (“luettua”) denotes the past event, “mme” is an inflectional form for “we,” and “kin” is a common ending meaning “also.” These characteristics have prompted Hirsimäki et al. (Reference Hirsimäki, Creutz, Siivola, Kurimo, Virpioja and Pylkkönen2006) to suggest that, for Finnish, “traditional models based on full words are not very effective” (p. 539). Rather, these researchers advocate a word fragment model. Thus, expert Finnish readers efficiently use frequently occurring sub-units of words for which there are well-formed/strong representations and which often do not obey classical word or syllable boundaries.
From these three examples, it is not entirely clear that there can or should be a universal model of visual word recognition. Indeed, what constitutes a word across scripts may be somewhat ambiguous, though dimensions of orthography, phonology, morphology, syntax, and semantics are clearly universal components of reading.