INTRODUCTION
Decades of reading research led to an almost universal agreement about three fundamental aspects that govern reading: orthography (or spelling), phonology (or sound) and semantics. All three appear to contribute interactively to the reading process. Empirical evidence for the role of orthography and phonology in experienced and fluent reading is abundant (e.g. Barker, Torgesen & Wagner, Reference Barker, Torgesen and Wagner1992; Berent & Perfetti, Reference Berent and Perfetti1995; Bosman & van Hell, Reference Bosman, van Hell, Verhoeven, Elbro and Reitsma2002; Juel, Griffith & Gough, Reference Juel, Griffith and Gough1986; Stanovich & West, Reference Stanovich and West1989; Van Orden, Pennington & Stone, Reference Van Orden, Pennington and Stone1990; Seidenberg & McClelland, Reference Seidenberg and McClelland1989).
Although the role of semantics in reading (e.g. Azuma & Van Orden, Reference Azuma and Van Orden1997; Meyer & Schvaneveldt, Reference Meyer and Schvaneveldt1971; Strain, Patterson & Seidenberg, Reference Strain, Patterson and Seidenberg1995) and learning to read (e.g. Assink, Van Bergen, Van Teeseling & Knuijt, Reference Assink, Van Bergen, Van Teeseling and Knuijt2004; Scarborough, Reference Scarborough1991) has been recognized, it has not received as much attention as the orthographic and phonological aspects have. The main goal of this study, therefore, is to add to the understanding of the role of semantics to reading in students attending primary grades. We will address the issue of semantic development and the relationship between semantic skills and word reading.
Semantic(s) is a rather abstract notion, revealed by the many different ways in which it has been operationalized in the domain of experimental psycholinguistics. Examples are recognizing and defining words (receptive and expressive vocabulary tests; e.g. Vellutino, Scanlon & Spearing, Reference Vellutino, Scanlon and Spearing1995), generating words in an association test (semantic fluency; e.g. Nation & Snowling, Reference Nation and Snowling2004), making synonym judgments (e.g. Nation & Snowling, Reference Nation and Snowling2004), detecting common features in semantic concepts (e.g. Vellutino et al., Reference Vellutino, Scanlon and Spearing1995), generating exemplars of a semantic category (e.g. Ben-Dror, Bentin & Frost, Reference Ben-Dror, Bentin and Frost1995) and verifying whether a word belongs to a certain semantic category (e.g. Ben-Dror et al., Reference Ben-Dror, Bentin and Frost1995). In addition, ‘semantic skills’ has also been used to refer to the notion of higher levels of processing, that is, the ability to predict the plot of a story, extract the meaning of a story and discuss a book after reading.
The first empirical demonstration of the role of semantics in word identification is the seminal study by Meyer & Schvaneveldt (Reference Meyer and Schvaneveldt1971) on semantic priming. In their lexical decision task, participants responded faster and more accurately to the second word, nurse, in a semantically related word pair (doctor followed by nurse) than in a semantically unrelated word pair (e.g. cloud followed by nurse). Semantic priming studies have been carried out with both adult participants (see Neely, Reference Neely, Besner and Humphreys1991, for a review) and children (e.g. Assink et al., Reference Assink, Van Bergen, Van Teeseling and Knuijt2004). In general, these studies have demonstrated that target words preceded by a semantically related prime word are recognized faster and with fewer errors than words preceded by an unrelated prime word, and suggest that semantic activation appears to facilitate word identification.
A second line of evidence for the role of semantics in word identification has been provided by studies on single word reading. Words associated with multiple meanings (Azuma & Van Orden, Reference Azuma and Van Orden1997) and ambiguous words (Borowsky & Masson, Reference Borowsky and Masson1996) are processed more quickly in a lexical decision task than words with single meanings or unambiguous words. Words without clear synonyms (Pecher, Reference Pecher2001), highly imageable words (Strain et al., Reference Strain, Patterson and Seidenberg1995) and words with a large number of features (Pexman, Lupker & Hino, Reference Pexman, Lupker and Hino2002) are also read relatively more quickly. The general conclusion is that semantically rich words (i.e. ambiguous words, highly imageable words and words with a large number of features) are recognized faster and with fewer errors than words with fewer and less enriched semantic features.
The notion that semantics contributes to the identification of single words raises the issue of the relationship between semantic skills and reading development. Scarborough (Reference Scarborough1991), for instance, showed that a low level of semantic skills, such as vocabulary, language comprehension and narrative skills, play a role in the development of reading difficulties (see also Keenan & Betjemann, Reference Keenan, Betjemann, Grigorenko and Naples2008). Berends & Reitsma (Reference Berends and Reitsma2006) found evidence for the superiority of semantic training. Focusing on the semantic characteristics of words improved reading fluency more than orthographic training in Grade 2 students.
A large part of experimental studies on the role of semantics in word reading has focused on semantic categorization. Semantic categorization is a means to assess taxonomic knowledge and is often used to obtain insight into the organization of the semantic memory system. Taxonomic knowledge refers to the organization of concepts in a hierarchical structure. It pertains, for example, to the knowledge that a poodle belongs to the category ‘dogs’ or that cats and dogs share meaning similarities because they both belong to the semantic category ‘animals’ or ‘pets’.
It is assumed that semantic knowledge in adults is organized hierarchically, and that even young children are able to use taxonomic relationships between concepts (e.g. Nguyen & Murphy, Reference Nguyen and Murphy2003). With respect to taxonomic knowledge, a distinction can be made between horizontal and vertical relationships. Horizontal relationships refer to items at the same hierarchical level (e.g. ‘cat – dog’) and vertical relationships refer to items at different hierarchical levels, including the use of superordinate terms (e.g. ‘animal – dog’). Lucariello, Kyratzis & Nelson (Reference Lucariello, Kyratzis and Nelson1992) showed that conventional, horizontal and vertical, taxonomic knowledge develops with age and emerges around the age of seven.
Several studies revealed a positive relationship between semantic categorization skills and reading in primary grades (e.g. Ben-Dror et al., Reference Ben-Dror, Bentin and Frost1995; Howell & Manis, Reference Howell and Manis1986; Vellutino et al., Reference Vellutino, Scanlon and Spearing1995). Howell & Manis (Reference Howell and Manis1986) presented participants with pictures and printed words that were exemplars of one of four semantic categories. Participants had to decide whether the stimulus (e.g. ‘dog’) was presented with either the correct superordinate label (i.e. ‘animal’) or the correct basic label (i.e. ‘dog’). Performance differed as a function of age and reading skill. Poor readers were significantly slower in making decisions than controls and this difference was larger in younger readers than in older readers. Slower semantic performance of poor readers was apparent in the pictures as well as the printed words condition, which suggests that semantic skills are not specific to written words.
Ben-Dror et al. (Reference Ben-Dror, Bentin and Frost1995) addressed the relationships among semantic categorization, morphological skills, phonological skills and reading in Hebrew. In their task, children were presented with a semantic category (e.g. ‘clothing’), followed by positive exemplars (e.g. ‘dress’) and negative exemplars (e.g. ‘boat’). Participants were both typical readers and children with reading difficulties from Grade 5 and a control group of typical readers from Grade 3. Results showed that poor readers performed worse (more errors and longer reaction times) than typical readers at the same age. In short, these studies reveal that poor semantic categorization skills are associated with poor word reading skills.
There are, however, researchers who failed to demonstrate a relationship between semantic categorization and word reading. Vellutino, Scanlon & Tanzman (Reference Vellutino, Scanlon and Tanzman1990) presented poor readers and typical readers from Grades 1, 2, 4 and 6 with triads of words. They were asked to ‘put together words that go together’. They used words with a horizontal semantic relationship (e.g. roof and wall), a vertical semantic relationship (e.g. food and cake) or a phonological/orthographic relationship (e.g. rest and roof). It appeared that poor and typical decoders did not differ with respect to their semantic categorization skill.
Silva-Pereyra et al. (Reference Silva-Pereyra, Rivera-Gaxiola, Fernández, Díaz-Comas, Harmony, Fernández-Bouzas, Rodríguez, Bernal and Marosi2003) also challenged the claim that poor readers exhibit semantic deficits and attributed poor readers' lower performance in semantic tests to deficiencies in the processing of words, rather than to poor semantic skills per se. In their experiment, children were presented with pictures and words representing animals or non-animals (e.g. household items). They were asked whether the stimulus was an animal or not. Poor readers differed from control readers on a word categorization task, but not on a picture categorization task.
The reason for these contradicting results concerning the relationship between semantic skills and word reading might be that it changes with age. After all, this phenomenon has also been demonstrated in the relationship between word identification and phonological skills, and word identification and orthographic skills. Wagner et al. (Reference Wagner, Torgesen, Rashotte, Hecht, Barker, Burgess, Donahue and Garon1997) showed that the effect of phonological awareness on naming faded with development, whereas the relationship between orthographic skills and word reading increased with age (Juel et al., Reference Juel, Griffith and Gough1986). Juel et al. found that orthographic skills explained more of the variance in word reading in second graders than in first graders.
Vellutino et al. (Reference Vellutino, Scanlon and Spearing1995, Experiment 1A) studied the issue of semantic development in primary grade students. In a category verification test, a word was presented verbally and the child had to decide whether it belonged to a specific semantic category. Both typical readers and poor readers of Grades 2 and 6 performed the tests. It turned out that typical readers were generally faster than poor readers and reader group differences were smaller in second grade than in sixth grade.
To summarize, evidence for a positive relationship between semantic categorization and reading is not fully convincing. Moreover, few studies have been carried out with beginning readers and none were performed in the Dutch language. We, therefore, decided to study semantic categorization and its relationship with reading (i.e. word identification) in Dutch-speaking children from Grades 1 to 6, covering all grades of primary school.
Dutch is a relatively transparent language; that is, it has relatively unambiguous spelling-to-sound relationships. Formal reading instruction in the Netherlands starts in Grade 1 and as soon as children master the decoding rules, relatively few errors are made and variations among readers are largely marked by speed differences (for a concise description of the Dutch orthographic system and its relationship to learning to read and spell, please consult Bosman, de Graaff & Gijsel, Reference Bosman, de Graaff, Gijsel, Joshi and Aron2006).
In short, the present study investigated the role of semantic skills in a relatively transparent language across all primary grades. Because studies of conceptual knowledge of children have yielded mixed results concerning the order of development of different kinds of relationships (e.g. horizontal and vertical relationships), we decided to use two types of semantic categorization tasks, namely, Exemplar-level tasks (Experiment 1) and Superordinate-level tasks (Experiment 2). In an Exemplar-level task, knowledge of horizontal relations is assessed, and in Superordinate-level tasks, knowledge of vertical relationships is investigated.
To ensure the participation of young readers with limited word recognition skills, stimuli were not just written words, but also spoken words and pictures; for comparison purposes among condition, response options were always pictures (e.g. Vellutino, Scanlon, DeSetto & Pruzek, Reference Vellutino, Scanlon, DeSetto and Pruzek1981; Vellutino et al., Reference Vellutino, Scanlon and Tanzman1990). The different presentation conditions enabled us to investigate whether semantic skills are independent of task specifics. Additionally, we manipulated distracter type to examine the effect of phonological, semantic and perceptual characteristics. Vellutino et al. (Reference Vellutino, Scanlon and Tanzman1990, Experiment 1) created word pairs with related semantic/syntactic attributes (S/S attributes, such as roof–wall and old–man) and with orthographic/phonological attributes (O/P attributes, such as room–goose, rest–roof). They found that both poor and typical readers predominantly matched words based on S/S attributes rather than O/P attributes, and the percentage of S/S categorizations increased significantly from Grade 1 to Grade 4. In their second experiment, the S/S categorization principle was omitted and words could only be categorized based on either an orthographic or a phonological similarity. It turned out that typical readers used phonological similarity more often than poor readers did. Poor readers used the phonological categorization principle infrequently, suggesting that phonological attributes of words are more weakly represented in poor readers than orthographic attributes.
In both tasks of the present study, children were instructed to categorize on the basis of semantics. Apart from semantic distracters, we also included phonological distracters, because of its major role in reading, and perceptual distracters, because perceptual similarity appears to be involved in learning word meanings (e.g. Gentner & Namy, Reference Gentner and Namy1999). Moreover, response options were always pictures.
In sum, this study aims at investigating whether: (a) semantic categorization skills increase over time; (b) poor readers have lower semantic categorization skills than typical or good readers; and (c) poor readers and younger readers use different categorization aspects (semantic, phonological or perceptual) than typical or good readers and older readers.
EXPERIMENT 1: EXEMPLAR LEVEL
METHOD
Participants
In this study, 141 students participated: 66 boys (46·8%) and 75 girls (53·2%) from two regular primary schools in the Netherlands. Both schools were randomly selected and all children were included in the experiment. The group of participants consisted of 47 children in the lower grades (Grades 1 and 2), 47 children in the intermediate grades (Grades 3 and 4), and 47 children the higher grades (Grades 5 and 6). The mean age ranged from 7.6 (SD=0·7) in the lower grades to 9.5 (SD=0.7) in the intermediate grades, and 11.6 (SD=0.7) in the higher grades (Grades 5 and 6). The majority of the children (93%) were native Dutch speakers.
Materials
The materials comprised three forced-choice tasks to assess taxonomic knowledge and a standardized word-reading test.
Forced-choice semantic tasks
Three Exemplar-level tasks (including a picture, spoken word or written word as target stimulus) were included. The response options for each target stimulus in each task always contained four pictures. Children were instructed to choose the picture that best matched the target stimulus. All pictures originated from ‘Leesladder’ [Reading Ladder] from Irausquin & Mommers (Reference Irausquin and Mommers2001), a computer program for children with reading disabilities. The pictures were coloured line drawings and represented nouns which were well known by six-year-old Dutch children (Schaerlaekens, Kohnstamm & Lejaegere, Reference Schaerlaekens, Kohnstamm and Lejaegere1999; familiarity rating ⩾0·80 on a scale from 0 to 1). Moreover, only high-imageability nouns were selected (van Loon-Vervoorn, Reference Van Loon-Vervoorn1985; imageability rating >5·5 on a seven-point scale). All stimuli were presented on a laptop. All tasks consisted of twenty experimental trials, preceded by three practice trials, and both accuracies and reaction times were registered. A detailed description of each task is presented below.
Picture condition
A target picture was presented (e.g. ‘orange’), followed by four pictures, one target response and three distracters. The target response represented a concept from the same taxonomic category (e.g. ‘cherry’). Categories that were included were ‘insects’, ‘predators’, ‘mammals’, ‘rodents’, ‘reptiles’, ‘sense organs’, ‘furniture’, ‘transport’, ‘clothes’, ‘jewels’, ‘tools’, ‘toys’, ‘vegetables’, ‘fruit’, ‘parts of the body’ and ‘buildings’. Each trial was a different semantic category, except for the last four categories (‘vegetables’, ‘fruit’, ‘parts of the body’ and ‘buildings’). Each of these categories had two trials. A first distracter picture, which was included in half of the trials (ten out of twenty trials), was a semantic distracter: a concept (e.g. ‘egg’) that belonged to the superordinate category ‘food’, but was not an exemplar of the subcategory ‘fruit’, represented by the target stimulus (‘orange’). A second distracter picture was a phonological distracter (eleven out of twenty trials, e.g. sneeuw ‘snow’ for the stimulus leeuw ‘lion’) or a perceptual distracter (nine out of twenty trials, e.g. ‘ball’ with the stimulus ‘orange’). The criterion for phonological similarity was sharing the same end-rime with the stimulus. Perceptual similarity was created by similar contours, similar perceptual features or a similar colour of the distracter to the stimulus. A third distracter picture, included in all twenty trials, was an unrelated picture (e.g. ‘chair’ with the stimulus ‘orange’). Trials without a semantic distracter included two unrelated response options. The criteria for the unrelated concept were the absence of semantic (taxonomic or associative), perceptual and phonological similarity.
Spoken word condition
The target stimuli were members from the same categories as those in the picture task. However, the target stimuli in this task were presented verbally. Again, response options were presented by means of pictures. The identity of the target stimuli and response options differed from those in the pictures task but were of comparable difficulty, as indicated by the familiarity ratings of Schaerlaekens et al. (Reference Schaerlaekens, Kohnstamm and Lejaegere1999) . The main body of the target stimuli constituted of one-syllable CVC-, CCVC- or CVCC-words (C stands for consonant, V stands for vowel); six stimuli consisted of two syllables. None of the word stimuli were semantically ambiguous. The types and distribution of the distracters were identical to the picture task, except for the distribution of phonological and perceptual distracters; this task involved exactly ten trials for each distracter type.
Written word condition
The target stimuli were derived from the same categories that were used in the picture condition and spoken word condition. However, target stimuli in this task were presented as written letter strings. Again, the identity of the target stimuli and response options differed from the other Exemplar-level tasks, but were of comparable difficulty. The main body of the target stimuli contained one-syllable CVC-, CCVC-, CVCC or CCCVC-words; two stimuli consisted of two syllables. The types and distribution of the distracters were identical to the picture condition.
Reading test
Drie-Minuten Toets [Three-Minute Test] by Verhoeven (Reference Verhoeven1995)
This standardized word reading test consists of three cards with three different types of words. Card 1 consists of 150 one-syllable CVC-, CV- and VC-words. Card 2 consists of 150 one-syllable CCVC-, CCVCC-, CCCVC- and CVCCCC-words. Card 3 consists of 120 polysyllabic words. Children are asked to read the words as quickly and as accurately as possible within one minute. The score is the number of items read correctly. In Grades 1 and 2, Card 1 was administered. In Grades 3 to 6, Card 3 was administered.
Apparatus and procedure
All forced-choice tasks were designed in E-prime (Schneider, Eschman & Zuccolotto, Reference Schneider, Eschman and Zuccolotto2002). Words were recorded in ‘Spraak’ [Speech] by Boersma & Weenink (Reference Boersma and Weenink2004) . The experiments were performed on a laptop. First, a fixation stimulus (a plus sign, 50-point Times New Roman font) was presented for 1000 ms. Immediately at the offset, the target stimulus (picture, spoken word, or letter string) was presented. This stimulus remained visible until a response was provided. Subsequently, four response options (pictures) appeared centrally on the screen and the child had to decide which picture best matched the stimulus. Participants indicated their responses by means of four keys on the keyboard, corresponding to the position of the pictures on the screen (keys ‘c’, ‘b’, ‘m’ and ‘.’ on the ‘qwerty keyboard’). These keys were marked by white tags. The participants were asked to put their hands in front of the keyboard. Word stimuli appeared in white on a black background centrally on the screen in lowercase 24-point Courier New font. There was a 1500 ms delay between the response and the onset of the next trial. Each participant was presented with a different random order. Reaction times were measured from the offset of the stimulus (the time that the space bar was pressed) until a response was given.
Children performed all three forced-choice tasks in groups of eight students. Each experimental forced-choice task started with three practice items. The practice items were used to explain the relationship between exemplars (e.g. ‘bicycle’ and ‘car’) and their superordinate label (e.g. ‘vehicles’). There were six different orders of the subtasks in the Exemplar-level experiment (pictures, spoken words and written words). Children who were tested simultaneously received the subtasks in the same order. Trials within a subtask were randomized. Children from Grades 1 and 2 were tested in two different sessions. Children from Grades 3 to 6 performed all tasks in one single session. The reading test was administered in March (Grades 2 to 6) and May (Grade 1). To examine the effect of grade, three groups were distinguished: lower grades (Grades 1 and 2), intermediate grades (Grades 3 and 4) and higher grades (Grades 5 and 6).
Results
For each participant, mean reaction times and accuracy percentages were calculated for each semantic task. For each task, incorrect trials and reaction times more than two standard deviations below or above a participant's mean reaction time were excluded from the latencies analyses.Footnote 1 The percentages of outliers in each task are listed in Table 1.
Table 1. Percentages of outliers in the Exemplar-level task removed from the analyses
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20151027044807806-0503:S0305000909990420_tab1.gif?pub-status=live)
Grade level and semantic categorization
Table 2 shows the results of the Exemplar-level tasks for all children and for the lower grades, intermediate grades and higher grades, separately. A 3 (task condition: written word vs. spoken word vs. picture)×3 (grade: lower vs. intermediate vs. higher) GLM Repeated Measures analysis was performed on reaction times and error rates with grade as between-subjects factor and task condition as within-subjects factors. Significant effects are based on an alpha level of 0·05. The interaction effect between grade level and task condition did not reach significant levels, and neither did reaction times (F(4, 274)=1·13, p>0·05), nor errors (F<1).
Table 2. Mean reaction times in ms., standard deviations and error percentages in the Exemplar-level task for each grade level
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20151027044807806-0503:S0305000909990420_tab2.gif?pub-status=live)
The main effect of task condition was significant, both for reaction times (F(2, 137)=17·72, p<0·001, partial η2=0·21) and errors (F(2, 137)=29·31, p<0·001, partial η2=0·30). Contrast analyses showed that error percentages were significantly higher in the spoken word condition (32·1%) than in the written word condition (25·6%) (F(1, 138)=29·76, p<0·001, partial η2=0·18) and in the picture condition (22·6%) (F(1, 138)=55·02, p<0·001, partial η2=0·29). Error percentages in the picture condition and written word condition also differed significantly from each other (F(1, 138)=6·06, p=0·02, partial η2=0·04). Reaction times were significantly higher in the spoken word condition (3295 ms) than in the written word condition (2834 ms) (F(1, 138)=34·35, p<0·001, partial η2=0·20) and in the picture condition (2905 ms) (F(1, 138)=17·48, p<0·001, partial η2=0·11).
The main effect of grade level was significant, both for reaction times (F(2, 138)=11·91, p<0·001, partial η2=0·15) and for errors (F(2, 138)=13·22, p<0·001, partial η2=0·16). A post hoc Bonferroni test revealed that reaction times in the higher grades (2585 ms) were significantly shorter than those in the intermediate grades (3039 ms) and in the lower grades (3411 ms). Reaction times in the lower grades and intermediate grades did not differ significantly from each other (p=0·09). A post hoc Bonferroni test on error percentages revealed that error percentages in the lower grades (33·1%) were significantly higher than those in the intermediate grades (25·1%) and in the higher grades (22·2%). Error percentages in the intermediate grades and higher grades did not differ significantly from each other.
Reading ability and semantic categorization
To examine the relationship between word reading ability and semantic categorization, we selected a group of poor readers (PR) from the higher grades (Grades 5 and 6). This group of poor readers had reading scores in the lowest quartile of the distribution (scores on Three-Minute Test ranging from 62 through 82). Second, we defined a reading-level (RL) control group. This group included younger normal readers of the intermediate grades (Grades 3 and 4), who were reading at the same developmental level as the poor readers group from the higher grades. Finally, we defined a chronological age (CA) control group. This group consisted of children with the same chronological age as the poor readers in the higher grades, but with normal reading skills. The lower grades were excluded from the analyses, because in these grades a different card of the Three-Minute Test was used (Card 1 instead of Card 3). Table 3 summarizes the results of all groups.
Table 3. Mean age, reading scores, reaction times in ms. (standard deviations in parentheses), and error percentages in the Exemplar-level task for each reading level
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20151027044807806-0503:S0305000909990420_tab3.gif?pub-status=live)
Next, a 3 (task condition: written word vs. spoken word vs. picture)×3 (reading group: PR-group vs. RL-control vs. CA-control) GLM Repeated Measures analysis was performed on reaction times and error percentages with reading group as between-subjects factor and task condition as within-subjects factor. Significant effects are based on an alpha level of 0·05. The interaction effect between reading group and task condition did not reach significant levels, neither for reaction times nor for errors (both Fs<1).
The main effect of task condition was significant, both for reaction times (F(2, 32)=4·75, p=0·02, partial η2=0·23) and for errors (F(2, 32)=8·89, p=0·001, partial η2=0·36). Contrast analyses showed that error percentages were significantly higher in the spoken word condition (27·5%) than in the written word condition (22·1%) (F(1, 33)=6·95, p=0·01, partial η2=0·17) and in the picture condition (18·3%) (F(1, 33)=16·67, p<0·001, partial η2=0·34). Reaction times were significantly higher in the spoken word condition (2860 ms) than in the written word condition (2466 ms) (F(1, 33)=7·15, p=0·01, partial η2=0·18) and in the picture condition (2390 ms) (F(1, 33)=8·62, p=0·006, partial η2=0·21). The main effect of reading group was not significant, neither for reaction times, nor for errors (both Fs<1).
Distracter options
Third, we investigated the effect of distracter type. To that end, we computed for each of the distracter types (i.e. phonological, perceptual and semantic) the percentages of distracter choices. This refers to the number of actual distracter choices made by the children divided by the number of potential distracter options of this type, multiplied by 100. Table 4 shows for each distracter type the percentage of distracter choices.
Table 4. Percentages of distracter choices for each distracter type
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20151027044807806-0503:S0305000909990420_tab4.gif?pub-status=live)
note: Percentages of distracter choices have been provided for each combination of response options. ‘Sem’ is semantic, ‘phon’ is phonological, ‘unrel’ is unrelated.
Subsequently, we examined whether the effect of distracter type varied among readers of different grade levels (lower vs. intermediate and higher grades) and different reading groups (PR-group vs. RL-control vs. CA-control).
Distracter type and grade level
A GLM Repeated Measures analysis with task condition (spoken word vs. picture vs. written word) and distracter type (semantic, phonological, perceptual) as within-subjects variables, and grade level (lower vs. intermediate vs. higher) as a between-subjects variable was performed on the percentages of distracter choices (N=141).
The interaction between task condition and distracter type was significant (F(4, 135)=21·56, p<0·001, partial η2=0·39). Separate analyses on the spoken word, written word, and picture condition showed significant main effects of distracter type in all three task conditions: spoken word (F(2, 137)=55·54, p<0·001, partial η2=0·49); picture (F(2, 137)=79·43, p<0·001, partial η2=0·54); written word (F(2, 137)=15·80, p<0·001, partial η2=0·19). In the picture and in the written word condition, all percentages of distracter choices differed significantly from each other. However, in the spoken word condition, the percentages of perceptual and semantic distracter choices did not differ significantly from each other.
The main effect of task condition was significant (F(2, 137)=29·60, p<0·001, partial η2=0·30). Percentage of distracter choices in the spoken word condition (18·4%) were significantly larger than those in the written word condition (15%) and in the picture condition (12·7%), which in turn also differed significantly from each other.
The main effect of distracter type was also significant (F(2, 137)=71·23, p<0·001, partial η2=0·51). Percentages of phonological distracter choices (7·7%) were significantly lower than those of semantic (18·4%) and perceptual distracter choices (20·1%), which in turn did not differ significantly from each other.
Finally, the main effect of grade was also significant (F(2, 138)=12·67, p<0·001, partial η2=0·16). Percentages of distracter choices in the lower grades (18·8%) were significantly higher than those in the intermediate grades (14·5%) and those in the higher grades (12·9%). Percentages of distracter choices of the intermediate grades and higher grades did not differ significantly from each other.
Distracter type and reading skill
A GLM Repeated Measures analysis with task condition (spoken word vs. picture vs. written word) and distracter type (phonological, perceptual, semantic) as within-subjects variables and reading group (PR-group vs. RL-control vs. CA-control) as between-subjects variable was performed on the percentages of distracter choices (N=36).
The interaction effect between task condition and distracter type was significant (F(4, 30)=7·55, p<0·001, partial η2=0·50). Separate analyses on the spoken word, picture, and written word condition showed significant main effects of distracter type in all three task conditions: spoken word (F(2, 32)=21·45, p<0·001, partial η2=0·57); picture (F(2, 32)=21·11, p<0·001, partial η2=0·57); written word (F(2, 32)=9·22, p=0·001, partial η2=0·37). In the picture condition, all percentages of distracter choices differed significantly from each other. However, in the written word condition and spoken word condition, percentages of perceptual and semantic distracter choices did not differ significantly from each other.
The main effect of task condition was significant (F(2, 32)=6·61, p=0·004, partial η2=0·29). Percentages of distracter choices in the spoken word condition (16·1%) were significantly larger than those in the written word condition (13·4%) and picture condition (10·6%), which in turn did not differ significantly from each other.
The main effect of distracter type was also significant (F(2, 32)=34·7, p<0·001, partial η2=0·68). Percentages of phonological distracter choices (5·2%) were significantly lower than those of semantic (16·6%) and perceptual distracter choices (18·2%), which in turn did not differ significantly from each other.
Conclusion
To sum up, in Experiment 1, readers from Grades 1 to 6 performed three Exemplar-level tasks. Results demonstrated longer reaction times and more errors in the spoken word condition than in the written word and the picture condition. The lower grades showed longer reaction times and more errors than the higher grades. No significant differences in semantic categorization skills were found between poor readers, chronological age matched readers and reading level matched readers.
The effect of distracter type (phonological, perceptual, semantic) on percentages of distracter choices of children of different grade levels and different reading skills showed that the percentage of phonological distracter choices was significantly lower than the number of semantic distracter choices and perceptual distracter choices. In the picture condition, the difference between percentages of semantic distracter choices and perceptual distracter choices also reached significance. In the written word condition, this difference reached significance in the analyses on grade. Neither grade level nor reading skills influenced the effect of distracter type. In other words, children from different grade levels and different reading skills were not differently affected by the phonological, perceptual or semantic distracters.
EXPERIMENT 2: SUPERORDINATE LEVEL
In Experiment 2, we address the same issues as in Experiment 1. However, instead of using an Exemplar-level task to assess semantic categorization skills, we used a Superordinate-level task, which measures knowledge of vertical relationships. We investigated whether semantic categorization skills increase over time, and whether poor readers have lower semantic categorization skills than typical or good readers. Because the Superordinate-level tasks included only semantic distracters and unrelated distracters, we decided not to investigate the effect of distracter type.
Method
Participants
The same children as in Experiment 1 participated in Experiment 2, except for the children in the lower grades. The Superordinate-level tasks included multisyllable words like muziekinstrumenten ‘musical instruments’ and vervoersmiddelen ‘means of transportation’. These words require a minimum level of reading skills not yet mastered by children in Grades 1 and 2. Therefore, children from the lower grades were excluded. The number of participants in Experiment 2 consisted of 94 children: 47 children in the intermediate grades and 47 children in the higher grades.
Materials, apparatus and procedure
The materials comprised two forced-choice semantic tasks to assess taxonomic superordinate knowledge, and the word-reading test that was described in Experiment 1. Both experimental tasks consisted of twenty-five experimental trials and were preceded by two practice items. Accuracy as well as reaction times was measured. The same apparatus and procedure that was used in Experiment 1 was used in this experiment.
Forced-choice semantic tasks
Spoken word condition
The semantic category was presented verbally, followed by four pictures: one target response and three distracters. The target response represented a member of the semantic category that was presented. Categories that were included were: ‘residence’, ‘toys’, ‘drinks’, ‘cutlery’, ‘jobs’, ‘transport’, ‘musical instruments’, ‘tools’, ‘animals’, ‘clothes’, ‘sport’, ‘pets’, ‘birds’, ‘fruit’, ‘dairy products’, ‘headgear’, ‘insects’, ‘furniture’, ‘vegetables’, ‘limbs’, ‘mammals’, ‘writing tools’, ‘sense organs’, ‘numbers’ and ‘flowers’. Additionally, fifteen stimuli included two semantic distracters and one unrelated picture in the response options. Ten stimuli included three unrelated pictures in the response options. Criteria for these distracters have been described earlier.
Written word condition
This test consisted of twenty-five experimental trials, preceded by two practice items. Categories that were included were identical to the spoken word task. However, the stimuli in this subtask were presented by written words. Response options were all pictures. The identity of the stimuli and response options differed from the spoken word condition, but were of comparable difficulty (see ‘Results’ section). The types and distribution of distracters was the same as those included in the spoken word condition. Both accuracy and reaction time were measured.
Results
For each participant, mean reaction times and accuracy percentages were calculated for each semantic task. For each task, incorrect trials and reaction times more than two standard deviations below or above a participant's mean reaction time were excluded from the latencies analyses.1 The percentages of outliers in each task are listed in Table 5.
Table 5. Percentages of outliers in the Superordinate-level task removed from the analyses
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20151027044807806-0503:S0305000909990420_tab5.gif?pub-status=live)
Grade level and semantic categorization
Table 6 shows the results of the Superordinate-level tasks for all children and for the intermediate grades (Grades 3, 4) and higher grades (Grades 5, 6) separately. A 2 (task condition: written words vs. spoken words)×2 (Grade: intermediate vs. higher) ANOVA was performed on reaction times and error rates with grade as between-subjects factor and task condition as within-subjects factor. Significant effects are based on an alpha level of 0·05. The interaction effect between grade level and task condition did not reach significant levels, neither for reaction times (F(1, 92)=3·07, p>0·05) nor for errors (F<1).
Table 6. Mean reaction times in ms., standard deviations and error percentages in the Superordinate-level task for each grade level
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20151027044807806-0503:S0305000909990420_tab6.gif?pub-status=live)
The main effect of task condition was significant only for errors (F(1, 92)=4·23, p=0·04, partial η2=0·04). Error percentages were significantly higher in the spoken word condition (12·8%) than in the written word condition (10·9%).
The main effect of grade was also significant, both for reaction times (F(1, 92)=29·58, p<0·001, partial η2=0·24) and for errors (F(1, 92)=10·42, p=0·002, partial η2=0·10). Reaction times in the higher grades (1714 ms) were significantly shorter than those in the intermediate grades (2120 ms). Error percentages in the higher grades (9·6%) were significantly lower than those in the intermediate grades (14·1%).
Reading ability and semantic categorization
To examine the relationship between reading and semantic categorization skills, the selection criteria used in Experiment 1 were applied here to compose a poor reader group (PR), a reading level (RL) control group, and a chronological age (CA) control group. Table 7 summarizes the results of all groups.
Table 7. Mean age, reading scores, reaction times in ms (standard deviations in parentheses) and error percentages in the Superordinate-level tasks for each reading level
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20151027044807806-0503:S0305000909990420_tab7.gif?pub-status=live)
Next, a 2 (task condition: written words vs. spoken words)×3 (reading group: PR-group vs. RL-control vs. CA-control) GLM Repeated Measures analysis was performed on reaction times and error rates with reading group as between-subjects factor and task condition as within-subjects factor. Significant effects are based on an alpha level of 0·05. The interaction effect between reading group and task condition did not reach significant levels, neither for reaction times nor for errors (both Fs<1).
The main effect of task condition was also not significant, neither for reaction times (F(1, 33)=1·71, p>0·05) nor for errors (F(1, 33)=3·81, p>0·05). The main effect of reading group was only significant for reaction times (F(2, 33)=6·52, p=0·004, partial η2=0·28). Reaction times of the RL-control group (1991 ms) were significantly larger than those of the PR-group (1663 ms) and CA-control group (1624 ms).
Conclusion
To sum up, in Experiment 2, readers from Grades 3 to 6 performed two Superordinate-level tasks. More errors were made in the spoken word condition than in the written word condition. Moreover, results demonstrated longer reaction times and more errors in the intermediate grades than in the higher grades. The effect of reading skill on performance of the Superordinate-level tasks was visible only in the reaction time analyses. The children in the (younger) reading level matched group were slower than those in the older poor reader group and in the chronological age matched group.
GENERAL DISCUSSION
The focus of the present study was the development of semantic categorization across primary grades and its relationship with reading. Three Exemplar-level tasks (Experiment 1) and two Superordinate-level tasks (Experiment 2) with different types of distracters (phonological, semantic, perceptual, unrelated) were administered to assess semantic categorization skills.
Before discussing the main question of this study, we address how children's semantic categorization performance changes with task requirements. Children took longer to respond and committed more errors in the spoken word condition than in the picture and in the written word condition in both the Exemplar-level and the Superordinate-level task. The lower performance in the spoken word condition is most likely due to the nature of the stimulus category. In the spoken word condition, the presentation of stimulus category is relatively transient compared to the written word and the picture condition. In these last two conditions, the stimulus remained visible until the child decided to ask for the answer options, whereas in the spoken word condition the answer options appeared after the stimulus had been presented. Thus, in the spoken word condition, the stimulus naturally disappeared, whereas in the written word and in the picture condition, the children had control over the disappearance of the stimulus. This implies that the spoken word condition required careful attention and a good short-term memory compared to the other two conditions in which the child could influence the time span of the stimulus presentation.
Not only did task condition affect semantic categorization, but also distracter types influenced children's performance. In the Exemplar-level tasks, children chose the phonological distracter significantly less frequently than the perceptual distracter or semantic distracter. It appears that in the semantic categorization tasks developed here, children are less inclined to categorize words on the basis of their phonological information when pictures are included as response options.
The major research question of the present study was whether semantic categorization is affected by grade and/or reading level. First, in the Exemplar-Level tasks as well as in the Superordinate-level tasks, higher grades were faster and more accurate in all task conditions. This strongly suggests that the test is sensitive to developmental changes. Increasing accuracy in semantic categorization with age is in line with previous studies (e.g. Nguyen & Murphy, Reference Nguyen and Murphy2003). Moreover, in the Superordinate-level tasks, the reaction times of younger typical readers were larger than those of older poor readers and chronological age matched readers. Recall that, in the Superordinate-level tasks, knowledge of vertical taxonomic relations between concepts was assessed. For example, the stimulus clothes was presented and the child had to indicate which picture belonged to the semantic category clothes, with trousers as the correct response and bag, cloth and box as distracters. It appears that knowledge of vertical taxonomic relationships increases with age, and this development is independent of reading skills. During primary grades, children's conceptual knowledge increases over time and semantic structures become more specified.
Second, there was no relationship between reading level and semantic categorization performance. In all task conditions, poor readers showed similar semantic categorization performance as typical readers. This result is in line with the results of Silva-Pereyra et al. (Reference Silva-Pereyra, Rivera-Gaxiola, Fernández, Díaz-Comas, Harmony, Fernández-Bouzas, Rodríguez, Bernal and Marosi2003) . They also failed to find a relationship between figure categorization skills and reading. Silva-Pereyra et al. demonstrated both with behavioural data and with ERP recordings that poor readers performed worse than normal readers on a word categorization task, but not on a picture categorization task. These and the present finding suggest that there is no reason to assume that poor readers have incomplete semantic networks (expressed in error percentages) or have slower access to semantic memory (expressed in reaction times) than typical readers. Note that in the present study, semantic categorization times were measured separately from the time it took to encode the (written) stimulus; this experimental choice disentangles semantic access from reading performance to a large degree.
Some authors, however, have suggested that semantic processing is actually the preferred strategy in poor readers, if semantic information is available. Waterman & Lewandowski (Reference Waterman and Lewandowski1993), for example, orally presented participants with a set of words. After listening to the list of words, they were presented with a new set of words and had to decide whether each word they heard had been in the first list. The test words included groups of antecedent words (bag), rhyming targets (rag), rhyming controls (dab), semantic targets (sack, and semantic controls (mess). All readers made significantly more errors on targets than on controls. Thus, both poor and good readers more often incorrectly reported that they had heard a semantic target (e.g. sack) than a semantic control (e.g. mess). However, results also showed that the difference between responses on semantic targets and semantic controls was significantly larger for poor readers than for good readers. Thus, poor readers made more semantic errors than good readers. From these results, the authors concluded that the semantic processing system of poor readers is intact and might even be the preferred coding strategy in poor readers. Because our poor readers' taxonomic knowledge was similar to the taxonomic knowledge of good readers, our results also support the conclusion that the semantic processing of poor readers is highly similar to that of good readers.
To conclude, our study does not provide evidence for a positive relationship between semantic categorization skills and word reading skills. Of course, we do not believe that these findings can be generalized to text reading, because it is generally assumed that semantic skills such as vocabulary do play a role in text reading and text comprehension.
Our study did not provide evidence for the superiority of semantic skills in poor decoders, as suggested by Waterman & Lewandowski (Reference Waterman and Lewandowski1993) . Note, however, that the procedure in their study was different from ours. Their task addressed implicit semantic knowledge (i.e. on-line procedures), whereas the semantic categorization tasks in our study required purposive semantic processing (i.e. off-line procedures). Moreover, the authors were unable to replicate and extend their findings in a subsequent study when stimuli were presented visually, rather than verbally (Waterman & Lewandowski, Reference Waterman and Lewandowski1994). In the latter study, poor readers and good readers were equally affected by semantic confusability. Several researchers (e.g. Assink et al., Reference Assink, Van Bergen, Van Teeseling and Knuijt2004) have argued that off-line tasks are less suitable for examining implicit semantic knowledge, because they are more directly linked to word reading than conscious processes are, suggesting that semantic priming studies might be a better alternative. Assink et al. carried out a semantic priming experiment with poor readers and controls and found no sensitivity differences to semantic cues between poor readers and typical readers. Although semantic processes can be assessed either in a direct way (purposive, off-line processing) or in a rather indirect way (e.g. semantic priming), the two methods appear to address different processes. We believe that both on-line procedures and off-line procedures are important in the study of the semantic processing of poor readers, because deficiencies in purposive semantic processing do not automatically imply deficits in implicit semantic processing and vice versa.
Finally, we examined whether poor readers and younger readers were more distracted by other characteristics of concepts (phonology vs. semantics vs. shape) than good readers or older readers were. Neither grade level nor reading skills affected the outcomes. Thus, poor readers were no differently affected by the phonological, perceptual or semantic distracters than good readers. This result is in accordance with the results of Waterman & Lewandowski (Reference Waterman and Lewandowski1994) . They manipulated phonological, orthographic and semantic confusability. The task for the poor and good readers was to indicate whether stimuli had been visually presented a second time. Poor readers and good readers were equally affected by orthographic and semantic confusability. Phonological confusability was not observed for either of the groups. These results, together with the results of the semantic priming studies (e.g. Assink et al., Reference Assink, Van Bergen, Van Teeseling and Knuijt2004), suggest that semantic processing in poor readers is qualitatively similar to that of good readers. This mirrors the results on reading tasks in relatively transparent languages: children with dyslexia in the Netherlands show identical patterns of reading errors to normal readers and they are equally affected by spelling (in)consistencies as normal readers (e.g. Bosman, Vonk and van Zwam, Reference Bosman, Vonk and van Zwam2006).
To conclude, the semantic categorization skills of poor and good readers are equally advanced, suggesting that semantic processing in poor readers is not deficient. This finding needs to be incorporated in theories of word reading. As noted in the ‘Introduction’, almost all models of reading acknowledge the fundamental and interactive role of phonology, orthography and semantics. If semantic processing in poor readers is not dissimilar to that of good readers, poor reading needs to be explained from the other aspects that reading involves. All models that assume recurrent interaction between all three aspects that govern reading are able to explain this phenomenon (e.g. Bosman & Van Orden, Reference Bosman, Van Orden, Perfetti, Rieben and Fayol1997; Harm & Seidenberg, Reference Harm and Seidenberg1999; Van Orden et al., Reference Van Orden, Pennington and Stone1990). Interactive models cause the presentation of a printed word to activate letter nodes, which in turn activate phoneme and semantic nodes. Following initial activation, recurrent feedback dynamics begin among all these node families. It is assumed that, in poor readers, the connections between the phonological nodes and orthographic nodes are inefficient or less stable. However, because of the interconnectivity between all nodes, activation of semantic information may support the word identification process. Farrar & Van Orden (Reference Farrar and Van Orden2001) investigated this assumption in a simulation study on English words, and indeed found that intact semantics supports the reading process, in case connections between phonology and orthography are weak or even absent. To substantiate the claim of semantics supporting the reading process, cross-linguistic research is needed to answer the question of whether semantics also plays a role in more transparent languages such as Dutch. After all, the relationships between semantics, orthography and phonology might differ between deep and shallow orthographies, resulting in different findings concerning the role of semantic skills in word identification.
In sum, the results of this study demonstrated a strong development (in both speed and accuracy) in semantic skills through primary grades. The development was apparent in all Exemplar-level and Superordinate-level tasks. No differences emerged in semantic performance between poor readers and typical readers, and the effect of distracter type was the same for readers of different grade levels and different reading skills. We, therefore, reject the notion of a qualitative difference in taxonomic knowledge between reader groups with respect to word reading. Identical error patterns of all reader groups and similar preferences for different types of distracters support this view.