Spared bottom-up but impaired top-down interactive effects during naturalistic language processing in schizophrenia: evidence from the visual-world paradigm

Hugh Rabagliati; Nathaniel Delaney-Busch; Jesse Snedeker; Gina Kuperberg

doi:10.1017/S0033291718001952

Spared bottom-up but impaired top-down interactive effects during naturalistic language processing in schizophrenia: evidence from the visual-world paradigm

Published online by Cambridge University Press: 22 August 2018

Hugh Rabagliati

Nathaniel Delaney-Busch ,

Jesse Snedeker and

Gina Kuperberg

Show author details

Hugh Rabagliati*: Affiliation:
Department of Psychology, Tufts University, Medford, MA 02155, USA Department of Psychology, Harvard University, Cambridge, MA 01238, USA School of Philosophy, Psychology and Language Sciences, University of Edinburgh, Edinburgh, UK
Nathaniel Delaney-Busch: Affiliation:
Department of Psychology, Tufts University, Medford, MA 02155, USA
Jesse Snedeker: Affiliation:
Department of Psychology, Harvard University, Cambridge, MA 01238, USA
Gina Kuperberg: Affiliation:
Department of Psychology, Tufts University, Medford, MA 02155, USA Department of Psychiatry and the Athinoula A. Martinos Center for Biomedical Imaging, Massachusetts General Hospital, Harvard Medical School, Charlestown, MA 02129, USA
*: Author for correspondence: Hugh Rabagliati, E-mail: hugh.rabagliati@ed.ac.uk

Article contents

Abstract
Background
Methods
Results
Conclusions
Methods and materials
Results
Discussion
Footnotes
References

Rights & Permissions

Abstract

Background

People with schizophrenia process language in unusual ways, but the causes of these abnormalities are unclear. In particular, it has proven difficult to empirically disentangle explanations based on impairments in the top-down processing of higher level information from those based on the bottom-up processing of lower level information.

Methods

To distinguish these accounts, we used visual-world eye tracking, a paradigm that measures spoken language processing during real-world interactions. Participants listened to and then acted out syntactically ambiguous spoken instructions (e.g. ‘tickle the frog with the feather’, which could either specify how to tickle a frog, or which frog to tickle). We contrasted how 24 people with schizophrenia and 24 demographically matched controls used two types of lower level information (prosody and lexical representations) and two types of higher level information (pragmatic and discourse-level representations) to resolve the ambiguous meanings of these instructions. Eye tracking allowed us to assess how participants arrived at their interpretation in real time, while recordings of participants’ actions measured how they ultimately interpreted the instructions.

Results

We found a striking dissociation in participants’ eye movements: the two groups were similarly adept at using lower level information to immediately constrain their interpretations of the instructions, but only controls showed evidence of fast top-down use of higher level information. People with schizophrenia, nonetheless, did eventually reach the same interpretations as controls.

Conclusions

These data suggest that language abnormalities in schizophrenia partially result from a failure to use higher level information in a top-down fashion, to constrain the interpretation of language as it unfolds in real time.

Keywords

Eye movements language prediction schizophrenia visual-world paradigm

Type: Original Articles
Information: Psychological Medicine , Volume 49 , Issue 8 , June 2019 , pp. 1335 - 1345

DOI: https://doi.org/10.1017/S0033291718001952 [Opens in a new window]
Copyright: Copyright © Cambridge University Press 2018

Language is the backbone of interpersonal interaction and an essential part of human cognition: to understand or speak a sentence requires the coordination of a range of processes, ranging from low-level perception to high-level social cognition. In schizophrenia, language dysfunction has long been noted (Bleuler, Reference Bleuler1911/1950; Andreasen, Reference Andreasen1979a, Reference Andreasen1979b; Kuperberg, Reference Kuperberg2010a), and is most obviously seen in the disorganized (‘thought-disordered’) speech produced by some patients (Bleuler, Reference Bleuler1911/1950; Andreasen, Reference Andreasen1986). But abnormalities in language comprehension can also be detected in the absence of overt thought disorder (for reviews, see Kuperberg, Reference Kuperberg2010b; Brown and Kuperberg, Reference Brown and Kuperberg2015) and these can predict psychosocial function (e.g. Bowie and Harvey, Reference Bowie and Harvey2008; Swaab et al., Reference Swaab, Boudewyn, Long, Luck, Kring, Ragland, Ranganath, Lesh, Niendam, Solomon and Mangun2013; Holshausen et al., Reference Holshausen, Harvey, Elvevåg, Foltz and Bowie2014). Understanding the basis of abnormal language processing in schizophrenia therefore has important general implications for understanding the disorder's cognitive architecture more broadly, particularly the relationships between perceptual and higher order disturbances that characterize the disorder (Brown and Kuperberg, Reference Brown and Kuperberg2015). Moreover, the important role that language plays in social interaction suggest that understanding these linguistic abnormalities may shed light on the everyday social challenges faced by people with schizophrenia.

Abnormalities of language in schizophrenia have been described at multiple levels, including sentence and discourse processing (Cohen and Servan-Schreiber, Reference Cohen and Servan-Schreiber1992; Kuperberg et al., Reference Kuperberg, McGuire and David1998; Ditman and Kuperberg, Reference Ditman and Kuperberg2007; Boudewyn et al., Reference Boudewyn, Carter and Swaab2012), pragmatic inferencing (Frith, Reference Frith2004; Bambini et al., Reference Bambini, Arcara, Bechi, Buonocore, Cavallaro and Bosia2016), lexico-semantic associations (Spitzer et al., Reference Spitzer, Braun, Hermle and Maier1993; Mathalon et al., Reference Mathalon, Faustman and Ford2002; Minzenberg et al., Reference Minzenberg, Ober and Vinogradov2002; Titone and Levy, Reference Titone and Levy2004; Elvevåg et al., Reference Elvevåg, Foltz, Weinberger and Goldberg2007; Kreher et al., Reference Kreher, Goff and Kuperberg2009), phonology and orthography (Whitford et al., Reference Whitford, O'Driscoll, Pack, Joober, Malla and Titone2013; Revheim et al., Reference Revheim, Hole, Bruland, Reitan, Bjerkehagen, Julsrud and Seierstad2014; Whitford et al., Reference Whitford, O'Driscoll and Titone2017), and prosody (Kantrowitz et al., Reference Kantrowitz, Hoptman, Leitman, Silipo and Javitt2014). While higher and lower level language abnormalities in schizophrenia have usually been discussed independently, some have proposed that they are linked, with two major theories discussing the nature of these links.

The first ‘bottom-up’ theory proposes that lower level impairments cascade up to cause higher level language abnormalities in schizophrenia. This proposal assumes that the primary locus of linguistic dysfunction is in the perception and propagation of lower level information (such as speech sounds or early visual representations) up the linguistic hierarchy, driving abnormalities at higher levels of representation, such as the interpretation of a sentence's meaning (Leitman et al., Reference Leitman, Foxe, Butler, Saperstein, Revheim and Javitt2005; Javitt, Reference Javitt2009; Jahshan et al., Reference Jahshan, Wynn and Green2013; Kantrowitz et al., Reference Kantrowitz, Hoptman, Leitman, Silipo and Javitt2014; Revheim et al., Reference Revheim, Hole, Bruland, Reitan, Bjerkehagen, Julsrud and Seierstad2014; Javitt and Freedman, Reference Javitt and Freedman2015).

The second ‘top-down interactive’ theory proposes that linguistic abnormalities in schizophrenia stem from disruptions of the fast interactions between higher and lower level representations as language is comprehended. This theory (see Brown and Kuperberg, Reference Brown and Kuperberg2015 for a recent review) is based on models of typical language processing that posit constant communication between higher and lower level representations during language comprehension (McClelland and Rumelhart, Reference McClelland and Rumelhart1981; Rumelhart and McClelland, Reference Rumelhart and McClelland1982; Tanenhaus et al., Reference Tanenhaus, Spivey-Knowlton, Eberhard and Sedivy1995; Elman et al., Reference Elman, Hare and McRae2004), an idea that is echoed in more general cognitive models of schizophrenia (e.g. Cohen and Servan-Schreiber, Reference Cohen and Servan-Schreiber1992). For example, probabilistic predictive frameworks propose a crucial role of top-down inputs from higher level representations in constraining activity at lower level representations (Brown and Kuperberg, Reference Brown and Kuperberg2015; Kuperberg and Jaeger, Reference Kuperberg and Jaeger2016). If these predictive interactions are disrupted in schizophrenia, this would result in unconstrained bottom-up activity (Corlett et al., Reference Corlett, Frith and Fletcher2009; Fletcher and Frith, Reference Fletcher and Frith2009), and thus abnormal patterns of language processing (Brown and Kuperberg, Reference Brown and Kuperberg2015).

Although these two theories appear distinct, they have proven difficult to disentangle (see Brown and Kuperberg, Reference Brown and Kuperberg2015). For example, some researchers have taken correlations between lower and higher level language abnormalities in schizophrenia as evidence for the first theory (Leitman et al., Reference Leitman, Foxe, Butler, Saperstein, Revheim and Javitt2005; Jahshan et al., Reference Jahshan, Wynn and Green2013; Kantrowitz et al., Reference Kantrowitz, Hoptman, Leitman, Silipo and Javitt2014), but these data are equally well explained by the second. Conversely, others have taken impairments in patients’ use of higher level discourse representations, but preserved sensitivity to simple lexico-semantic associations (Titone et al., Reference Titone, Levy and Holzman2000; Kuperberg et al., Reference Kuperberg, Sitnikova, Goff and Holcomb2006; Ditman and Kuperberg, Reference Ditman and Kuperberg2007; Swaab et al., Reference Swaab, Boudewyn, Long, Luck, Kring, Ragland, Ranganath, Lesh, Niendam, Solomon and Mangun2013; and see Kuperberg, Reference Kuperberg2010b, for a review), as support for the second theory. However, because language comprehension is highly incremental, with each incoming word being integrated into a high-level discourse representation in real time, it is possible that apparent impairments in using higher level discourse context could actually arise from a difficulty building this context in the first place, due to impaired lower level processing.

The present study was designed to distinguish between these two theories by examining how people with schizophrenia interpret ambiguous sentences. Ambiguity resolution is a critical component of everyday language comprehension: To understand a sentence, listeners constantly have to resolve a series of ambiguous sounds, words, and meanings. Here, we focused on one particularly common type of ambiguity – syntactic ambiguities such as ‘wave to the man with the flag’, where the flag could be held by the man or by the waver. Syntactic ambiguity resolution provides an ideal test case for understanding the effects of bottom-up and top-down interactive processes. This is because syntax is often assumed to lie at an intermediate level on the linguistic hierarchy: it may lie above lower level representations such as prosody or lexical information, which are therefore said to interact with syntax in a bottom-up fashion. However, it lies below higher level representations such as discourse and pragmatics, which are therefore said to interact with syntax in a top-down fashion (see Table 1 for definitions). Here, we asked how people with schizophrenia used these two types of lower level information in a bottom-up fashion, and these two types of higher level information in a top-down fashion, to influence syntactic ambiguity resolution, and hence interpretation.

Table 1. Definitions of terms and summary of manipulations

To do this, we used the visual-world eye-tracking method, a well-established and well-validated psycholinguistics technique that has become a ubiquitous tool for studying the time course of spoken language comprehension (Tanenhaus et al., Reference Tanenhaus, Spivey-Knowlton, Eberhard and Sedivy1995; Tanenhaus and Trueswell, Reference Tanenhaus, Trueswell, Traxler and Gernsbacher2006). Visual-world eye tracking has not been previously used to study schizophrenia, yet it is particularly well suited for this purpose as it provides a naturalistic and minimally demanding experimental analogue to everyday communication. In our paradigm, participants interacted with a set of real-world objects placed in front of them (following Sedivy et al., Reference Sedivy, Tanenhaus, Chambers and Carlson1999; Tanenhaus et al., Reference Tanenhaus, Spivey-Knowlton, Eberhard and Sedivy1995; Keysar et al., Reference Keysar, Barr, Balin and Brauner2000; and see also Trueswell et al., Reference Trueswell, Sekerina, Hill and Logrip1999; Snedeker and Trueswell, Reference Snedeker and Trueswell2004; Snedeker and Yuan, Reference Snedeker and Yuan2008; Huang and Snedeker, Reference Huang and Snedeker2009a, Reference Huang and Snedeker2009b; Diehl et al., Reference Diehl, Friedberg, Paul and Snedeker2015; Gambi et al., Reference Gambi, Pickering and Rabagliati2016; for work validating this paradigm in populations other than typical adults). For example, participants might see (1) a toy frog holding a small feather, (2) a large feather, (3) a toy cat holding a small flower, and (4) a large flower (see Fig. 1). They then listened to spoken instructions telling them how to manipulate these objects, e.g. ‘Poke the frog with the feather’. Although this instruction appears simple, it is actually syntactically ambiguous: it can either be interpreted as an instruction to use the large feather as an ‘instrument’ to poke the frog (the so-called instrument interpretation), or to use one's own finger to poke the frog that is holding the small feather. Importantly, there are no ‘correct’ responses to an instruction like this: its interpretation depends upon how the syntactic ambiguity is resolved, which, in turn, depends upon whether and when participants use different types of informational cues within the context. As participants listen to such instructions, their use of different types of cues can be inferred by examining the pattern of their eye movements to the objects as the spoken verbal input unfolds. For example, if participants infer an instrument interpretation, then they should be more likely to gaze toward the large feather (i.e. the instrument) when they hear the word ‘feather’. Critically, there is little reason to believe that the types of oculomotor process that are measured in the visual-world paradigm (i.e. patterns of saccadic eye movements and fixations) are impaired in schizophrenia. Unlike the so-called ‘smooth pursuit’ eye movements (Iacono, Reference Iacono1981), there is little evidence that deficits in oculomotor control affect patients’ saccades (Whitford et al., Reference Whitford, O'Driscoll, Pack, Joober, Malla and Titone2013).

Fig. 1. Illustration of the experimental setup used. Left: an action performed with the target instrument. Right: an action performed without the target instrument. TI, target instrument; TA, target animal; DI, distractor instrument; DA, distractor animal.

To assess how participants used lower and higher level information to influence their interpretation of these syntactically ambiguous spoken sentences, we separately manipulated four features of the linguistic and non-linguistic input – two lower level cues (prosodic phrasing see Snedeker and Yuan, Reference Snedeker and Yuan2008, and semantic–thematic verb constraints, see Snedeker and Trueswell, Reference Snedeker and Trueswell2004), and two higher level cues (pragmatically-relevant visual context, see Tanenhaus et al., Reference Tanenhaus, Spivey-Knowlton, Eberhard and Sedivy1995, and conversational discourse information, see Rabagliati et al., Reference Rabagliati, Gambi and Pickering2014). These manipulations are described, together with definitions and examples, in Table 1. By examining how these cues affected eye movements, we were able to distinguish between the two theories outlined above. The bottom-up theory would predict reduced looks to the instrument in the schizophrenia group when both lower and higher level cues bias toward the instrument interpretation. The top-down interactive theory, however, would predict reduced looks to the instrument in the schizophrenia group, only when higher level cues bias toward this interpretation.

In addition to examining eye movements while participants listened to the sentences, we also examined participants’ final actions, reflecting their final interpretations of the sentences. Some previous studies have found that, even though people with schizophrenia can struggle with using different types of cue to process language as it unfolds very quickly, if there is enough time, they can still use such cues to ultimately interpret sentences in similar ways to healthy controls (Ditman and Kuperberg, Reference Ditman and Kuperberg2007; Kuperberg et al., Reference Kuperberg, Ditman and Choi Perrachione2018). If this was the case in the present study, then people with schizophrenia and healthy controls might show the same pattern of final actions, even if they showed different patterns of eye movements. Given the very fast pace of real-world conversation, this would have important psychosocial implications for understanding why some people with schizophrenia struggle with day-to-day social communication.

Methods and materials

Participants

Twenty-four stable outpatients (three females) were recruited from the Lindemann Mental Health Center, Boston. All met the DSM-IV-TR criteria for schizophrenia or schizoaffective disorder, confirmed using the Structured Clinical Interview for DSM-IV-TR Axis I Disorders (First et al., Reference First, Spitzer, Miriam and Williams2002b). Twenty-two were taking stable doses of antipsychotic medication (19 atypicals; three typicals) and two were unmedicated. Symptoms were assessed using the Scale for the Assessment of Positive Symptoms (SAPS, Andreasen, Reference Andreasen1984b) and the Scale for the Assessment of Negative Symptoms (SANS, Andreasen, Reference Andreasen1984a) either on the day of testing (20 participants) or within 60 days (four participants), see Table 2. Twenty-four demographically matched controls (three females) were recruited by advertisement. Control participants were not taking psychoactive medication and were screened to exclude psychiatric and neurological disorders or substance abuse/dependence (First et al., Reference First, Spitzer, Miriam and Williams2002a).

Table 2. Demographic, medication and symptom measures

Means are shown with standard deviations in parentheses.

^a Premorbid IQ was assessed using the North American Adult Reading Test: NAART (Blair and Spreen, Reference Blair and Spreen1989).

^b Parental socio-economic status (SES) was calculated using the Hollingshead Index (Hollingshead, Reference Hollingshead1965). One control and one patient did not provide parental occupation.

^c Chlorpromazine (CPZ) equivalents were calculated following the International Consensus Study of Antipsychotic Dosing (Gardner et al., Reference Gardner, Murphy, O'Donnell, Centorrino and Baldessarini2010).

^d SAPS: Scale for the Assessment of Positive Symptoms (Andreasen, Reference Andreasen1984b); SANS: Scale for the Assessment of Negative Symptoms (Andreasen, Reference Andreasen1984a). SAPS and SANS scores shown are summary scores (sum of the global ratings).

All participants were native English speakers. This study was carried out with the explicit review and approval of the Partners Human Research Committee and Massachusetts General Hospital IRB (#2010P001683) and Tufts Health Sciences Institutional Review Board (#5110). Participants gave written informed consent and were compensated for taking part in the study in accordance with the approved IRB protocols.

General procedures

Each participant was tested on three similar experimental tasks examining their use of prosodic phrasing (task 1), the semantic–thematic constraints of the verb (task 2), pragmatically-relevant visual context (also in task 2), and conversational discourse context (task 3). Participants completed the tasks in one of two orders, with task 2 always second.

We used a ‘looking while listening’ variant of the visual-world paradigm in which participants’ eye movements were remotely monitored via video camera and then hand coded (Snedeker and Trueswell, Reference Snedeker and Trueswell2004; Snedeker and Yuan, Reference Snedeker and Yuan2008). Participants sat in front of a sloped shelf containing four small platforms (see Fig. 1). On every trial, an experimenter placed four different objects on the platforms and named them. These were: (a) the target animal: a toy animal holding a small object (e.g. a toy frog holding a small feather); (b) the target instrument: a larger object (e.g. a large feather that can be used for poking); (c) the distractor animal: another toy animal, either of the same or different type as the target animal, holding a different small object (e.g. a different toy frog or a toy cat holding a small flower); and (d) the distractor instrument: a different large object (e.g. a large flower).

Participants heard spoken instructions over a loudspeaker (pre-recorded by an unfamiliar female American English speaker). A video camera, embedded in the shelf, recorded the participant's face at 30 frames per second as she/he listened to the instructions; this video was later used to code gaze fixations (see online Supplementary Materials for full details). A second camera, behind the participant's shoulder, recorded their final actions. Participants were told the purpose of each camera, and that the study was part of a larger project assessing language in children and adults, which explained the somewhat ‘silly’ nature of the instructions.

Each trial used different combinations of animals and instruments. Positions were counterbalanced across trials to avoid learned associations between particular objects and locations. Experimental trials were interspersed with filler trials using a variety of linguistic constructions, animals, and instruments.

Task 1: use of prosodic phrasing

Following Snedeker and Yuan (Reference Snedeker and Yuan2008)’s design, we varied how pauses were placed in the experimental instructions, to produce a bias toward the target instrument in four experimental trials (e.g. ‘You can poke the frog…with the feather’), and a bias against the target instrument in the remaining four experimental trials (e.g. ‘You can poke…the frog with the feather’). Trials were blocked, such that all four trials from one condition preceded trials from the other and were interspersed amongst 20 filler trials. Scenes always contained animals of different types (e.g. a frog holding a feather and a cat holding a flower).

Task 2: use of the verb's semantic–thematic constraints and pragmatically relevant visual information

Following Snedeker and Trueswell (Reference Snedeker and Trueswell2004)’s design, we varied the particular verb used in the spoken instruction. Eight experimental trials contained verbs that were independently rated (as described by Snedeker and Trueswell, Reference Snedeker and Trueswell2004) to probabilistically bias participants toward carrying out an action with an instrument (e.g. ‘poke the frog with the feather’), and eight trials contained verbs like sing that bias participants against using the instrument (e.g. ‘sing to the frog with the funnel’). These instructions did not contain any prosodic pauses.

Instructions were crossed with a manipulation of pragmatically relevant visual information. Specifically, we varied the number of potential animal referents of a particular type within the visual scene (Tanenhaus et al., Reference Tanenhaus, Spivey-Knowlton, Eberhard and Sedivy1995; Dahan and Tanenhaus, Reference Dahan and Tanenhaus2004; Snedeker and Trueswell, Reference Snedeker and Trueswell2004). In eight trials, the scene contained two animals of different types (e.g. a frog and a cat), while in the remaining eight trials, the scene contained two animals of one type (e.g. a frog holding a small feather and another frog holding a small flower). This manipulation works because the latter scene biases away from the instrument interpretation, as comprehenders who hear ‘poke the frog with the feather’ tend to infer that ‘with the feather’ disambiguates which of the two frogs should be poked. Experimental trials were randomly interspersed amongst 32 filler trials.

Task 3: use of conversational discourse information

A question preceded each of the eight experimental trials, asked by a male speaker. In four trials, the question biased participants toward using the target instrument (e.g. Question: ‘What should we do to a frog?’ Answer: ‘Poke the frog with feather’), and in the remaining four trials, the question biased against using the target instrument (e.g. Question: ‘Which frog should we play with now?’ Answer: ‘Poke the frog with feather’). All experimental trials contained two animals of the same type (e.g. a frog holding a feather and a frog holding a spoon). They were blocked and interspersed amongst 20 filler trials.

Analysis

Analysis of eye movements

On each trial, hypothesis-blind research assistants used the video to code the direction of each participant's gaze in relation to the particular location of the object for that trial, see online Supplementary Materials for full details.

We conducted a pre-planned ‘time-window’ analysis of the eye movements. This analysis focused on whether participants looked at the target instrument (e.g. the large feather) at any point within each of two time windows following the onset of each instruction's final word (feather) – from 200 to 699 ms and from 700 to 1199 ms. These time windows were selected a priori: they are the same as those analyzed by Snedeker and Trueswell (Reference Snedeker and Trueswell2004) and Diehl et al. (Reference Diehl, Friedberg, Paul and Snedeker2015), who used a similar paradigm to assess syntactic ambiguity resolution in healthy adults, adolescents with autism spectrum disorder, and young children. We specifically chose this approach over alternatives such as growth curve analysis (Mirman et al., Reference Mirman, Dixon and Magnuson2008), in part because recent work (Huang et al., Reference Huang, Stranahan and Snedeker2017) has suggested that the latter analyses can produce a high rate of false positives, a finding that we have confirmed with our own simulations on the present dataset. In contrast, as well as implementing strong a prior hypotheses, the time-window analysis we adopt here also accurately reflects many of the temporal properties of gaze behavior, including the fact that fixations typically last for many hundreds of milliseconds.

Analyses were carried out using mixed-effect logistic regressions fit using lme4 package version 1.1 (Bates et al., Reference Bates, Mächler, Bolker and Walker2015) in R (R Core Team, 2016). We used logistic rather than linear regression because our dependent variable was binary: whether a participant fixated the target instrument during each time window, or whether they looked elsewhere (collapsing across looks to one of the other quadrants, to the central fixation point, or off the stage altogether). The linking function for logistic regression thus provides a more accurate model of the data and is better able to account for floor and ceiling effects.

We structured the predictors in our regression to make them maximally comparable to an analysis of variance. For each task and population group, we crossed the factors information bias (cues biasing toward or away from the instrument interpretation) and time window (early or late). In all analyses, we treated subjects as random effects. In task 2 (where trials were randomly ordered), the effect of information bias was treated as a random effect within subjects, but in tasks 1 and 3, where trials were blocked, information bias was simply treated as a fixed effect, to account for the fact that many subjects perseverated on an interpretation (and thus effects could be clearly seen between subjects). Time window was allowed to vary within subjects. Then, to determine whether effects of information bias differed significantly between the control and schizophrenia groups, we also carried out between-group analyses, in which we crossed group (controls or patients) with information bias and time window.

To assess the significance of all main effects and interactions involving fixed factors, we used Wald tests. We report results for key regression coefficients in the main text; for full regression model results, see https://osf.io/bdkpy/.

Analysis of final actions

Hypothesis-blind research assistants coded whether or not participants used the target instrument as they acted out each instruction. This indicated whether participants ultimately adopted an ‘instrument’ interpretation of the instruction. Participants’ actions were then analyzed using logistic regressions. For each task, we crossed the factors information bias (cues biasing for or against using the target instrument) and group (controls or patients). Random effects were treated as above. The full results of all models are available at https://osf.io/bdkpy/.

Results

Analysis of online processing (eye movements)

Effects of prosodic phrasing and verb semantic–thematic constraints

The eye movements of control participants and people with schizophrenia were affected by both prosodic phrasing (Fig. 2A) and the verb's semantic–thematic constraints (Fig. 2B): both groups appeared to look more often to the instrument when these bottom-up cues suggested that they should do so (see Table 3 for descriptive statistics).

Fig. 2. How participants’ eye movements and final actions were affected by lower and higher level information. (a) Use of prosodic phrasing, (b) use of lexical information, (c) use of pragmatically relevant visual information, (d) use of conversational discourse information. Graphs show proportion of trials on which controls (left panel) and patients (middle panel) fixated on the target instrument within the early and late time windows, both when information biased toward and against the instrument interpretation. Lines are loess smoothers; shaded ribbons indicate 95% CI. Right panels show participants’ final actions. Error bars represent ±1 standard error of the mean. Online Supplementary Materials show eye movements to each object over time.

Table 3. Mean proportion of trials on which participants fixated the target instrument (early and late time windows) or used the target instrument to carry out their final actions, depending on whether the different experimental manipulations biased toward or against the instrument interpretation. Standard deviations are in parentheses

Logistic regressions confirmed these patterns. In controls, there were significant effects of prosodic phrasing on eye movements [β = −0.80 (s.e. = 0.13), CI −1.05 to −0.55, Wald's z = 6.3, p < 0.001]: when prosody biased toward the instrument interpretation, the odds of gazing at the target instrument were significantly higher than when it biased against the instrument interpretation. Similarly, in people with schizophrenia, the effect was also significant [β = −0.74 (0.16), CI −1.05 to −0.43, Wald's z = 4.7, p < 0.001], meaning that people in this group were also more likely to gaze at the target instrument when the prosody biased toward this interpretation. A between-group comparison confirmed that the size of the prosody effect did not significantly differ between controls and people with schizophrenia (no interaction between information bias and group, β = 0.11 (0.20), CI −0.28 to 0.50, Wald's z = 0.53, p = 0.59).

Similarly, in both the control and schizophrenia groups, there were significant effects of the verb's semantic–thematic constraints. The control group looked significantly more at the target instrument when the verb was biased toward this interpretation [β = −0.92 (0.16), CI −1.23 to −0.60, Wald's z = 5.7, p < 0.001], and the same was true for people with schizophrenia [β = −0.84 (0.19), CI −1.20 to −0.47, Wald's z = 4.5, p < 0.001]. Once again, this effect did not differ significantly between the two groups [β = −0.07 (0.10), CI −0.14 to 0.28, Wald's z = 0.68, p = 0.49].

Effects of pragmatically relevant visual information and conversational discourse information

In contrast to the lower level cues, the effects of both pragmatically relevant visual information (Fig. 2C) and conversational discourse information (Fig. 2D) on eye movements appeared to differ between the control and schizophrenia groups (see Table 3 for descriptive statistics). Whereas controls looked more often to the target instrument when both these higher level cues suggested that they should do so, people with schizophrenia did not appear to show such robust effects.

Logistic regressions confirmed these observations. In controls, the effect of pragmatically relevant visual context was significant [β = −0.39 (0.16), CI −0.71 to −0.07, Wald's z = 2.4, p = 0.02]: when visual context biased toward the instrument interpretation, controls were more likely to gaze at the target instrument. In people with schizophrenia, however, the effect was not significant [β = 0.10 (0.13), CI −0.17 to 0.36, Wald's z = 0.72, p = 0.47]: visual context did not significantly affect their gaze to the target instrument. The between-group analysis confirmed that visual context had a significantly greater effect on controls than on people with schizophrenia [significant interactions between information bias and group, β = 0.21 (0.10), CI 0.02–0.40, Wald's z = 2.1, p = 0.03].

Similarly, conversational discourse information significantly affected the eye movements of control participants [β = −0.58 (0.15), CI −0.88 to −0.28, Wald's z = 3.8, p < 0.001]; they were significantly more likely to gaze at the target instrument when the prior question was biased toward this instrument interpretation. In contrast, conversational discourse did not have a significant effect on the eye movements of people with schizophrenia [β = −0.1 (0.14), CI −0.37 to 0.16, Wald's z = 0.76, p = 0.45]. Once again, the between-group analysis confirmed that the conversational discourse information had a significantly greater effect in controls than in people with schizophrenia [significant interaction between information bias and group, β = 0.45 (0.20), CI 0.05–0.84, Wald's z = 2.2, p = 0.03].

We also carried out exploratory correlational analyses between patterns of eye movements and clinical variables within the schizophrenia group. These are reported in online Supplementary Material.

Analysis of final interpretations (final actions)

Both groups of participants made similar use of bottom-up prosodic phrasing and semantic–thematic constraints to inform their final actions (see Fig. 2 and Table 3 for descriptive statistics). Logistic regressions confirmed this pattern. When both these bottom-up cues biased toward the target instrument, then both control participants and people with schizophrenia were significantly more likely to use the target instrument to carry out their final actions, compared with when the phrasing was biased against the target instrument. This held for both prosodic phrasing [controls: β = −1.1 (0.21), CI −1.48 to −0.64, Wald's z = 4.9, p < 0.001; people with schizophrenia: β = −0.94 (0.18), CI −1.29 to −0.58, Wald's z = 5.2, p < 0.001] and for the verb's semantic–thematic constraints [controls: β = −1.19 (0.20), CI −1.58 to 0.81, Wald's z = 6.0, p < 0.001; people with schizophrenia: β = −2.5 (0.67), CI −3.81 to −1.19, Wald's z = 3.74, p < 0.001]. Between-group analyses revealed no significant differences between the two groups in how these two types of bottom-up information influenced their final actions (no significant interactions between information bias and group for prosodic phrasing: β = −0.04 (0.13), CI −0.30 to 0.21, Wald's z = 0.32, p = 0.75, or for semantic–thematic constraints: β = −0.19 (0.15), CI −0.53 to 0.14, Wald's z = 1.1, p = 0.25.

The pattern for conversational discourse was similar (Fig. 2D and Table 3). Both groups used this information to inform their final actions [controls: β = −0.42 (0.20), CI −0.82 to −0.02, Wald's z = 2.1, p = 0.04; people with schizophrenia: β = −0.58 (0.20), CI −0.99 to −0.18, Wald's z = 2.9, p = 0.004] and there was no significant difference between the two groups [no significant interaction between information bias and group, β = −0.09 (0.14), CI −0.37 to 0.19, Wald's z = 0.62, p = 0.54]. Interestingly, despite showing an effect on controls’ eye movements (see above), pragmatically relevant visual context (Fig. 2C and Table 3) did not significantly affect controls’ final actions [β = −0.24 (0.17), CI −0.56 to 0.09, Wald's z = 1.4, p = 0.16]Footnote ^†Footnote ¹. It also did not significantly affect patients’ final actions [β = −0.10 (0.22), CI −0.54 to 0.33, Wald's z = 0.45, p = 0.65], and there was no between-group difference in these effects [no significant interaction between information bias and group, β = 0.04 (0.12), CI −0.19 to 0.27, Wald's z = 0.32, p = 0.75].

Discussion

This study used the visual-world eye-tracking paradigm to compare how people with schizophrenia and demographically matched healthy controls use two types of lower level information (prosodic and lexical representations) and two types of higher level information (pragmatic and discourse representations) to guide syntactic processing during naturalistic spoken language comprehension. We found a dissociation in how the groups use these different types of cues as language is processed. In both groups, eye movements were robustly affected by a sentence's prosodic phrasing, as well as by the lexical constraints of its verb, suggesting that these lower level cues quickly biased syntactic processing to influence interpretation. However, in comparison with healthy controls, higher level cues – pragmatically relevant visual information and conversational discourse information – had a significantly reduced effect on the eye movements of people with schizophrenia, suggesting that they did not use these cues to immediately bias syntactic processing and sentence interpretation. Despite these differences in online processing, the two groups did ultimately reach the same interpretations, as reflected by their final actions.

These findings suggest that people with schizophrenia are impaired in their ability to predictively use higher level information in a highly interactive top-down fashion to inform the immediate processing and interpretation of incoming information. Importantly, this cannot easily be explained by a more general cognitive deficit. Such general deficits can sometimes lead to the artificial appearance of a differential deficit because of task demands or performance at ceiling or floor (see Chapman and Chapman, Reference Chapman and Chapman1973; Gold and Dickinson, Reference Gold and Dickinson2012). However, our eye-tracking paradigm posed essentially no task demands (participants simply needed to interpret simple sentences with no ‘correct’ interpretations)Footnote ², and performance was never at either ceiling or floor in our key measures.

Our findings go beyond prior work in several ways. The demonstration of a dissociation between the use of higher and lower level information to process the syntactic structure of an entire sentence extends previous findings reporting similar dissociations between the effects of higher level discourse and lower level lexical information on semantic processing of individual words within sentences (Titone et al., Reference Titone, Levy and Holzman2000; Sitnikova et al., Reference Sitnikova, Salisbury, Kuperberg and Holcomb2002; Kuperberg et al., Reference Kuperberg, Sitnikova, Goff and Holcomb2006; Ditman et al., Reference Ditman, Goff and Kuperberg2011; Swaab et al., Reference Swaab, Boudewyn, Long, Luck, Kring, Ragland, Ranganath, Lesh, Niendam, Solomon and Mangun2013). Our findings also show that this dissociation extends across multiple different higher and lower level information sources. Specifically, the same people with schizophrenia who were able to use lower level lexical information to modulate syntactic processing during real-time comprehension were also able to use lower level prosodic phrasing, and the same people with schizophrenia who were impaired in their use of higher level conversational discourse context were also impaired in their use of higher level pragmatically relevant visual information. This significantly bolsters claims for a selective impairment of top-down interactive processing in schizophrenia.

Our finding that people with schizophrenia were impaired in their use of non-verbal pragmatic information (i.e. relevant information within the surrounding visual scene) is consistent with other evidence of pragmatic communicative difficulties in schizophrenia (e.g. Harrow et al., Reference Harrow, Lanin-Kettering and Miller1989; Meilijson et al., Reference Meilijson, Kasher and Elizur2004; Colle et al., Reference Colle, Angeleri, Vallana, Sacco, Bara and Bosco2013; Bambini et al., Reference Bambini, Arcara, Bechi, Buonocore, Cavallaro and Bosia2016; Pawełczyk et al., Reference Pawełczyk, Kotlicka-Antczak, Łojek, Ruszpel and Pawełczyk2017), which may be related to more general theory of mind deficits (Frith, Reference Frith2004; but see McCabe et al., Reference McCabe, Leudar and Antaki2004). This finding also speaks to the precise role of working memory in language processing: given that participants could always see the visual scene in front of them, the relative insensitivity to this type of information in the schizophrenia group implies that high-level impairments are not solely due to problems in maintaining or manipulating higher level linguistic information over time within working memory. Rather, they suggest a more specific impairment in the top-down use of goal-relevant information to constrain processing, which may be dissociable from simple maintenance demands in schizophrenia (e.g. see Kim et al., Reference Kim, Somerville, Johnstone, Polis, Alexander, Shin and Whalen2004; Barch and Smith, Reference Barch and Smith2008 for discussion).

The key features of our study – its naturalistic methodology and broad exploration of linguistic context – license a number of novel conclusions. However, it is important to note how inferences from these data should be constrained. For example, one strength of our study was that the same participants completed multiple different tasks, permitting conclusions about patterns of strength and weakness. However, our sample size was comparatively small. This, along with the relatively small proportion of female participants, should be borne in mind when considering the generalizability of our findings, particularly over whether this pattern of results is a stable feature of schizophrenia or whether it evolves over the course of the disorder or through its pharmacological treatment. While we did not find correlations between performance and either age or medication (see online Supplementary Material), a definitive answer to this question would require a larger sample size and, ideally, longitudinal data. It will also be important to determine whether a similar dissociation is evident in people at high risk for developing schizophrenia.

Our main finding – eye-movement evidence that individuals with schizophrenia are selectively impaired in their use of higher level information to predictively and interactively influence processing of bottom-up linguistic input – is consistent with more general frameworks proposing that a breakdown of predictive mechanisms can explain multiple aspects of the schizophrenia syndrome (Corlett et al., Reference Corlett, Frith and Fletcher2009; Fletcher and Frith, Reference Fletcher and Frith2009; Corlett et al., Reference Corlett, Taylor, Wang, Fletcher and Krystal2010; Adams et al., Reference Adams, Stephan, Brown, Frith and Friston2013). Importantly, however, this theory does not imply that higher level representations are inherently abnormal or that they cannot be used at all in schizophrenia. Rather, it emphasizes a disturbance in the connections that allow inputs from higher levels of representation to rapidly and predictively influence processing at intermediate levels of representation, thereby constraining activity from lower levels of representation as they become available (Brown and Kuperberg, Reference Brown and Kuperberg2015). Such fast, online predictive processes are thought to play a critical role in allowing language to be understood quickly and accurately in healthy individuals (Kuperberg and Jaeger, Reference Kuperberg and Jaeger2016).

Our focus on top-down connections should also not be taken to imply that lower level perceptual processing is never impaired in schizophrenia, as disturbances in acoustic or lexical processing are well-attested (Cienfuegos et al., Reference Cienfuegos, March, Shelley and Javitt1999; Kasai et al., Reference Kasai, Nakagome, Itoh, Koshida, Hata, Iwanami, Fukuda and Kato2002; Javitt and Freedman, Reference Javitt and Freedman2015). However, our findings raise the interesting possibility that apparent low-level perceptual disturbances may stem from disturbances in top-down predictions (Hemsley, Reference Hemsley1993; Silverstein et al., Reference Silverstein, Matteson and Knight1996; Silverstein et al., Reference Silverstein, Hatashita-Wong, Schenkel, Wilkniss, Kovács, Fehér, Smith, Goicochea, Uhlhaas, Carpiniello and Savitz2006; Ford and Mathalon, Reference Ford and Mathalon2012; see Brown and Kuperberg, Reference Brown and Kuperberg2015, for discussion). This idea also raises the possibility that a breakdown in top-down interactions might actually cause lower level representations to develop abnormally, given the close relationship between prediction and learning in linguistic (Dell and Chang, Reference Dell and Chang2014; Kleinschmidt and Jaeger, Reference Kleinschmidt and Jaeger2015; Rabagliati et al., Reference Rabagliati, Gambi and Pickering2014) and non-linguistic (Rescorla, Reference Rescorla1988) domains (Adcock et al., Reference Adcock, Dale, Fisher, Aldebot, Genevsky, Simpson, Nagarajan and Vinogradov2009; Brown and Kuperberg, Reference Brown and Kuperberg2015). Future longitudinal work will be necessary for understanding the developmental relationship between predictive processing based on higher level representations and low-level perceptual processing in schizophrenia.

Finally, our finding that patients were impaired in their use of higher level cues in our naturalistic task has potential implications for understanding the use of spoken language in real-world contexts in schizophrenia. For example, the predictive use of higher level information plays a vital role in allowing smooth turn-taking during every day conversational interactions (de Ruiter et al., Reference de Ruiter, Mitterer and Enfield2006; Magyari and de Ruiter, Reference Magyari and de Ruiter2012). It also ensures that language comprehension is fast and accurate in noisy or challenging environments, such as when listening to announcements on public transport or attending to one speaker amongst many in social contexts. Our data shed light on why real-world communication situations like these may present important challenges in schizophrenia (Brown and Kuperberg, Reference Brown and Kuperberg2015). In addition, our finding that, given enough time, patients were able to use these top-down cues to inform their final interpretations (see also Ditman and Kuperberg, Reference Ditman and Kuperberg2007; Kuperberg et al., Reference Kuperberg, Ditman and Choi Perrachione2018) suggests that, despite such challenges, language deficits may not necessarily manifest using traditional ‘off-line’ assessment tools. We suggest that the visual-world eye-tracking method is an ideally naturalistic and well-controlled solution for studying these real-world communication issues in schizophrenia.

Supplementary material

The supplementary material for this article can be found at https://doi.org/10.1017/S0033291718001952

Acknowledgements

This work was funded by the National Institute of Mental Health (R01MH071635 to G.R.K.), by the Economic and Social Research Council (ES/L01064X/1 to H.R.), by the National Science Foundation (BCS-0921012 to J.S.) and by a fellowship from the Harvard Mind, Brain, Behavior Initiative (to H.R., G.K., and J.S.). The authors are grateful to Donald Goff, Leah Briggs, and Claire Oppenheim for supporting patient recruitment, to Paul Mains, Gianna Wilkie, Dan Kim, Kristina Fanucci, and Margarita Zeitlin for supporting recruitment and testing, and to Meredith Brown for her comments on the manuscript.

Footnotes

^† The notes appear after the main text.

¹ It is unclear why control participants did not show this predicted effect, as it has previously been described in both healthy college students and children (Dahan & Tanenhaus, Reference Dahan and Tanenhaus2004; Snedeker & Trueswell, Reference Snedeker and Trueswell2004; Tanenhaus et al., Reference Tanenhaus, Spivey-Knowlton, Eberhard and Sedivy1995). One possibility is that this null finding is a ‘false negative’. However, it is also possible that the effect of visual information is simply less strong in the population from which our control group was drawn, which differs from these previously studied populations in a number of demographic ways. Importantly, for the purpose of this study, control participants did show a significant online effect (as indexed by their eye movements), while, as described above, people with schizophrenia failed to show this online effect.

² Note that this differs from many laboratory tasks and paradigms, such as the Stroop or the AX-CPT, in which the use of top-down information entails the use of specific task-relevant goals to over-ride prepotent bottom-up responses. In such tasks, using ‘top-down’ information is inherently more difficult than using bottom-up information.

References

Adams, RA, Stephan, KE, Brown, HR, Frith, CD and Friston, KJ (2013) The computational anatomy of psychosis. Frontiers in Psychiatry 4, 47.Google Scholar

Adcock, RA, Dale, C, Fisher, M, Aldebot, S, Genevsky, A, Simpson, GV, Nagarajan, S and Vinogradov, S (2009) When top-down meets bottom-up: auditory training enhances verbal memory in schizophrenia. Schizophrenia Bulletin 35, 1132–1141.Google Scholar

Andreasen, NC (1979 a) Thought, language and communication disorders. I. Clinical assessment, definition of terms, and evaluation of their reliability. Archives of General Psychiatry 36, 1315–1321.Google Scholar

Andreasen, NC (1979 b) Thought, language and communication disorders. II. Diagnostic significance. Archives of General Psychiatry 36, 1325–1330.Google Scholar

Andreasen, NC (1984 a) Scale for the Assessment of Negative Symptoms (SANS). Iowa City: The University of Iowa.Google Scholar

Andreasen, NC (1984 b) Scale for the Assessment of Positive Symptoms (SAPS). Iowa City, IA: The University of Iowa.Google Scholar

Andreasen, NC (1986) Scale for assessment of thought, language and communication (TLC). Schizophrenia Bulletin 12, 473–482.Google Scholar

Bambini, V, Arcara, G, Bechi, M, Buonocore, M, Cavallaro, R and Bosia, M (2016) The communicative impairment as a core feature of schizophrenia: frequency of pragmatic deficit, cognitive substrates, and relation with quality of life. Comprehensive Psychiatry 71, 106–120.Google Scholar

Barch, DM and Smith, E (2008) The cognitive neuroscience of working memory: relevance to CNTRICS and schizophrenia. Biological Psychiatry 64, 11–17.Google Scholar

Bates, DM, Mächler, M, Bolker, B and Walker, S (2015) Fitting linear mixed-effects models using lme4. Journal of Statistical Software 67, 1–48.Google Scholar

Blair, JR and Spreen, O (1989) Predicting premorbid IQ: a revision of the National Adult Reading Test. Clinical Neuropsychologist 3, 129–136.Google Scholar

Bleuler, E (1911/1950) Dementia Praecox, or the Group of Schizophrenias (J. Zinker, Trans.). New York: International Universities Press.Google Scholar

Boudewyn, MA, Carter, CS and Swaab, TY (2012) Cognitive control and discourse comprehension in schizophrenia. Schizophrenia Research and Treatment 2012, 484–502.Google Scholar

Bowie, CR and Harvey, PD (2008) Communication abnormalities predict functional outcomes in chronic schizophrenia: differential associations with social and adaptive functions. Schizophrenia Research 103, 240–247.Google Scholar

Brown, M and Kuperberg, GR (2015) A hierarchical generative framework of language processing: linking language perception, interpretation, and production abnormalities in schizophrenia. Frontiers in Human Neuroscience 9, 643.Google Scholar

Chapman, LJ and Chapman, JP (1973) Problems in the measurement of cognitive deficit. Psychological Bulletin 79, 380–385.Google Scholar

Cienfuegos, A, March, L, Shelley, A-M and Javitt, DC (1999) Impaired categorical perception of synthetic speech sounds in schizophrenia. Biological Psychiatry 45, 82–88.Google Scholar

Cohen, JD and Servan-Schreiber, D (1992) Context, cortex, and dopamine: a connectionist approach to behaviour and biology in schizophrenia. Psychological Review 99, 45–77.Google Scholar

Colle, L, Angeleri, R, Vallana, M, Sacco, K, Bara, BG and Bosco, FM (2013) Understanding the communicative impairments in schizophrenia: a preliminary study. Journal of Communication Disorders 46, 294–308.Google Scholar

Corlett, PR, Frith, CD and Fletcher, PC (2009) From drugs to deprivation: a Bayesian framework for understanding models of psychosis. Psychopharmacology 206, 515–530.Google Scholar

Corlett, PR, Taylor, JR, Wang, XJ, Fletcher, PC and Krystal, JH (2010) Toward a neurobiology of delusions. Progress in Neurobiology 92, 345–369.Google Scholar

Dahan, D and Tanenhaus, MK (2004) Continuous mapping from sound to meaning in spoken-language comprehension: immediate effects of verb-based thematic constraints. Journal of Experimental Psychology: Learning, Memory, and Cognition 30, 498–513.Google Scholar

Dell, GS and Chang, F (2014) The P-chain: relating sentence production and its disorders to comprehension and acquisition. Philosophical Transactions of the Royal Society B: Biological Sciences 369, 20120394.Google Scholar

de Ruiter, JP, Mitterer, H and Enfield, NJ (2006) Projecting the end of a speaker's turn: a cognitive cornerstone of conversation. Language 82, 515–535.Google Scholar

Diehl, JJ, Friedberg, C, Paul, R and Snedeker, J (2015) The use of prosody during syntactic processing in children and adolescents with autism spectrum disorders. Development and Psychopathology 27, 867–884.Google Scholar

Ditman, T and Kuperberg, GR (2007) The time course of building discourse coherence in schizophrenia: an ERP investigation. Psychophysiology 44, 991–1001.Google Scholar

Ditman, T, Goff, D and Kuperberg, GR (2011) Slow and steady: sustained effects of lexico-semantic associations can mediate referential impairments in schizophrenia. Cognitive, Affective, & Behavioral Neuroscience 11, 245–258.Google Scholar

Elman, JL, Hare, M and McRae, K (2004) Cues, constraints, and competition in sentence processing. In Beyond Nature-Nurture: Essays in Honor of Elizabeth Bates. Mahwah, NJ: Lawrence Erlbaum Associates Publishers, pp. 111–138.Google Scholar

Elvevåg, B, Foltz, PW, Weinberger, DR and Goldberg, TE (2007) Quantifying incoherence in speech: an automated methodology and novel application to schizophrenia. Schizophrenia Research 93, 304–316.Google Scholar

First, M, Spitzer, R, Miriam, G and Williams, J (2002 a) Structured Clinical Interview for DSM-IV-TR Axis I Disorders, Research Version, Non-patient Edition (SCID-I/NP). Retrieved from New York.Google Scholar

First, M, Spitzer, R, Miriam, G and Williams, J (2002 b) Structured Clinical Interview for DSM-IV-TR Axis I Disorders, Research Version, Patient Edition. (SCID-I/P). Retrieved from New York.Google Scholar

Fletcher, PC and Frith, CD (2009) Perceiving is believing: a Bayesian approach to explaining the positive symptoms of schizophrenia. Nature Reviews Neuroscience 10, 48–58.Google Scholar

Ford, JM and Mathalon, DH (2012) Anticipating the future: automatic prediction failures in schizophrenia. International Journal of Psychophysiology 83, 232–239.Google Scholar

Frith, CD (2004) Schizophrenia and theory of mind. Psychological Medicine 34, 385–389.Google Scholar

Gambi, C, Pickering, MJ and Rabagliati, H (2016) Beyond associations: sensitivity to structure in pre-schoolers’ linguistic predictions. Cognition 157, 340–351.Google Scholar

Gardner, DM, Murphy, AL, O'Donnell, H, Centorrino, F and Baldessarini, RJ (2010) International consensus study of antipsychotic dosing. American Journal of Psychiatry 167, 686–693.Google Scholar

Gold, JM and Dickinson, D (2012) ‘Generalized cognitive deficit’ in schizophrenia: overused or underappreciated? Schizophrenia Bulletin 39, 263–265.Google Scholar

Harrow, M, Lanin-Kettering, I and Miller, JG (1989) Impaired perspective and thought pathology in schizophrenic and psychotic disorders. Schizophrenia Bulletin 15, 605–623.Google Scholar

Hemsley, DR (1993) A simple (or simplistic?) cognitive model for schizophrenia. Behaviour Research and Therapy 31, 633–645.Google Scholar

Hollingshead, AB (1965) Two Factor Index of Social Position. New Haven, CT: Yale University Press.Google Scholar

Holshausen, K, Harvey, PD, Elvevåg, B, Foltz, PW and Bowie, CR (2014) Latent semantic variables are associated with formal thought disorder and adaptive behavior in older inpatients with schizophrenia. Cortex 55, 88–96.Google Scholar

Huang, YT and Snedeker, J (2009 a) Online interpretation of scalar quantifiers: insight into the semantics-pragmatics interface. Cognitive Psychology 58, 376–415.Google Scholar

Huang, YT and Snedeker, J (2009 b) Semantic meaning and pragmatic interpretation in 5-year-olds: evidence from real-time spoken language comprehension. Developmental Psychology 45, 1723–1739.Google Scholar

Huang, YT, Stranahan, L and Snedeker, J (2017) Reconsideration on linking eye-movement data with argument realization. Paper presented at the 39th Annual Meeting of the Cognitive Science Society, London, UK.Google Scholar

Iacono, WG (1981) Dissociation of smooth-pursuit and saccadic eye tracking in remitted schizophrenics. Archives of General Psychiatry 38, 991.Google Scholar

Jahshan, C, Wynn, JK and Green, MF (2013) Relationship between auditory processing and affective prosody in schizophrenia. Schizophrenia Research 143, 348–353.Google Scholar

Javitt, DC (2009) When doors of perception close: bottom-up models of disrupted cognition in schizophrenia. Annual Review of Clinical Psychology 5, 249–275.Google Scholar

Javitt, DC and Freedman, R (2015) Sensory processing dysfunction in the personal experience and neuronal machinery of schizophrenia. American Journal of Psychiatry 172, 17–31.Google Scholar

Kantrowitz, JT, Hoptman, MJ, Leitman, DI, Silipo, G and Javitt, DC (2014) The 5% difference: early sensory processing predicts sarcasm perception in schizophrenia and schizo-affective disorder. Psychological Medicine 44, 25–36.Google Scholar

Kasai, K, Nakagome, K, Itoh, K, Koshida, I, Hata, A, Iwanami, A, Fukuda, M and Kato, N (2002) Impaired cortical network for preattentive detection of change in speech sounds in schizophrenia: a high-resolution event-related potential study. American Journal of Psychiatry 159, 546–553.Google Scholar

Keysar, B, Barr, DJ, Balin, JA and Brauner, JS (2000) Taking perspective in conversation: the role of mutual knowledge in comprehension. Psychological Science 11, 32–38.Google Scholar

Kim, H, Somerville, LH, Johnstone, T, Polis, S, Alexander, AL, Shin, LM and Whalen, PJ (2004) Contextual modulation of amygdala responsivity to surprised faces. Journal of Cognitive Neuroscience 16, 1730–1745.Google Scholar

Kleinschmidt, DF and Jaeger, FT (2015) Robust speech perception: recognize the familiar, generalize to the similar, and adapt to the novel. Psychological Review 122, 148–203.Google Scholar

Kreher, DA, Goff, D and Kuperberg, GR (2009) Why all the confusion? Experimental task explains discrepant semantic priming effects in schizophrenia under ‘automatic’ conditions: evidence from event-related potentials. Schizophrenia Research 111, 174–181.Google Scholar

Kuperberg, GR (2010 a) Language in schizophrenia part 1: an introduction. Language and Linguistics Compass 4, 576–589.Google Scholar

Kuperberg, GR (2010 b) Language in schizophrenia Part 2: what can psycholinguistics bring to the study of schizophrenia…and vice versa? Language and Linguistics Compass 4, 590–604.Google Scholar

Kuperberg, GR and Jaeger, TF (2016) What do we mean by prediction in language comprehension? Language, Cognition, and Neuroscience 31, 32–59.Google Scholar

Kuperberg, GR, McGuire, PK and David, A (1998) Reduced sensitivity to linguistic context in schizophrenic thought disorder: evidence from online monitoring for words in linguistically-anomalous sentences. Journal of Abnormal Psychology 107, 423–434.Google Scholar

Kuperberg, GR, Sitnikova, T, Goff, D and Holcomb, PJ (2006) Making sense of sentences in schizophrenia: electrophysiological evidence for abnormal interactions between semantic and syntactic processing. Journal of Abnormal Psychology 115, 251–265.Google Scholar

Kuperberg, GR, Ditman, T and Choi Perrachione, A (2018) When proactivity fails: an electrophysiological study of establishing reference in schizophrenia. Biological Psychiatry: Cognitive Neuroscience and Neuroimaging 3, 77–87.Google Scholar

Leitman, DI, Foxe, JJ, Butler, PD, Saperstein, A, Revheim, N and Javitt, DC (2005) Sensory contributions to impaired prosodic processing in schizophrenia. Biological Psychiatry 58, 56–61.Google Scholar

Magyari, L and de Ruiter, JP (2012) Prediction of turn-ends based on anticipation of upcoming words. Frontiers in Psychology 3, 376.Google Scholar

Mathalon, DH, Faustman, WO and Ford, JM (2002) N400 and automatic semantic processing abnormalities in patients with schizophrenia. Archives of General Psychiatry 59, 641–648.Google Scholar

McCabe, R, Leudar, I and Antaki, C (2004) Do people with schizophrenia display theory of mind deficits in clinical interactions? Psychological Medicine 34, 401–412.Google Scholar

McClelland, JL and Rumelhart, DE (1981) An interactive activation model of context effects in letter perception: I. An account of basic findings. Psychological Review 88, 375–407.Google Scholar

Meilijson, SR, Kasher, A and Elizur, A (2004) Language performance in chronic schizophrenia. Journal of Speech Language and Hearing Research 47, 695.Google Scholar

Minzenberg, MJ, Ober, BA and Vinogradov, S (2002) Semantic priming in schizophrenia: a review and synthesis. Journal of the International Neuropsychological Society 8, 699–720.Google Scholar

Mirman, D, Dixon, JA and Magnuson, JS (2008) Statistical and computational models of the visual world paradigm: growth curves and individual differences. Journal of Memory and Language 59, 475–494.Google Scholar

Pawełczyk, A, Kotlicka-Antczak, M, Łojek, E, Ruszpel, A and Pawełczyk, T (2017) Schizophrenia patients have higher-order language and extralinguistic impairments. Schizophrenia Research 192, 274–280.Google Scholar

Rabagliati, H, Gambi, C and Pickering, MJ (2014) Learning to predict or predicting to learn? Language, Cognition and Neuroscience 31, 94–105.Google Scholar

R Core Team (2016) R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. ISBN 3-900051-07-0.Google Scholar

Rescorla, RA (1988) Pavlovian conditioning: it's not what you think it is. American Psychologist 43, 151.Google Scholar

Revheim, ME, Hole, KH, Bruland, OS, Reitan, E, Bjerkehagen, B, Julsrud, L and Seierstad, T (2014) Multimodal functional imaging for early response assessment in GIST patients treated with imatinib. Acta Oncologica 53, 143–148.Google Scholar

Rumelhart, DE and McClelland, JL (1982) An interactive activation model of context effects in letter perception: II. The contextual enhancement effect and some tests and extensions of the model. Psychological Review 89, 60–94.Google Scholar

Sedivy, JC, Tanenhaus, MK, Chambers, CG and Carlson, GN (1999) Achieving incremental semantic interpretation through contextual representation. Cognition 71, 109–147.Google Scholar

Silverstein, S, Hatashita-Wong, M, Schenkel, L, Wilkniss, S, Kovács, I, Fehér, A, Smith, T, Goicochea, C, Uhlhaas, P, Carpiniello, K and Savitz, A (2006) Reduced top-down influences in contour detection in schizophrenia. Cognitive Neuropsychiatry 11, 112–132.Google Scholar

Silverstein, SM, Matteson, S and Knight, RA (1996) Reduced top-down influence in auditory perceptual organization in schizophrenia. Journal of Abnormal Psychology 105, 663–667.Google Scholar

Sitnikova, T, Salisbury, DF, Kuperberg, GR and Holcomb, PJ (2002) Electrophysiological insights into language processing in schizophrenia. Psychophysiology 39, 851–860.Google Scholar

Snedeker, J and Trueswell, JC (2004) The developing constraints on parsing decisions: the role of lexical-biases and referential scenes in child and adult sentence processing. Cognitive Psychology 49, 238–299.Google Scholar

Snedeker, J and Yuan, S (2008) Effects of prosodic and lexical constraints on parsing in young children (and adults). Journal of Memory and Language 58, 574–608.Google Scholar

Spitzer, M, Braun, U, Hermle, L and Maier, S (1993) Associative semantic network dysfunction in thought-disordered schizophrenic patients: direct evidence from indirect semantic priming. Biological Psychiatry 34, 864–877.Google Scholar

Swaab, TY, Boudewyn, MA, Long, DL, Luck, SJ, Kring, AM, Ragland, JD, Ranganath, C, Lesh, T, Niendam, T, Solomon, M and Mangun, GR (2013) Spared and impaired spoken discourse processing in schizophrenia: effects of local and global language context. Journal of Neuroscience 33, 15578–15587.Google Scholar

Tanenhaus, MK and Trueswell, JC (2006) Eye movements and spoken language comprehension. In Traxler, MJ and Gernsbacher, MA (eds), Handbook of Psycholinguistics, 2nd Edn. Oxford: Oxford University Press, pp. 863–900.Google Scholar

Tanenhaus, MK, Spivey-Knowlton, MJ, Eberhard, KM and Sedivy, JC (1995) Integration of visual and linguistic information in spoken language comprehension. Science 268, 1632–1634.Google Scholar

Titone, D and Levy, DL (2004) Lexical competition and spoken word identification in schizophrenia. Schizophrenia Research 68, 75–85.Google Scholar

Titone, D, Levy, DL and Holzman, PS (2000) Contextual insensitivity in schizophrenic language processing: evidence from lexical ambiguity. Journal of Abnormal Psychology 109, 761–767.Google Scholar

Trueswell, JC, Sekerina, I, Hill, NM and Logrip, ML (1999) The kindergarten-path effect: studying on-line sentence processing in young children. Cognition 73, 89–134.Google Scholar

Whitford, V, O'Driscoll, GA, Pack, CC, Joober, R, Malla, A and Titone, D (2013) Reading impairments in schizophrenia relate to individual differences in phonological processing and oculomotor control: evidence from a gaze-contingent moving window paradigm. Journal of Experimental Psychology: General 142, 57–75.Google Scholar

Whitford, V, O'Driscoll, GA and Titone, D (2017) Reading deficits in schizophrenia and their relationship to developmental dyslexia: a review. Schizophrenia Research 193, 11–22.Google Scholar

Table 1. Definitions of terms and summary of manipulations

Table 2. Demographic, medication and symptom measures

Rabagliati et al. supplementary material

Rabagliati et al. supplementary material 1

File 1.1 MB

Article contents

Spared bottom-up but impaired top-down interactive effects during naturalistic language processing in schizophrenia: evidence from the visual-world paradigm

Abstract

Keywords

Methods and materials

Participants

General procedures

Task 1: use of prosodic phrasing

Task 2: use of the verb's semantic–thematic constraints and pragmatically relevant visual information

Task 3: use of conversational discourse information

Analysis

Analysis of eye movements

Analysis of final actions

Results

Analysis of online processing (eye movements)

Effects of prosodic phrasing and verb semantic–thematic constraints

Effects of pragmatically relevant visual information and conversational discourse information

Analysis of final interpretations (final actions)

Discussion

Supplementary material

Acknowledgements

Footnotes

References

Rabagliati et al. supplementary material

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests