Introduction
In a visual masking paradigm, the visibility of a target is disrupted by the presence of a mask that occurs briefly before or after the target, for forward masking and backward masking, respectively (Breitmeyer, Reference Breitmeyer1984; Breitmeyer & Ogmen, Reference Breitmeyer and Ogmen2000). A well-replicated finding is that schizophrenia patients perform more poorly than comparison subjects in identifying a visual target in the presence of a mask (Saccuzzo & Braff, Reference Saccuzzo and Braff1981, Reference Saccuzzo and Braff1986; Saccuzzo & Schubert, Reference Saccuzzo and Schubert1981; Rund, Reference Rund1993; Green et al. Reference Green, Nuechterlein and Mintz1994a, Reference Green, Nuechterlein and Mintzb; Cadenhead et al. Reference Cadenhead, Serper and Braff1998).
Visual masking in schizophrenia has been studied for several reasons, including evidence that impairment on this paradigm may indicate vulnerability to schizophrenia (Miller et al. Reference Miller, Saccuzzo and Braff1979; Lieb et al. Reference Lieb, Denz, Hess, Schuttler, Kornhuber and Schreiber1996; Green et al. Reference Green, Nuechterlein and Breitmeyer1997, Reference Green, Nuechterlein, Breitmeyer and Mintz1999, Reference Green, Nuechterlein, Breitmeyer and Mintz2006; Keri et al. Reference Keri, Kelemen, Benedek and Janka2001; Bedwell et al. Reference Bedwell, Brown and Miller2003). Also, deficits on masking tasks are related to functional outcome through key mediating variables, such as social cognition (Sergi et al. Reference Sergi, Rassovsky, Nuechterlein and Green2006; Rassovsky et al. Reference Rassovsky, Horan, Lee, Sergi and Green2010). An additional reason for applying masking paradigms to the study of schizophrenia is that they can be used to probe underlying neural systems. For example, masking paradigms have been used to probe the relative contribution of the magnocellular versus parvocellular visual processing channels (Schuck & Lee, Reference Schuck and Lee1989; Green et al. Reference Green, Nuechterlein and Mintz1994b; Cadenhead et al. Reference Cadenhead, Serper and Braff1998; Schechter et al. Reference Schechter, Butler, Silipo and Javitt2003), the role of neural oscillations in the γ band (Green et al. Reference Green, Mintz, Salveson, Nuechterlein, Breitmeyer, Light and Braff2003a; Wynn et al. Reference Wynn, Light, Breitmeyer, Nuechterlein and Green2005) and the functioning of key visual processing regions, such as the lateral occipital complex (area LO) (Green et al. Reference Green, Lee, Cohen, Engel, Korb, Nuechterlein, Wynn and Glahn2009).
There is strong evidence to suggest that most visual masking paradigms involve two distinct processes, an early fast-acting component associated with object formation and a later component that acts through object substitution (Di Lollo et al. Reference Di Lollo, Enns and Rensink2000; Enns & Di Lollo, Reference Enns and Di Lollo2000; Ro et al. Reference Ro, Breitmeyer, Burton, Singhal and Lane2003; Enns, Reference Enns2004; Chen & Treisman, Reference Chen and Treisman2009; Dux et al. Reference Dux, Visser, Goodhew and Lipp2010). Masking associated with object formation occurs in a very brief temporal window (i.e. target-mask intervals of 0–100 ms), when the target and mask are presented so closely in time that they essentially join together and form an integrated target–mask composite that makes it hard to identify the target. In contrast, masking by object substitution occurs when the target percept is replaced by the mask before it reaches awareness and this effect is maximal at delays following the first 100 ms. The theory behind masking by object substitution is that perception is a consequence of recurrent communication between lower-level and higher-level neural processes. Initially, the information is handled by lower-level units in a ‘feed forward’ sweep. Feed forward models of visual processing (i.e. those that involve the flow of information from the retina through to the hierarchical stages of visual cortex) have been predominant for explaining visual masking. However, cortical feedback (i.e. re-entrant) sweeps are necessary to resolve ambiguity between possible alternative visual patterns. Recent advances in cognitive and perceptual neuroscience have emphasized the importance of these re-entrant processes of feedback from higher to lower levels of the cortical hierarchy for processing visual stimuli within the cortex. These re-entrant processes appear to be essential for conscious perception (Ro et al. Reference Ro, Breitmeyer, Burton, Singhal and Lane2003; Chen & Treisman, Reference Chen and Treisman2009; Dux et al. Reference Dux, Visser, Goodhew and Lipp2010).
Object substitution is thought to be based solely on re-entrant processing (Enns & Di Lollo, Reference Enns and Di Lollo2000; Enns, Reference Enns2004), in which the representation of the target is replaced with the representation of the mask. Hence, if patients with schizophrenia show abnormalities in a paradigm that is limited to masking by object substitution, it would implicate aberrant re-entrant processes. However, all of the masking paradigms used in schizophrenia research have involved both object formation and object substitution components. There is one specialized procedure, four-dot masking, which involves only masking by object substitution (Enns, Reference Enns2004), but it has not been previously applied to schizophrenia. In this paradigm, the mask consists of four relatively small dots, one at each corner of the target. This type of mask provides no contour information and does not overlap with the target. Hence, it cannot integrate with the target in a way that would disrupt identification.
In this project, we administered three visual processing procedures to characterize better visual perceptual abnormalities in schizophrenia. These procedures included: (1) forward and backward location masking with overlapping target and mask stimuli; (2) four-dot masking; (3) a cuing procedure with a single dot that does not mask the target. The cuing procedure with a single dot was included to assess iconic decay. Four-dot masking involves longer processing delays than location masking and depends on maintaining an iconic representation (i.e. visible persistent) of the target. Hence, if patients and controls differ in their performance on four-dot masking, it could possibly be explained by different rates of iconic decay. Four-dot masking can have a delayed onset of the mask, or it can have the same onset as the target and a delayed offset (called common-onset masking). For the purposes of this study, we used the delayed onset version so that it would be more comparable with the location masking.
Using these three procedures we addressed the following goals. First, we wanted to see whether we could replicate our previous findings that schizophrenia patients show impairment on location masking after they have been equated for sensory input of the target. Second, we used a four-dot masking procedure to evaluate whether the groups differed in masking by object substitution. Third, we used a cuing task to determine whether the patients and controls differed in their rates of visible persistence/iconic decay. We predicted that patients would show increased susceptibility to the mask in four-dot masking performance but normal rates of visible persistence.
Method
Participants
A total of 136 (29 female) patients were recruited from out-patient treatment clinics at the Veterans Affairs Greater Los Angeles Healthcare System and through presentations at residences in the community. Patients met criteria for schizophrenia (n=124) or schizoaffective disorder (n=12) based on the SCID Axis I Disorders (First et al. Reference First, Spitzer, Gibbon and Williams1997). Altogether, 112 patients were receiving atypical antipsychotic medications, seven were receiving typical antipsychotic medications, nine were receiving both types of medication and eight were not taking an antipsychotic medication at time of assessment.
In total, 79 (20 female) healthy control participants were recruited through newspaper and Internet advertisements. Control participants were screened with the SCID and SCID-II (First et al. Reference First, Gibbon, Spitzer, Williams and Benjamin1996) and were excluded if they met criteria for any lifetime psychotic disorder, bipolar mood disorder, recurrent depression, substance dependence, paranoid, schizotypal or schizoid personality disorder and any evidence (according to participant report) of a history of psychotic disorder among their first-degree relatives. Additional exclusion criteria for both groups included aged <18 or >60 years, active substance use disorder in the past 6 months, identifiable neurological disorder, IQ <70, history of loss of consciousness for >1 h or insufficient fluency in English. All participants had the capacity to give informed consent and provided written informed consent after all procedures were fully explained in accordance with procedures approved by the Institutional Review Boards at UCLA and the Veterans Affairs Greater Los Angeles Healthcare System.
Clinical symptom ratings were conducted with the expanded 24-item UCLA version of the Brief Psychiatric Rating Scale (Ventura et al. Reference Ventura, Lukoff, Nuechterlein, Liberman, Green and Shaner1993b) and with the Scale for the Assessment of Negative Symptoms (Andreasen, Reference Andreasen1984). All clinical interviewers were trained through the Treatment Unit of the Department of Veterans Affairs VISN 22 Mental Illness Research, Education, and Clinical Center based on established procedures (Ventura et al. Reference Ventura, Green, Shaner and Liberman1993a, Reference Ventura, Liberman, Green and Shaner1998). The process included formal didactics, achieving a minimum level of reliability using an extensive library of videotaped interviews, as well as live, co-rated interviews conducted with faculty members. After certification, all raters participated in a continuous quality assurance programme that involved periodic reliability checks and co-rated live interviews with faculty.
Procedures
Participants performed three different visual processing tasks. The first task was identifying the location of a target (location masking) and consisted of both forward and backward masking conditions. The two other tasks (four-dot masking and cuing task) used only a backward masking format. All stimuli were presented using E-Prime 1.1 software (Psychological Software Tools, USA) on a 17 inch cathode ray tube computer monitor running at a resolution of 640×480 pixels and a 160 Hz refresh rate, yielding a screen sweep of 6.25 ms. Participants sat 1 m away from the monitor. Procedures were administered in a set order: location masking; four-dot masking; the cuing task.
Location masking
The location masking task was based on previous studies reported in our laboratory and reported in detail elsewhere (Green et al. Reference Green, Nuechterlein and Breitmeyer2002, Reference Green, Nuechterlein, Breitmeyer, Tsuang and Mintz2003b). The target consisted of a single square with a notch that could appear at the top, bottom or left side of the square. The target could appear at one of four different locations, arranged in a notional square, on the computer screen. The masking stimulus consisted of a pattern of squares that occupied every possible target location. Targets measured 0.27×0.27° of visual angle. The mask measured 2.01×2.01° of visual angle. Examples of the stimuli are shown in Fig. 1.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160921043836934-0593:S003329171000214X:S003329171000214X_fig1g.gif?pub-status=live)
Fig. 1. The stimuli are shown for each of the visual processing paradigms. Location masking is on top, four-dot masking in the middle and cuing on the bottom.
The target was presented for 12.5 ms (two screen sweeps) and the mask was presented for 25 ms (four screen sweeps). As in previous masking studies, we first used a psychophysical procedure [critical stimulus intensity (CSI)] to equate the participants on the target threshold (Green et al. Reference Green, Nuechterlein and Breitmeyer2002). This procedure moves the contrast level up and down according to the subject's performance so that each subject is performing at 84% accuracy for identifying the unmasked target. The gray scale value determined in this CSI procedure is then used for all of the stimuli in the location masking task. In location masking, 12 trials were presented for each stimulus onset asynchrony (SOA; the interval between the onset of the target stimulus and the onset of the mask). Both forward and backward masking was assessed. In forward masking the mask preceded the target, whereas in backward masking the mask followed the target. Six SOAs (12.5, 25.0, 37.5, 50.0, 62.5, 75 ms) were used in both the forward and backward masking tasks. Additionally, an SOA of 0 ms was used (i.e. simultaneous target and mask onset). This masking condition typically yields a monotonic masking function, in which performance is lowest at a SOA of 0 ms.
Each trial started with a fixation cross presented for 400 ms. A blank screen was then presented for 500 ms. Following the blank screen, a forward or backward masking trial was presented. Participants verbally reported in which one of the four quadrants they believed the target appeared and the experimenter entered their response. The experimenter then initiated the next trial. A schematic of a trial can be seen in Fig. 1.
Four-dot masking
The four-dot masking procedure was modified from similar tasks described elsewhere (Enns & Di Lollo, Reference Enns and Di Lollo2000; Enns, Reference Enns2004). In this masking condition, four potential targets appeared in a notional square on the computer monitor followed by a mask surrounding one of the potential targets (see Fig. 1). The mask essentially cues which target the participant was supposed to identify. The target array consisted of four squares with a notch missing from either the top, bottom or left side of the square. The mask consisted of four dots that surround, but do not touch, one of the potential targets. Each potential target measured 1.55×1.55° of visual angle and was arranged in a square of 4.58×4.58° of visual angle. The four-dot mask measured 2.23×2.23° of visual angle and each dot in the mask subtended 0.23×0.23° of visual angle. The target was presented for 25 ms and the mask was presented for 37.5 ms. The target stimuli were black, presented on a white background and were supra-threshold for all subjects, unlike the location masking condition that was set to an individual's threshold. In total, 12 trials were presented for each of eight SOAs (0, 25, 50, 75, 100, 125, 150, 175 ms). In four-dot masking, performance typically decreases with increasing SOAs, unlike the pattern for location masking.
A trial started with a fixation cross presented for 450 ms, followed by a blank screen presented for 500 ms. Following the blank screen, the target and masking stimuli, separated by the above-mentioned SOAs, were presented. Participants verbally reported the direction of the notch (up, side or down) of the target that was surrounded by the four-dot mask and the experimenter entered their response. The next trial was then initiated by the experimenter.
Cuing task
A cuing task was used to assess rate of iconic decay (visible persistence). The cuing procedure used the exact same target stimuli and SOAs as the four-dot masking procedure except that instead of four dots only a single dot was used, which does not act as a mask. The single dot measured 0.34×0.34° of visual angle. Examples of the stimuli are shown in Fig. 1.
Data analysis
Accuracy was defined as percent correct (out of 12 trials) at each separate SOA for each visual processing procedure. For location masking, a two (group)×two (forward/backward) analysis of variance (ANOVA) was run. The value for forward masking was the mean of the six forward SOAs and the value for backward masking was the mean of the six backward SOAs; SOA=0 ms was not included. Because the masking function for location task in patients is well-characterized, we considered only the mean forward and mean backward values. In contrast, the function for four-dot masking in schizophrenia was unknown, so we conducted a two (group)×eight (SOA) ANOVA separately for the four-dot and cuing procedures. All statistical analyses used an a priori two-tailed significance level of 0.05 to determine significant results.
Results
Demographic information can be seen in Table 1. Patients were clinically stable with relatively low levels of symptoms. They were older than controls and had somewhat less parental education. Visual masking is known to be associated with age (Green et al. Reference Green, Nuechterlein, Breitmeyer, Tsuang and Mintz2003b) and preliminary analyses confirmed that age was associated with performance on the paradigms in this study. The effect of age on performance was not different between the two groups. Hence, we used age as a covariate in the subsequent analyses. There were no significant associations between parental education and performance on any task.
Table 1. Demographic information
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160921043836934-0593:S003329171000214X:S003329171000214X_tab1.gif?pub-status=live)
BPRS, Brief Psychiatric Rating Scale; SANS, Scale for the Assessment of Negative Symptoms.
a p<0.01 for difference between controls and patients.
Location masking
Out of the original sample, 16 patients and five controls did not perform the CSI procedure in a way that reliably converged on a threshold and these subjects were not given the location masking task. The schizophrenia sample required more contrast (higher gray scale value) than controls to achieve comparable visual input thresholds; mean (s.d.) value was 33.8 (15.3) for patients and 26.1 (14.8) for controls [t(192)=3.48, p<0.01]. For location masking, the two (group)×two (forward versus backward masking) analysis of covariance (ANCOVA) revealed a significant main effect of group [F(1, 191)=6.64, p<0.05]. This result replicates previous findings of impairment on this and similar masking procedures. The main effect of forward/backward and the interaction was not significant. The means for each SOA and standard errors, adjusted for age, are shown in Fig. 2.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160921043836934-0593:S003329171000214X:S003329171000214X_fig2g.jpeg?pub-status=live)
Fig. 2. The performance of patients and healthy controls on location masking after being matched for identification of unmasked targets. Values have been adjusted for age. On the x-axis, the negative stimulus onset asynchrony (SOA) values indicate forward masking and the positive values indicate backward masking. The figure includes bars for standard errors at each SOA and the dotted line shows chance performance at 25%.
Four-dot masking and cuing task
Before analyzing data from these tasks, we dropped subjects who were considered to have invalid performance. Specifically, we dropped subjects if they got <18 trials correct over the first three SOAs for the cuing task (i.e. <50% correct of the first three SOAs). This criterion was selected because: (1) the cuing task is the easier of the two tasks; (2) both of these tasks become harder at longer SOAs (unlike the location task) and performance reached asymptote after the first three SOAs; (3) the probability of getting ⩾18 trials correct solely by chance is <0.05 based on a binomial distribution. Altogether, 12 patients and two controls were excluded from analyses using this criterion. In addition, data were missing for three patients on the four-dot masking procedure.
For the four-dot masking, both groups showed the expected decrease in performance with increasing SOAs (see Fig. 3 a). The two (group)×eight (SOA) ANCOVA revealed a significant main effect of SOA [F(7, 1365)=7.84, p<0.001]. There was also a significant main effect of group [F(1, 195)=4.05, p<0.05]. The interaction between group and SOA was not significant. These results reveal that schizophrenia patients were more susceptible to masking by object substitution compared with healthy controls.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160921043836934-0593:S003329171000214X:S003329171000214X_fig3g.jpeg?pub-status=live)
Fig. 3. (a) The performance of each group, adjusted for age, on (a) the four-dot masking procedure and (b) the cuing task. Mask (for four-dot) or cue (for cuing) were always shown after the target (backward masking format). The groups differed significantly on four-dot masking, but not the cuing task. The figure includes bars for standard errors at each stimulus onset asynchrony (SOA) and the dotted line shows chance performance at 33%.
We next evaluated the cuing procedure to determine whether the groups differed in their rate of iconic decay. The two (group)×eight (SOA) ANCOVA revealed no significant main effects or interactions. Although the overall ANCOVA for the SOA effect was not significant, the linear contrast for the target decay over time was significant [F(1, 198)=8.00, p<0.01]. The linear contrast by group interaction was not significant. These results (Fig. 3 b) indicate that both groups showed comparable rates of decay.
Discussion
This paper yielded several key findings. First, we replicated previous findings of visual masking impairment in schizophrenia with a task that is believed to interrupt object formation. This impairment occurred after subjects were matched for sensory input of unmasked targets using a threshold procedure. Second, the groups did not differ in their iconic decay rates on a cuing task. Third, the patients differed from controls on four-dot masking, which is believed to work only through masking by object substitution. Masking by object substitution, thought to depend on re-entrant activation, occurs at higher levels of processing than other forms of masking. For example, object substitution masking can interfere at levels as high as those required for semantic processing (Reiss & Hoffman, Reference Reiss and Hoffman2006). Moreover, a recent study showed that object substitution masking occurs at levels higher than backward masking by (spatially overlapping) noise or by meta-contrast (Chakravarthi & Cavanagh, Reference Chakravarthi and Cavanagh2009).
We had predicted that patients and controls would be comparable in iconic decay. The most common way to assess iconic decay in the cognitive science literature is with a partial report procedure, in which a participant is cued about which part of a stimulus array to report. The cuing procedure in this study can be considered a modified partial report task. The partial report procedure has only been used twice before in schizophrenia research; in an early study (Knight et al. Reference Knight, Sherer and Shapiro1977) and in a recent paper (Hahn et al. Reference Hahn, Kappenman, Robinson, Fuller, Luck and Gold2010). The results of the paper from Knight et al. are complicated by the inclusion of three subgroups of schizophrenia patients and the absence of a healthy control group, but it appears that iconic decay was normal in at least one subgroup of patients. The results from Hahn et al. showed comparable iconic decay rates in patients and healthy controls, consistent with the current study. Our finding of normal iconic decay in schizophrenia patients implies that any group differences in masking cannot be accounted for by different decay rates. Because a normal rate of iconic decay was observed on the paradigm that was always administered last, differential fatigue cannot explain the pattern of results. Differential improvement in patients over the testing session, though unlikely, is a possibility.
Given the absence of differences in iconic decay, we can speculate that the observed impairment in four-dot masking might be due to other possible mechanisms. One possibility is that the well-replicated finding of problems in masking by disruption of object formation carries forward into the later-stage masking by object substitution. In this case, a single initial abnormality in constructing a visual percept would provide a weakened visual representation for the next stage of processing. An alternative explanation is that a separate abnormality affects re-entrant processes and creates a situation in which patients are more susceptible to masking by object substitution. This second explanation would suggest two separate visual processing abnormalities. It also suggests that the differences between groups would become larger as the four-dot masking delay increases (that is, larger group differences over longer SOAs). The lack of a group by SOA interaction fails to support this prediction.
Neuroimaging studies have provided insights into the neural basis for object formation and recognition, which, in turn, suggest a possible basis for abnormalities in visual processing in schizophrenia. Studies with healthy controls have demonstrated that area LO is closely tied to perceiving a masked target (Grill-Spector et al. Reference Grill-Spector, Kushnir, Hendler and Malach2000; Bar et al. Reference Bar, Tootell, Schacter, Greve, Fischl, Mendola, Rosen and Dale2001; Green et al. Reference Green, Glahn, Engel, Nuechterlein, Sabb, Strojwas and Cohen2005; Carlson et al. Reference Carlson, Rauschenberger and Verstraten2007). Hence, area LO appears to be integral for object recognition. Using functional magnetic resonance imaging, we found that patients have a blunted area LO response to masked stimuli across SOAs, suggesting that a lower, or less organized, response in area LO may be a neural basis for impairments in a variety of visual processing tasks that require intact object formation (Green et al. Reference Green, Lee, Cohen, Engel, Korb, Nuechterlein, Wynn and Glahn2009).
The results from this study can be applied more generally to models of outcome in schizophrenia. Most studies that attempt to model pathways to outcome in schizophrenia have not included perceptual measures. However, studies from our laboratory and others have found that perceptual processing measures fit well into these models and are consistent with bottom-up theoretical formulations (Leitman et al. Reference Leitman, Foxe, Butler, Saperstein, Revheim and Javitt2005, Reference Leitman, Sehatpour, Higgins, Foxe, Silipo and Javitt2010; Sergi et al. Reference Sergi, Rassovsky, Nuechterlein and Green2006; Javitt, Reference Javitt2009; Rassovsky et al. Reference Rassovsky, Horan, Lee, Sergi and Green2010). For example, we have found that visual processing in schizophrenia is associated with social cognition, which is associated, in turn, with community functioning (Sergi et al. Reference Sergi, Rassovsky, Nuechterlein and Green2006; Rassovsky et al. Reference Rassovsky, Horan, Lee, Sergi and Green2010). Similarly, other studies of early auditory perception have reported linkages to auditory measures of social cognition (Leitman et al. Reference Leitman, Foxe, Butler, Saperstein, Revheim and Javitt2005). It remains to be seen whether the information provided from object substitution masking helps to account for variance in these models that is not attributed to masking associated with object formation.
In summary, it appears that visual processing in schizophrenia is characterized by an impairment that includes not only object formation processes, but also later-stage masking by object substitution. While the results from this paper indicate the presence of an impairment in masking by object substitution, this study cannot determine whether this impairment constitutes a new and separate impairment (perhaps involving re-entrant processes) or a continuation of the known deficit in object formation. Ongoing studies with neuroimaging and electrophysiology will help to make this determination.
Acknowledgments
Support for this study came from NIMH Grants MH043292 and MH065707 (PI: Michael F. Green, Ph.D.).
Declaration of Interest
None.