Hostname: page-component-745bb68f8f-b95js Total loading time: 0 Render date: 2025-02-06T10:27:38.002Z Has data issue: false hasContentIssue false

Amusia and protolanguage impairments in schizophrenia

Published online by Cambridge University Press:  31 March 2014

J. T. Kantrowitz*
Affiliation:
Schizophrenia Research Center, Nathan Kline Institute, Orangeburg, NY, USA Department of Psychiatry, Columbia University, New York, NY, USA.
N. Scaramello
Affiliation:
Schizophrenia Research Center, Nathan Kline Institute, Orangeburg, NY, USA
A. Jakubovitz
Affiliation:
Schizophrenia Research Center, Nathan Kline Institute, Orangeburg, NY, USA
J. M. Lehrfeld
Affiliation:
Schizophrenia Research Center, Nathan Kline Institute, Orangeburg, NY, USA
P. Laukka
Affiliation:
Department of Psychology, Stockholm University, Sweden
H. A. Elfenbein
Affiliation:
Olin Business School, Washington University, St Louis, MO, USA
G. Silipo
Affiliation:
Schizophrenia Research Center, Nathan Kline Institute, Orangeburg, NY, USA
D. C. Javitt
Affiliation:
Schizophrenia Research Center, Nathan Kline Institute, Orangeburg, NY, USA Department of Psychiatry, Columbia University, New York, NY, USA.
*
*Address for correspondence: J. T. Kantrowitz, M.D., Nathan Kline Institute for Psychiatric Research, 140 Old Orangeburg Road, Orangeburg, NY 10962, USA. (Email: jk3380@columbia.edu)
Rights & Permissions [Opens in a new window]

Abstract

Background

Both language and music are thought to have evolved from a musical protolanguage that communicated social information, including emotion. Individuals with perceptual music disorders (amusia) show deficits in auditory emotion recognition (AER). Although auditory perceptual deficits have been studied in schizophrenia, their relationship with musical/protolinguistic competence has not previously been assessed.

Method

Musical ability was assessed in 31 schizophrenia/schizo-affective patients and 44 healthy controls using the Montreal Battery for Evaluation of Amusia (MBEA). AER was assessed using a novel battery in which actors provided portrayals of five separate emotions. The Disorganization factor of the Positive and Negative Syndrome Scale (PANSS) was used as a proxy for language/thought disorder and the MATRICS Consensus Cognitive Battery (MCCB) was used to assess cognition.

Results

Highly significant deficits were seen between patients and controls across auditory tasks (p < 0.001). Moreover, significant differences were seen in AER between the amusia and intact music-perceiving groups, which remained significant after controlling for group status and education. Correlations with AER were specific to the melody domain, and correlations between protolanguage (melody domain) and language were independent of overall cognition.

Discussion

This is the first study to document a specific relationship between amusia, AER and thought disorder, suggesting a shared linguistic/protolinguistic impairment. Once amusia was considered, other cognitive factors were no longer significant predictors of AER, suggesting that musical ability in general and melodic discrimination ability in particular may be crucial targets for treatment development and cognitive remediation in schizophrenia.

Type
Original Articles
Copyright
Copyright © Cambridge University Press 2014 

Introduction

Music has an ability to elicit emotions and to activate emotion-related brain regions (Juslin & Laukka, Reference Juslin and Laukka2003; Escoffier et al. Reference Escoffier, Zhong, Schirmer and Qiu2013). Darwin (Reference Darwin1871) proposed that this function of music probably arises because of its ability to tap into an innate, pre-existing ‘musical protolanguage’ system of the brain that arose in the course of primate evolution. As expanded upon by later researchers (Fitch, Reference Fitch2006; Masataka, Reference Masataka2009), this protolanguage in turn permitted communication of basic social information, for example establishing social structure or infant–mother relationships, even prior to the evolution of formal linguistic taxonomies. Musical protolanguage theory further proposes that language and music developed formally and tonally first (Jespersen, Reference Jespersen1922; Wray, Reference Wray1998; Arbib, Reference Arbib2005, Reference Arbib2008), with semantics (word meaning) coming later, suggesting that deficits in perception of language form (thought form) and protolanguage (music and emotion) perception would be inter-related.

Recently, it has been demonstrated that individuals can have isolated deficits in musical ability, a condition termed ‘congenital amusia’ (or ‘tone deafness’), and defined as an impairment in melody perception and production that cannot be explained by hearing loss, brain damage, intellectual deficiencies or lack of music exposure (Peretz, Reference Peretz and Deutsch2013). Amusia is commonly measured by the Montreal Battery for Assessment of Amusia (MBEA; Peretz et al. Reference Peretz, Champod and Hyde2003), a battery that assesses processing of complex musical phrases. The MBEA assesses perceptual musical disorders and amusia, and is operationalized both narrowly, by having melodic impairments only (Thompson et al. Reference Thompson, Marin and Stewart2012; Peretz, Reference Peretz and Deutsch2013), and broadly/globally, also incorporating deficits in rhythm and working memory (Peretz et al. Reference Peretz, Champod and Hyde2003). We are aware of one previous report (Hatada et al. Reference Hatada, Sawada, Akamatsu, Doi, Minese, Yamashita, Thornton, Honer and Inoue2013) of the MBEA in schizophrenia, finding a significant deficit in Japanese-speaking patients, highlighted by a 62% rate of amusia.

Although impairments in thought form, such as derailment, tangentiality and incoherence, illogicality and circumstantiality (Rodriguez-Ferrera et al. Reference Rodriguez-Ferrera, McCarthy and McKenna2001; Sans-Sansa et al. Reference Sans-Sansa, McKenna, Canales-Rodriguez, Ortiz-Gil, Lopez-Araquistain, Sarro, Duenas, Blanch, Salvador and Pomarol-Clotet2013), have been documented in schizophrenia since at least the time of Bleuler (Reference Bleuler1950), impairments in auditory emotion recognition (AER) and broader aspects of protolanguage perception have only been appreciated more recently (Ross et al. Reference Ross, Orbelo, Cartwright, Hansel, Burgard, Testa and Buck2001). Moreover, prominent sensory contributions to these deficits have been increasingly recognized over the past few years (Leitman et al. Reference Leitman, Laukka, Juslin, Saccente, Butler and Javitt2010; Gold et al. Reference Gold, Butler, Revheim, Leitman, Hansen, Gur, Kantrowitz, Laukka, Juslin, Silipo and Javitt2012; Kantrowitz et al. Reference Kantrowitz, Leitman, Lehrfeld, Laukka, Juslin, Butler, Silipo and Javitt2013, Reference Kantrowitz, Hoptman, Leitman, Silipo and Javitt2014).

One of the best-established and most pernicious aspects of schizophrenia is an impairment of social cognition (Green & Leitman, Reference Green and Leitman2008), including the ability to detect emotion from vocal information (Leentjens et al. Reference Leentjens, Wielaert, van Harskamp and Wilmink1998; Edwards et al. Reference Edwards, Jackson and Pattison2002; Leitman et al. Reference Leitman, Foxe, Butler, Saperstein, Revheim and Javitt2005, Reference Leitman, Laukka, Juslin, Saccente, Butler and Javitt2010; Bozikas et al. Reference Bozikas, Kosmidis, Anezoulaki, Giannakou, Andreou and Karavatos2006; Hoekert et al. Reference Hoekert, Kahn, Pijnenborg and Aleman2007; Bach et al. Reference Bach, Buxtorf, Grandjean and Strik2009a ,Reference Bach, Herdener, Grandjean, Sander, Seifritz and Strik b ; Gold et al. Reference Gold, Butler, Revheim, Leitman, Hansen, Gur, Kantrowitz, Laukka, Juslin, Silipo and Javitt2012; Yang et al. Reference Yang, Chen, Chen, Khan, Forchelli and Javitt2012; Pinheiro et al. Reference Pinheiro, Del Re, Mezin, Nestor, Rauber, McCarley, Goncalves and Niznikiewicz2013). For example, it has been proposed that deficits in social cognition mediate effects of neurocognitive impairments on ability to engage in competitive employment (Combs et al. Reference Combs, Waguspack, Chapman, Basso and Penn2011; Luedtke et al. Reference Luedtke, Kukla, Renard, Dimaggio, Buck and Lysaker2012).

Over the past several years, the role of auditory dysfunction as a contributor to social cognitive impairment has become increasingly appreciated. Schizophrenia patients show impairments not only in tone matching (Strous et al. Reference Strous, Cowan, Ritter and Javitt1995) but also in generation of early auditory event-related potentials (ERPs) such as mismatch negativity (MMN) (Umbricht & Krljes, Reference Umbricht and Krljes2005; Light et al. Reference Light, Swerdlow and Braff2007; Wynn et al. Reference Wynn, Sugar, Horan, Kern and Green2010; Friedman et al. Reference Friedman, Sehatpour, Dias, Perrin and Javitt2012). Deficits in both tone matching (Gold et al. Reference Gold, Butler, Revheim, Leitman, Hansen, Gur, Kantrowitz, Laukka, Juslin, Silipo and Javitt2012) and MMN (Jahshan et al. Reference Jahshan, Wynn and Green2013) contribute strongly to impaired AER ability, consistent with sensory-based contributions to impaired AER.

Given the hypothesized relationship between the perception of complex musical phrases, language form and AER, the MBEA may also be useful for investigations of protolinguistic competence. For example, one recent study evaluated individuals with congenital amusia who were otherwise without psychiatric or neurological disorder (Thompson et al. Reference Thompson, Marin and Stewart2012). In that study, amusic individuals showed highly significant AER deficits, despite having otherwise intact neurocognitive function.

The concept of amusia, as assessed by the MBEA, goes beyond simple tone matching by using complex musical phrases that we hypothesize are better matches of complex emotional speech than simple tone matching. To assess this hypothesis, we have incorporated measures of both tone matching and general neurocognitive impairment, along with the MBEA, to evaluate relative contribution of these measures to the well-described impairments of AER in schizophrenia.

Musical protolanguage theory also predicts shared processing of music and language form, as supported by recent neuroimaging studies (Sammler et al. Reference Sammler, Koelsch, Ball, Brandt, Elger, Friederici, Grigutsch, Huppertz, Knosche, Wellmer, Widman and Schulze-Bonhage2009, Reference Sammler, Koelsch, Ball, Brandt, Grigutsch, Huppertz, Knosche, Wellmer, Widman, Elger, Friederici and Schulze-Bonhage2013; Asaridou & McQueen, Reference Asaridou and McQueen2013). We used the Disorganization (Cognition) factor (Lindenmayer et al. Reference Lindenmayer, Bernstein-Hyman and Grochowski1994) of the Positive and Negative Syndrome Scale (PANSS; Kay et al. Reference Kay, Fiszbein and Opler1987) to obtain an operationalized measure of formal language dysfunction. This factor was chosen because of its inclusion of the language-specific items conceptual disorganization (P2) and abstract thinking (N5). We predicted that correlations between the MBEA and language would be independent of control for cognitive ability.

Finally, we have previously demonstrated (Leitman et al. Reference Leitman, Laukka, Juslin, Saccente, Butler and Javitt2010) that, when identifying emotion based upon tone of voice, patients tend to misidentify not only the intended emotion but also the intended strength of characterization, which we term auditory emotion intensity recognition (AEIR). In particular, patients differentiate less between weak and strong portrayals of emotion, primarily because of overestimation of the intended strength of weakly portrayed emotions. As with impairment in emotion recognition itself, AEIR deficits correlate with impaired early auditory processing ability, although its relationship with musical processing has not been evaluated. Therefore, as with AER, the present study evaluated substrates of impaired AEIR in schizophrenia.

Method

Subjects

Subjects consisted of 31 medicated patients recruited from long-term in-patient (n = 12) and supervised residential sites (n = 19) and 44 healthy volunteers, of whom 16 performed the MBEA only. All subjects gave their signed informed consent to participate in the study, following a full description of the study procedures.

Patients met DSM-IV-TR criteria for a diagnosis of schizophrenia (n = 23) or schizo-affective disorder (n = 8), with no significant between-diagnosis differences on auditory tasks (p = 0.8). We excluded controls with a history of an Axis I psychiatric disorder, as defined by the SCID (First et al. Reference First, Spitzer, Gibbon and Williams1997). Patients and controls were excluded if they had any neurological or auditory disorders noted on their medical history or in prior records, or for alcohol or substance dependence within the past 6 months and/or abuse within the past month (First et al. Reference First, Spitzer, Gibbon and Williams1997).

In addition, a subsample of subjects completed the AER task, the PANSS (Kay et al. Reference Kay, Fiszbein and Opler1987; Lindenmayer et al. Reference Lindenmayer, Bernstein-Hyman and Grochowski1994), with the Disorganization (Cognition) factor used as a proxy for language ability. The Measurement and Treatment Research to Improve Cognition in Schizophrenia (MATRICS) Consensus Cognitive Battery (MCCB; Nuechterlein & Green, Reference Nuechterlein and Green2006) was used to assess general cognition. The Social Cognition domain was not included. Data in the text are given as means ± standard deviation (s.d.). Patients were receiving a mean antipsychotic dose of 774 ± 565 chlorpromazine equivalents (Woods, Reference Woods2003). Demographics, behavioral ratings and scores for individual tasks are presented in Table 1.

Table 1. Study demographics

SES, Socio-economic status (measured by the four-factor Hollingshead scale); MCCB, MATRICS Consensus Cognitive Battery; PANSS, Positive and Negative Syndrome Scale (Lindenmayer et al. Reference Lindenmayer, Bernstein-Hyman and Grochowski1994).

Values given as mean ± standard deviation (with number in parentheses).

a p < 0.05 on Mann–Whitney for categorical values and independent-sample t tests for continuous values.

Auditory tasks

All auditory tasks were presented on a CD player at a sound level that was comfortable for each listener in a sound-attenuated room. Subjects worked 1:1 with an experienced tester, and were encouraged to take breaks when needed.

MBEA

The MBEA is described fully in Peretz et al. (Reference Peretz, Champod and Hyde2003). In brief, it contains six subtests across the three domains: Melody (three tests: scale, contour, interval) and Rhythm (two tests: rhythm, meter) organization and Memory (one test: incidental music memory recognition). Two to four examples were provided preceding each task to ensure that the subject clearly understood the task instructions (see online Supplementary Material). These examples were repeated if necessary.

All six tests use the same pool of 30 novel musical phrases that were composed according to the rules of the Western tonal system, written with sufficient complexity to guarantee its processing as a meaningful structure rather than as a simple sequence of tones. The selections last a mean of 5.1 s in all but the metric test, in which the stimuli lasted twice as long (mean = 11 s).

Melody and rhythm tests require subjects to perform a same–different classification task in which pairs are either the same or have one note that differs in pitch or duration. The memory task is an incidental-memory recognition task in which subjects are asked whether they have previously heard the selection during the task. Amusia was operationalized using published norms (Peretz, Reference Peretz and Deutsch2013) by a < 73.3% on the scale subtest.

AER and AEIR

AER was assessed using a new battery of 248 stimuli (Laukka & Elfenbein, Reference Laukka and Elfenbein2012). To refine the task for future use, analysis focused on the subset (n = 137) of items in which both groups scored above 35% on emotional identification [1.5 s.d. above chance performance (20%)]. Unlike the previous battery, which used actors who spoke with British-accented English (Gold et al. Reference Gold, Butler, Revheim, Leitman, Hansen, Gur, Kantrowitz, Laukka, Juslin, Silipo and Javitt2012), stimuli in the new battery were spoken by native American-English speakers and were scored based upon the speaker's intended emotion (happy, sad, angry, fear or neutral) and intensity level (high, medium and low). The verbal material consisted of short emotionally neutral phrases (i.e. ‘Let me tell you something’).

AEIR was rated on a scale from 1 (very low intensity) to 10 (very high intensity) for weak, medium and strong emotional portrayals independently. AEIR discrimination was defined as the mean rating for high-intensity items minus the mean rating for low-intensity items.

Tone-matching task

Pitch processing was assessed using a simple tone-matching task (Leitman et al. Reference Leitman, Laukka, Juslin, Saccente, Butler and Javitt2010). This task consists of pairs of 100-ms tones in series, with 500-ms intertone intervals. Within each pair, tones are either identical or differed in frequency (Hz) by specified amounts in each block (2.5, 5, 10, 20 or 50%). In each block, 12 of the tones are identical and 14 are dissimilar. Tones are derived from three reference frequencies (500, 1000 and 2000 Hz) to avoid learning effects. In all, the test consisted of five blocks of 26 pairs of tones.

Statistical analyses

Demographics for the two groups were compared by a Mann–Whitney test or Fisher's exact test for categorical values and by independent-sample t tests for continuous values. Between-group effects were assessed using a multivariate analysis of variance (MANOVA) or a repeated-measure ANOVA (rmANOVA), with stimulus type as a within-subject factor and diagnostic group as a between-subject factor, and follow-up univariate ANOVAs or independent-sample t tests were conducted.

Relationships among measures were determined by Pearson correlations and multivariate linear regression. Within the linear regression, partial correlations were used to assess significance of association. Measures of sensitivity (d′) and bias (criterion c) (Stanislaw & Todorov, Reference Stanislaw and Todorov1999) were calculated for same–different trials of the MBEA. Two-tailed statistics were used throughout with preset α level of significance of p < 0.05.

Results

Between-group analysis

MBEA

Patients showed a highly significant reduction in MBEA total score relative to all controls (F 1,77 = 49.5, p < 0.001), which remained highly significant after controlling for years of education, gender and age (F 1,70 = 12.6, p = 0.001). Equivalent deficits were seen across the individual melody, rhythm and memory domains (Fig. 1 a), corresponding to a large effect-size impairment (d = 1.30).

Fig. 1. (a) Bar graph (mean ± s.e.m.) of overall percentage correct scores for Montreal Battery for Evaluation of Amusia (MBEA) domains and auditory emotion recognition (AER). (b) Scatter plot of MBEA versus AER. Correlations remained significant after controlling for group status (partial r = 0.37, p = 0.004). *** p < 0.001 on independent-samples t test.

Given this large difference, patterns of impairment were further analyzed by comparing accuracy on ‘same’ and ‘different’ trials. The groups had relatively high rates of correct response on ‘same’ trials (controls: 83.9 ± 11.1%; patients: 80.3 ± 15.0%) but significantly lower accuracy in ‘different’ trials (controls: 80.4 ± 13.5%; patients: 58.3 ± 20.6), yielding a highly significant group × error type interaction (F 1,72 = 34.8, p < 0.001). This pattern suggests that, like controls, patients adopted a strategy of saying ‘same’ unless they accumulated evidence of a difference. This was confirmed by signal detection analyses showing highly significant between-group differences in sensitivity (controls: d′ = 2.9 ± 0.7; patients: d′ = 1.9 ± 0.7; t 72 = 5.3, p < 0.0001) and bias (t 72 = 2.9, p = 0.004), also yielding a highly significant group × signal interaction (F 1,72 = 29.8, p < 0.001), suggesting that patients were actively engaged in the task but mainly had reduced sensitivity to between-stimulus differences in the dimensions tested by the MBEA.

MBEA and AER

As in previous studies, patients showed highly significant impairment in detecting intended emotion (F 1,60 = 27.1, p < 0.001; Fig. 1 a), with no significant group × emotion interaction (F 4,53 = 0.7, p = 0.58). Patients also showed significant deficits in neurocognitive function, as reflected in the MCCB composite score (F 1,58 = 56.6, p < 0.001), and in early auditory function, as reflected in tone-matching (F 1,60 = 18.9, p < 0.001) (Table 1).

As predicted, a highly significant correlation was observed between AER and MBEA total scores (r = 0.56, n = 58, p < 0.001; Fig. 1 b). Significant independent correlations were also observed between the melody (r = 0.55, n = 58, p < 0.001), rhythm (r = 0.48, n = 58, p < 0.001) and memory (r = 0.44, n = 58, p = 0.001) domains and AER.

Correlations with total scores (partial r = 0.37, p = 0.004) and melody (partial r = 0.37, p = 0.005), but not rhythm (partial r = 0.06, p = 0.67), remained significant after control for group status. Similarly, correlations remained significant after covariation for MCCB (total–partial r = 0.31, p = 0.023; melody–partial r = 0.32, p = 0.019) or tone-matching ability (total–partial r = 0.32, p = 0.018; melody–partial r = 0.31, p = 0.022), suggesting that correlations were also independent of general cognitive and sensory processing deficits. Moreover, when group status, total MBEA and MCCB composite scores were entered simultaneously into a regression versus AER, a significant correlation was observed with MBEA (partial r = 0.28, p = 0.046) but not MCCB (partial r = 0.20, p = 0.16).

AEIR

Because the battery included weak, medium and strong portrayals of emotion, it was also possible to evaluate the degree to which patients experienced and differentiated emotion intensity. Patients' overall ratings for stimulus intensity were similar to those of controls across all levels (F 1,56 = 1.9, p = 0.17). Nevertheless, patients differentiated between weak and strong portrayals to a significantly smaller degree than controls (Fig. 2 a), leading to a highly significant group × portrayal strength interaction (F 2,55 = 15.8, p < 0.0001).

Fig. 2. Bar graph (mean ± s.e.m.) of (a) auditory emotion intensity recognition (AEIR) ratings for high and low intended intensity items and (b) AEIR discrimination (mean rating for high-intensity items minus mean rating for low-intensity items). A highly significant group × portrayal strength interaction (F 2,63 = 19.8, p < 0.0001) and a between-group difference in discrimination (t64 = 4.7, p < 0.001) were seen. (c) Scatter plot of Montreal Battery for Evaluation of Amusia (MBEA) versus AEIR discrimination. Black shading indicates amusia. Correlations remained significant after controlling for group status (partial r = 0.30, p = 0.023) *** p < 0.001 on independent-samples t test.

AEIR discrimination scores, defined as the difference between ratings for strong and weak portrayal by subject, differed significantly between groups (Fig. 2 b; F 1,56 = 17.7, p < 0.001). This difference remained significant when controlling for correct response (F 2,55 = 10.9, p < 0.001), further demonstrating reduced ability to discriminate intended emotional strength in patients.

As with AER, impaired AEIR discrimination correlated significantly with MBEA total score (r = 0.49, n = 58, p < 0.001; Fig. 2 c). The correlation also remained significant after control for group status (partial r = 0.31, p = 0.023). However, unlike AER, AEIR correlated primarily with scores on the rhythm domain of the MBEA (partial r = 0.46, p < 0.001) rather than on the melody domain (partial r = 0.12, p = 0.39) in a stepwise regression. Correlations with the rhythm domain remained significant after controlling for group status (partial r = 0.28, p = 0.037).

Protolanguage

To assess the hypothesis that language form and protolanguage were related over and above general cognition, we performed a stepwise, multivariate regression with the PANSS disorganization factor as the dependent variable, MCCB entered in the first step and MBEA domains in the second step. The melody subscale (partial r = −0.38, p = 0.044) correlated significantly with the disorganization factor, with poor melody recognition scores predicting increased disorganization. By contrast, MCCB scores (overall cognition) correlated only at trend level (partial r = −0.34, p = 0.073), as did musical memory (p = 0.09).

To further test the relationship between language form and protolanguage, we extracted the language-specific items of the PANSS disorganization factor (P2 conceptual disorganization; N5 abstract thinking) to create more a specific language/thought disorder PANSS subscale. When the analysis was repeated using this ‘language’ subscale, melody remained a significant predictor (partial r = −0.43, p = 0.025) whereas the MCCB (partial r = −0.16, p = 0.41) was not predictive (Fig. 3 a). By contrast, neither positive nor negative symptoms correlated significantly with any of the MBEA domains. Correlations between the PANSS language subscale and the melody domain remained significant even using non-parametric (Spearman) correlations (r s = −0.45, n = 29, p = 0.014), suggesting that the relationship was not due to outliers.

Fig. 3. Scatter plot of (a) Melody domain versus exploratory Positive and Negative Syndrome Scale (PANSS) language subscale (P2 conceptual disorganization and N5 abstract thinking). (b) Bar graph (mean ± s.e.m.) for auditory emotion recognition (AER) divided by group and amusia classification. *** p < 0.0001, which remained significant after controlling for group status and the MATRICS Consensus Cognitive Battery (MCCB).

Groups by amusia

Based on published cut-offs (Peretz, Reference Peretz and Deutsch2013), 45% of patients and 9% of controls met criteria for amusia. As predicted, even after controlling for education, significant main effects of AER ability were seen for both amusia (F 1,49 = 7.2, p = 0.01) and group (F 1,49 = 7.4, p = 0.009; Fig. 2 b).

Memory domain

Although patients showed reduced musical memory ability versus controls (Fig. 1 a), correlations with AER (partial r = 0.11, p = 0.43) did not remain significant after controlling for group status and the melody domain. Similarly, correlations with AEIR (partial r = 0.35, p = 0.057) did not remain significant after controlling for group status and the rhythm domain. By contrast, impaired memory on the MBEA correlated significantly with overall cognitive ability across groups (r = 0.51, p < 0.001). The correlation, moreover, remained highly significant even after controlling for group status in a stepwise regression (partial r = 0.38, p = 0.003).

Discussion

Although Bleuler (Reference Bleuler1950) considered sensory processing as an ‘intact simple function’, increasing evidence suggests that sensory processing in schizophrenia is neither simple nor intact. The present study used a test battery, the MBEA, developed specifically for the assessment of musical perceptual deficits, and therefore also useful to probe the underlying musical protolanguage that is known to convey social emotion during human interaction.

As predicted, patients showed deficits across MBEA domains, and these deficits correlated significantly with impaired AER ability. In addition, patients showed significant difference in the ability to discriminate between intended strength of emotion portrayal, with a tendency to overestimate intended strength of weak emotions. Finally, as predicted, patients showed significant correlation between impairments in melodic processing and impairments in clinician-rated spontaneous language generation, as assessed using the PANSS. These correlations remained significant even after controlling for contributions of general cognitive ability using the MCCB, suggesting a significant association between prosodic and structural aspects of language generation in schizophrenia.

Impaired AER function in schizophrenia patients was most highly correlated with impairments in the processing of melody, similar to recent results in individuals with amusia (Thompson et al. Reference Thompson, Marin and Stewart2012). Furthermore, consistent with these recent findings, we found specific contributions for both ‘amusia’ and patient status to impairments in AER function (Fig. 3 b), even after controlling for education. Nevertheless, a far higher percentage of schizophrenia patients than controls met criteria for amusia, and amusic controls were otherwise neurocognitively normal, suggesting that controls may be more able than schizophrenia patients to compensate for primary deficits in musical ability through the general use of cognitive, problem-solving abilities.

AEIR

Unlike AER correlations with melody impairment, impaired AEIR correlated more specifically with deficits in rhythm. Of note, others have also recently observed that aspects of rhythm are associated with the arousal level of musically expressed emotions (Laukka et al. Reference Laukka, Eerola, Thingujam, Yamasaki and Beller2013). As we have found previously (Leitman et al. Reference Leitman, Laukka, Juslin, Saccente, Butler and Javitt2010), patients tended to overestimate the intensity of the intended weak emotions (Fig. 2 b), leading to reduced differentiation of weak and strong portrayals. The majority of emotion batteries do not incorporate items with different intended strength of portrayal, so that the literature regarding substrates of emotion intensity discrimination has been less studied than those involving AER itself.

One of the primary ways in which individuals communicate calmness in social situations is by speaking softly and slowly, leading others to understand that a low level of emotionality is being communicated. The correlation between MBEA rhythm domain scores and AEIR discrimination suggests that, in schizophrenia patients, and also in other amusic individuals, such communication may be misperceived, leading to patients feeling that others are speaking with more intensity than is truly intended.

Protolanguage

We also assessed whether deficits in music discrimination would be correlated with other aspects of language processing. In general, thought disorder in schizophrenia (Barrera et al. Reference Barrera, McKenna and Berrios2005, Reference Barrera, McKenna and Berrios2009) is considered to reflect dysfunction within both frontal and temporal brain regions. Recently, however, imaging studies have suggested that fluent disorganization in patients is most related to left temporal lobe deficits involving Broca and Wernicke's regions, whereas poverty in content of speech is more related to frontal lobe changes and dysfunction (Sans-Sansa et al. Reference Sans-Sansa, McKenna, Canales-Rodriguez, Ortiz-Gil, Lopez-Araquistain, Sarro, Duenas, Blanch, Salvador and Pomarol-Clotet2013). Melody and rhythm, by contrast, are thought to be processed in the Wernicke homologue region of the right temporal cortex (Albouy et al. Reference Albouy, Mattout, Bouet, Maby, Sanchez, Aguera, Daligault, Delpuech, Bertrand, Caclin and Tillmann2013). The specific correlation we observed between fluent thought disorder and music perception (i.e. the PANSS language and melodic processing) is thus consistent with correlated bi-temporal involvement in schizophrenia. Given the exploratory nature of the extracted language subscale, this correlation should be verified using a validated scale of language form.

Prevalence of deficit

The rate of amusia is commonly cited as around 4% (Cuddy et al. Reference Cuddy, Balkwill, Peretz and Holden2005; Sloboda et al. Reference Sloboda, Wise and Peretz2005), although absolute rates of amusia differ based upon cut-off. For example, the online MBEA database (www.brams.umontreal.ca/plab/publications/article/57) reports an amusia rate of 5.2% using the same criteria we used in this report. Others have argued for d′-based scoring (Henry & McAuley, Reference Henry and McAuley2010). Regardless of definition used, however, schizophrenia patients showed significantly higher amusia rates than the matched controls, similar to previously published cohorts (Hatada et al. Reference Hatada, Sawada, Akamatsu, Doi, Minese, Yamashita, Thornton, Honer and Inoue2013).

Treatment implications

Music therapy was widely used in the 1950s and 1960s for treatment of schizophrenia (Skelly & Haslerud, Reference Skelly and Haslerud1952; Gillis et al. Reference Gillis, Lascelles and Crone1958; Weintraub, Reference Weintraub1961), but was largely abandoned with budget cuts and increased reliance on medication during the 1970s to 1980s. Despite the current disuse of music therapy in most clinical settings, more recent clinical trials (Bloch et al. Reference Bloch, Reshef, Vadas, Haliba, Ziv, Kremer and Haimov2010; Mossler et al. Reference Mossler, Chen, Heldal and Gold2011) continue to show benefit despite a lack of understanding of the mediating effects. Our results suggest that development of schizophrenia is associated with loss of musical ability in general, rather than just loss of specific sensory functions such as tone-matching ability or MMN generation, and that these deficits may contribute not only to AER impairments but also to disturbances in thought and communication.

Given the importance of musical function to perception of emotion, and of emotion perception to social outcome (Kee et al. Reference Kee, Green, Mintz and Brekke2003; Leitman et al. Reference Leitman, Foxe, Butler, Saperstein, Revheim and Javitt2005, Reference Leitman, Hoptman, Foxe, Saccente, Wylie, Nierenberg, Jalbrzikowski, Lim and Javitt2007; Grant & Beck, Reference Grant and Beck2010), our findings provide a potential mechanistic basis for reported efficacy of musical therapy approaches in schizophrenia. The present study suggests that impaired musical ability, over and above impairments in general cognitive ability, underlie impaired social interactional abilities in schizophrenia and thus may represent a crucial target for music-based remediation approaches (Herholz & Zatorre, Reference Herholz and Zatorre2012). Furthermore, the MBEA may be useful in schizophrenia both for identifying subjects who would be likely to benefit from intervention and as a way to monitor response. Compared with medication, music therapy is associated with limited side-effects, and potentially weight loss, and so should be reconsidered as a potential tool for remediation of AER/AEIR and social cognitive impairments in schizophrenia.

In conclusion, this study demonstrates the importance of protolanguage, as measured by music perception, along with language deficits in communicatory and social cognitive disturbances in both schizophrenia and congenital amusia. Furthermore, our results argue for greater screening of patients for amusia and, potentially, for the revival of the study of music therapy in schizophrenia, particularly the study of neuroscientifically music-based remediation approaches (Herholz & Zatorre, Reference Herholz and Zatorre2012) that have been shown to improve brain plasticity, including cognition, in older adults and mobility after stroke. Protolanguage dysfunction, including impairments in melody and rhythm perception, remains strongly significant even following covariation for general cognitive ability, and should therefore be considered a separate domain of neurocognitive impairment that is not captured within present cognitive assessment approaches for schizophrenia.

Supplementary material

For supplementary material accompanying this paper, please visit http://dx.doi.org/10.1017/S0033291714000373.

Acknowledgments

Preparation of this manuscript was supported in part by the National Center for Research Resources and the National Center for Advancing Translational Sciences, National Institutes of Health, through grant number UL1 RR024156 and the Dr Joseph E. and Lillian Pisetsky Young Investigator Award for Clinical Research in Serious Mental Illness to J.T.K., and R01 DA03383, P50 MH086385 and R37 MH49334 to D.C.J. We thank N. Gaine for administrative support.

Declaration of Interest

None.

References

Albouy, P, Mattout, J, Bouet, R, Maby, E, Sanchez, G, Aguera, PE, Daligault, S, Delpuech, C, Bertrand, O, Caclin, A, Tillmann, B (2013). Impaired pitch perception and memory in congenital amusia: the deficit starts in the auditory cortex. Brain 136, 16391661.Google Scholar
Arbib, MA (2005). From monkey-like action recognition to human language: an evolutionary framework for neurolinguistics. Behavioral and Brain Sciences 28, 105124; discussion 125–167.Google Scholar
Arbib, MA (2008). From grasp to language: embodied concepts and the challenge of abstraction. Journal of Physiology, Paris 102, 420.Google Scholar
Asaridou, SS, McQueen, JM (2013). Speech and music shape the listening brain: evidence for shared domain-general mechanisms. Frontiers in Psychology 4, 321.Google Scholar
Bach, DR, Buxtorf, K, Grandjean, D, Strik, WK (2009 a). The influence of emotion clarity on emotional prosody identification in paranoid schizophrenia. Psychological Medicine 39, 927938.CrossRefGoogle ScholarPubMed
Bach, DR, Herdener, M, Grandjean, D, Sander, D, Seifritz, E, Strik, WK (2009 b). Altered lateralisation of emotional prosody processing in schizophrenia. Schizophrenia Research 110, 180187.Google Scholar
Barrera, A, McKenna, PJ, Berrios, GE (2005). Formal thought disorder in schizophrenia: an executive or a semantic deficit? Psychological Medicine 35, 121132.CrossRefGoogle ScholarPubMed
Barrera, A, McKenna, PJ, Berrios, GE (2009). Formal thought disorder, neuropsychology and insight in schizophrenia. Psychopathology 42, 264269.Google Scholar
Bleuler, E (1950). Dementia Praecox or the Group of Schizophrenias. International Universities Press: New York, NY.Google Scholar
Bloch, B, Reshef, A, Vadas, L, Haliba, Y, Ziv, N, Kremer, I, Haimov, I (2010). The effects of music relaxation on sleep quality and emotional measures in people living with schizophrenia. Journal of Music Therapy 47, 2752.Google Scholar
Bozikas, VP, Kosmidis, MH, Anezoulaki, D, Giannakou, M, Andreou, C, Karavatos, A (2006). Impaired perception of affective prosody in schizophrenia. Journal of Neuropsychiatry and Clinical Neurosciences 18, 8185.Google Scholar
Combs, DR, Waguspack, J, Chapman, D, Basso, MR, Penn, DL (2011). An examination of social cognition, neurocognition, and symptoms as predictors of social functioning in schizophrenia. Schizophrenia Research 128, 177178.Google Scholar
Cuddy, LL, Balkwill, LL, Peretz, I, Holden, RR (2005). Musical difficulties are rare: a study of ‘tone deafness’ among university students. Annals of the New York Academy of Sciences 1060, 311324.Google Scholar
Darwin, C (1871). The Descent of Man, and Selection in Relation to Sex, 1st edn. John Murray: London.Google Scholar
Edwards, J, Jackson, HJ, Pattison, PE (2002). Emotion recognition via facial expression and affective prosody in schizophrenia: a methodological review. Clinical Psychology Review 22, 789832.CrossRefGoogle ScholarPubMed
Escoffier, N, Zhong, J, Schirmer, A, Qiu, A (2013). Emotional expressions in voice and music: same code, same effect? Human Brain Mapping 34, 17961810.CrossRefGoogle ScholarPubMed
First, MB, Spitzer, RL, Gibbon, M, Williams, J (1997). Structured Clinical Interview for DSM-IV Axis I Disorders (SCID-I). Biometrics Research Department , New York State Psychiatric Institute: New York.Google Scholar
Fitch, WT (2006). The biology and evolution of music: a comparative perspective. Cognition 100, 173215.Google Scholar
Friedman, T, Sehatpour, P, Dias, E, Perrin, M, Javitt, DC (2012). Differential relationships of mismatch negativity and visual P1 deficits to premorbid characteristics and functional outcome in schizophrenia. Biological Psychiatry 71, 521–509.CrossRefGoogle ScholarPubMed
Gillis, A, Lascelles, CF, Crone, N (1958). A comparison of rhythmic and non-rhythmic music in chronic schizophrenia. American Journal of Psychiatry 114, 11111112.Google Scholar
Gold, R, Butler, PD, Revheim, N, Leitman, DI, Hansen, JA, Gur, RC, Kantrowitz, JT, Laukka, P, Juslin, PN, Silipo, GS, Javitt, DC (2012). Auditory emotion recognition impairments in schizophrenia: relationship to acoustic features and cognition. American Journal of Psychiatry 169, 424432.Google Scholar
Grant, PM, Beck, AT (2010). Asocial beliefs as predictors of asocial behavior in schizophrenia. Psychiatry Research 177, 6570.Google Scholar
Green, M, Leitman, D (2008). Social cognition in schizophrenia. Schizophrenia Bulletin 34, 670672.Google Scholar
Hatada, S, Sawada, K, Akamatsu, M, Doi, E, Minese, M, Yamashita, M, Thornton, AE, Honer, WG, Inoue, S (2013). Impaired musical ability in people with schizophrenia. Journal of Psychiatry and Neuroscience 38, 120207.Google Scholar
Henry, MJ, McAuley, JD (2010). On the prevalance of congenital amusia. Music Perception 27, 413418.Google Scholar
Herholz, SC, Zatorre, RJ (2012). Musical training as a framework for brain plasticity: behavior, function, and structure. Neuron 76, 486502.Google Scholar
Hoekert, M, Kahn, RS, Pijnenborg, M, Aleman, A (2007). Impaired recognition and expression of emotional prosody in schizophrenia: review and meta-analysis. Schizophrenia Research 96, 135145.Google Scholar
Jahshan, C, Wynn, JK, Green, MF (2013). Relationship between auditory processing and affective prosody in schizophrenia. Schizophrenia Research 143, 348353.Google Scholar
Jespersen, O (1922). Language: Its Nature, Development and Origin. W. W. Norton & Co.: New York.Google Scholar
Juslin, P, Laukka, P (2003). Communication of emotions in vocal expression and music performance: different channels, same code? Psychological Bulletin 129, 770814.Google Scholar
Kantrowitz, JT, Hoptman, MJ, Leitman, DI, Silipo, G, Javitt, DC (2014). The 5% difference: early sensory processing predicts sarcasm perception in schizophrenia and schizo-affective disorder. Psychological Medicine 44, 2536.Google Scholar
Kantrowitz, JT, Leitman, DI, Lehrfeld, JM, Laukka, P, Juslin, PN, Butler, PD, Silipo, G, Javitt, DC (2013). Reduction in tonal discriminations predicts receptive emotion processing deficits in schizophrenia and schizoaffective disorder. Schizophrenia Bulletin 39, 8693.Google Scholar
Kay, SR, Fiszbein, A, Opler, LA (1987). The positive and negative syndrome scale (PANSS) for schizophrenia. Schizophrenia Bulletin 13, 261276.Google Scholar
Kee, KS, Green, MF, Mintz, J, Brekke, JC (2003). Is emotion processing a predictor of functional outcome in schizophrenia? Schizophrenia Bulletin 29, 487497.Google Scholar
Laukka, P, Eerola, T, Thingujam, NS, Yamasaki, T, Beller, G (2013). Universal and culture-specific factors in the recognition and performance of musical affect expressions. Emotion 13, 434449.Google Scholar
Laukka, P, Elfenbein, HA (2012). Emotion appraisals can be inferred from vocal expressions. Social Psychological and Personality Science 3, 529536.Google Scholar
Leentjens, AF, Wielaert, SM, van Harskamp, F, Wilmink, FW (1998). Disturbances of affective prosody in patients with schizophrenia: a cross sectional study. Journal of Neurology, Neurosurgery, and Psychiatry 64, 375378.Google Scholar
Leitman, DI, Foxe, JJ, Butler, PD, Saperstein, A, Revheim, N, Javitt, DC (2005). Sensory contributions to impaired prosodic processing in schizophrenia. Biological Psychiatry 58, 5661.Google Scholar
Leitman, DI, Hoptman, MJ, Foxe, JJ, Saccente, E, Wylie, GR, Nierenberg, J, Jalbrzikowski, M, Lim, KO, Javitt, DC (2007). The neural substrates of impaired prosodic detection in schizophrenia and its sensorial antecedents. American Journal of Psychiatry 164, 474482.Google Scholar
Leitman, DI, Laukka, P, Juslin, PN, Saccente, E, Butler, P, Javitt, DC (2010). Getting the cue: sensory contributions to auditory emotion recognition impairments in schizophrenia. Schizophrenia Bulletin 36, 545556.Google Scholar
Light, G, Swerdlow, N, Braff, D (2007). Preattentive sensory processing as indexed by the MMN and P3a brain responses is associated with cognitive and psychosocial functioning in healthy adults. Journal of Cognitive Neuroscience 19, 16241632.Google Scholar
Lindenmayer, JP, Bernstein-Hyman, R, Grochowski, S (1994). Five-factor model of schizophrenia: initial validation. Journal of Nervous and Mental Disease 182, 631638.Google Scholar
Luedtke, BL, Kukla, M, Renard, S, Dimaggio, G, Buck, KD, Lysaker, PH (2012). Metacognitive functioning and social cognition as predictors of accuracy of self-appraisals of vocational function in schizophrenia. Schizophrenia Research 137, 260261.Google Scholar
Masataka, N (2009). The origins of language and the evolution of music: a comparative perspective. Physics of Life Reviews 6, 1122.Google Scholar
Mossler, K, Chen, X, Heldal, TO, Gold, C (2011). Music therapy for people with schizophrenia and schizophrenia-like disorders. Cochrane Database of Systematic Reviews. Issue 12, Art. No. CD004025.Google Scholar
Nuechterlein, KH, Green, MF (2006). MATRICS Consensus Cognitive Battery. MATRICS Assessment, Inc.: Los Angeles, CA.Google Scholar
Peretz, I (2013). The biological foundations of music: insights from congenital amusia. In The Psychology of Music, 3rd edn (ed. Deutsch, D.), pp. 551564. Academic Press: London, UK.Google Scholar
Peretz, I, Champod, AS, Hyde, K (2003). Varieties of musical disorders. The Montreal Battery of Evaluation of Amusia. Annals of the New York Academy of Sciences 999, 5875.Google Scholar
Pinheiro, AP, Del Re, E, Mezin, J, Nestor, PG, Rauber, A, McCarley, RW, Goncalves, OF, Niznikiewicz, MA (2013). Sensory-based and higher-order operations contribute to abnormal emotional prosody processing in schizophrenia: an electrophysiological investigation. Psychological Medicine 43, 603618.Google Scholar
Rodriguez-Ferrera, S, McCarthy, RA, McKenna, PJ (2001). Language in schizophrenia and its relationship to formal thought disorder. Psychological Medicine 31, 197205.Google Scholar
Ross, ED, Orbelo, DM, Cartwright, J, Hansel, S, Burgard, M, Testa, JA, Buck, R (2001). Affective-prosodic deficits in schizophrenia: comparison to patients with brain damage and relation to schizophrenic symptoms [corrected]. Journal of Neurology, Neurosurgery, and Psychiatry 70, 597604.Google Scholar
Sammler, D, Koelsch, S, Ball, T, Brandt, A, Elger, CE, Friederici, AD, Grigutsch, M, Huppertz, HJ, Knosche, TR, Wellmer, J, Widman, G, Schulze-Bonhage, A (2009). Overlap of musical and linguistic syntax processing: intracranial ERP evidence. Annals of the New York Academy of Sciences 1169, 494498.Google Scholar
Sammler, D, Koelsch, S, Ball, T, Brandt, A, Grigutsch, M, Huppertz, HJ, Knosche, TR, Wellmer, J, Widman, G, Elger, CE, Friederici, AD, Schulze-Bonhage, A (2013). Co-localizing linguistic and musical syntax with intracranial EEG. NeuroImage 64, 134146.CrossRefGoogle ScholarPubMed
Sans-Sansa, B, McKenna, PJ, Canales-Rodriguez, EJ, Ortiz-Gil, J, Lopez-Araquistain, L, Sarro, S, Duenas, RM, Blanch, J, Salvador, R, Pomarol-Clotet, E (2013). Association of formal thought disorder in schizophrenia with structural brain abnormalities in language-related cortical regions. Schizophrenia Research 146, 308313.Google Scholar
Skelly, CG, Haslerud, GM (1952). Music and the general activity of apathetic schizophrenics. Journal of Abnormal Psychology 47, 188192.Google Scholar
Sloboda, JA, Wise, KJ, Peretz, I (2005). Quantifying tone deafness in the general population. Annals of the New York Academy of Sciences 1060, 255261.Google Scholar
Stanislaw, H, Todorov, N (1999). Calculation of signal detection theory measures. Behavior Research Methods, Instruments, and Computers 31, 137149.Google Scholar
Strous, RD, Cowan, N, Ritter, W, Javitt, DC (1995). Auditory sensory (‘echoic’) memory dysfunction in schizophrenia. American Journal of Psychiatry 152, 15171519.Google Scholar
Thompson, WF, Marin, MM, Stewart, L (2012). Reduced sensitivity to emotional prosody in congenital amusia rekindles the musical protolanguage hypothesis. Proceedings of the National Academy of Sciences USA 109, 1902719032.Google Scholar
Umbricht, D, Krljes, S (2005). Mismatch negativity in schizophrenia: a meta-analysis. Schizophrenia Research 76, 123.Google Scholar
Weintraub, IG (1961). Emotional responses of schizophrenics to selected musical compositions. Delaware Medical Journal 33, 186187.Google Scholar
Woods, SW (2003). Chlorpromazine equivalent doses for the newer atypical antipsychotics. Journal of Clinical Psychiatry 64, 663667.Google Scholar
Wray, A (1998). Protolanguage as a holistic system for social interaction. Language and Communication 18, 4767.Google Scholar
Wynn, JK, Sugar, C, Horan, WP, Kern, R, Green, MF (2010). Mismatch negativity, social cognition, and functioning in schizophrenia patients. Biological Psychiatry 67, 940947.Google Scholar
Yang, L, Chen, S, Chen, CM, Khan, F, Forchelli, G, Javitt, DC (2012). Schizophrenia, culture and neuropsychology: sensory deficits, language impairments and social functioning in Chinese-speaking schizophrenia patients. Psychological Medicine 42, 14851494.Google Scholar
Figure 0

Table 1. Study demographics

Figure 1

Fig. 1. (a) Bar graph (mean ± s.e.m.) of overall percentage correct scores for Montreal Battery for Evaluation of Amusia (MBEA) domains and auditory emotion recognition (AER). (b) Scatter plot of MBEA versus AER. Correlations remained significant after controlling for group status (partial r = 0.37, p = 0.004). *** p < 0.001 on independent-samples t test.

Figure 2

Fig. 2. Bar graph (mean ± s.e.m.) of (a) auditory emotion intensity recognition (AEIR) ratings for high and low intended intensity items and (b) AEIR discrimination (mean rating for high-intensity items minus mean rating for low-intensity items). A highly significant group × portrayal strength interaction (F2,63 = 19.8, p < 0.0001) and a between-group difference in discrimination (t64 = 4.7, p < 0.001) were seen. (c) Scatter plot of Montreal Battery for Evaluation of Amusia (MBEA) versus AEIR discrimination. Black shading indicates amusia. Correlations remained significant after controlling for group status (partial r = 0.30, p = 0.023) *** p < 0.001 on independent-samples t test.

Figure 3

Fig. 3. Scatter plot of (a) Melody domain versus exploratory Positive and Negative Syndrome Scale (PANSS) language subscale (P2 conceptual disorganization and N5 abstract thinking). (b) Bar graph (mean ± s.e.m.) for auditory emotion recognition (AER) divided by group and amusia classification. *** p < 0.0001, which remained significant after controlling for group status and the MATRICS Consensus Cognitive Battery (MCCB).

Supplementary material: File

Kantrowitz supplementary material

Kantrowitz supplementary material

Download Kantrowitz supplementary material(File)
File 25.1 KB