The evolutionary benefit of less-credible affective musical signals for emotion induction during storytelling

Caitlyn Trevor; Sascha Frühholz

doi:10.1017/S0140525X20001004

The evolutionary benefit of less-credible affective musical signals for emotion induction during storytelling

Published online by Cambridge University Press: 30 September 2021

Caitlyn Trevor

and

Sascha Frühholz

Show author details

Caitlyn Trevor: Affiliation:
Department of Psychology, University of Zurich, Cognitive and Affective Neuroscience Unit, Binzmuehlestrasse 14, 8050Zurich, Switzerlandcaitlyn.trevor@psychologie.uzh.ch
Sascha Frühholz: Affiliation:
Department of Psychology, University of Zurich, Cognitive and Affective Neuroscience Unit, Binzmuehlestrasse 14, 8050Zurich, Switzerlandcaitlyn.trevor@psychologie.uzh.ch Department of Psychology, University of Oslo, Blindern, 0317Oslo, Norwaysascha.fruehholz@uzh.chhttps://www.psychology.uzh.ch/en/areas/nec/kaneuro.html

Article contents

Abstract
References

Rights & Permissions

Abstract

The credible signaling theory underexplains the evolutionary added value of less-credible affective musical signals compared to vocal signals. The theory might be extended to account for the motivation for, and consequences of, culturally decontextualizing a biologically contextualized signal. Musical signals are twofold, communicating “emotional fiction” alongside biological meaning, and could have filled an adaptive need for affect induction during storytelling.

Type: Open Peer Commentary
Information: Behavioral and Brain Sciences , Volume 44 , 2021 , e118

DOI: https://doi.org/10.1017/S0140525X20001004 [Opens in a new window]
Copyright: Copyright © The Author(s), 2021. Published by Cambridge University Press

Although we generally agree with the credible signaling hypothesis and provide evidence for credible signaling in contemporary music, an important issue remains unaddressed by the theory. Recent research suggests that contemporary credible musical signals are less emotionally impactful than their vocal counterparts. Musical signals take far more time and energy to manufacture compared to vocal ones. The theory falls short of explaining the evolutionary added value of these more taxing and less affective musical signals. The credibility hypothesis should be extended to account for this counterintuitive observation by including a component regarding the motivation for, and consequences of, culturally decontextualizing a biologically contextualized signal (Frühholz, Trost, & Kotz, Reference Frühholz, Trost and Kotz2016). Specifically, we hypothesize that these affectively weaker musical signals communicate “emotional fiction” alongside their biological meanings and may have been motivated by the adaptive need for emotionally impactful storytelling.

Although we agree with the authors' claim that today's actual domain of music is far removed from its proper domain, recent findings of credible signals in contemporary music show that some ancient, vocal-inspired signals have resiliently persisted throughout the diverse cultural metamorphoses that music has undergone over centuries across the world. For example, one contemporary credible signal feature in music to convey affective meaning is roughness, a harsh, buzzing, raspy sound quality (Vassilakis & Kendall, Reference Vassilakis and Kendall2010). Roughness has a long evolutionary trajectory in human and animal alarm calls (Arnal, Flinker, Kleinschmidt, Giraud, & Poeppel, Reference Arnal, Flinker, Kleinschmidt, Giraud and Poeppel2015; Engelberg & Gouzoules, Reference Engelberg and Gouzoules2019; Schwartz, Engelberg, & Gouzoules, Reference Schwartz, Engelberg and Gouzoules2019) and has been found to be present in terrifying excerpts from horror film music (Trevor, Arnal, & Frühholz, Reference Trevor, Arnal and Frühholz2020). Another contemporary credible signal in music is the sigh, a vocal signal generated by both humans and animals that typically expresses sadness or frustration (Li & Yackle, Reference Li and Yackle2017; Teigen, Reference Teigen2008). In music, sighs are mimicked by a falling narrow melodic motion with a decreasing loudness, a standard device in Western classical music used to signal grief to the listener (Monelle, Reference Monelle2000). Music has also been found to imitate the staccato acoustic profile of laughter, a credible signal found in both humans and many animal species (Bryant, Reference Bryant, Floyd and Weber2020), when communicating humor (Trevor & Huron, Reference Trevor and Huron2018). These instances of credible signals in contemporary music are indicative of the continued presence of biologically rooted credible signals in music today, extending the reach of Mehr and colleagues' theory to present day music.

Although such mimicry of vocal signals exists as predicted by the credible signaling theory, many cross-comparisons between music and voices have shown that affective meaning is signaled and perceived more poorly in music than in voices (Frühholz, Trost, & Grandjean, Reference Frühholz, Trost and Grandjean2014; Juslin & Laukka, Reference Juslin and Laukka2003; Paquette, Takerkart, Saget, Peretz, & Belin, Reference Paquette, Takerkart, Saget, Peretz and Belin2018; Scherer, Reference Scherer1995). For example, Paquette et al. (Reference Paquette, Takerkart, Saget, Peretz and Belin2018) report overall lower recognition accuracies for fearful, sad, happy, and neutral emotions expressed in music compared to voices. Furthermore, one of our recent studies showed that vocal screams are perceived as significantly more intense and emotionally negative than horror film music excerpts that mimic human screams even though both use the credible signal roughness (Trevor et al., Reference Trevor, Arnal and Frühholz2020). Affective meaning seems thus less well signaled and recognized in music compared to voices, a difference that is not accounted for in the credibility hypothesis and therefore could be a downside to this theory.

To address these perceptual differences, we propose that the credibility hypothesis could be extended to include a component regarding culturally de-contextualized biological signals. A similar functional de-contextualization component has been described for the evolution of human reasoning (Stanovich & West, Reference Stanovich and West2000). Vocal signals have biological significance, are largely triggered by situational cues, and have direct contextual meanings to listeners (Frühholz & Schweinberger, Reference Frühholz and Schweinberger2020; Frühholz et al., Reference Frühholz, Trost and Kotz2016). On the contrary, musical imitations of these vocal signals are of a more “symbolic” and “fictional” nature, are voluntarily produced along musical principles and cultural rules, and are meant to capture the attention and emotional sway of the listener. The weaker credibility of musically signaled affective meaning could be because of this difference in signal goals and the de-contextualization of the signal. What then is the evolutionary value of these musical signals? The de-contextualized nature of these signals results in the communication of two pieces of information: “emotional fiction” and the biological meaning of the natural signal being imitated. Music-induced emotions are sometimes regarded as “make-believe” emotions, as fictional tools in de-contextualized settings (Walton, Reference Walton1990). In communicating “emotional fiction,” the musical signal tells the listener that the situation is not real, it is a simulation. That information might weaken the second part of the signal, the affective impression of the imitated vocal expression. Given this “emotional fiction” component, perhaps the creation of biologically rooted affective musical signals was motivated by an adaptive need for simulating emotional situations.

What evolutionary role do simulations of emotional situations serve? There is a theory that nightmares may have evolved to simulate threatening situations to increase threat preparedness and survival chances in early humans (Revonsuo, Reference Revonsuo2000). Part of such threat preparedness would include emotional preparedness, or resilience and emotion regulation skills, because nightmares induce fearful emotions. Some research on other threat simulating activities (horror films and violent videogames) supports this theory. People who enjoy horror movies have been found to be more resilient in the face of real-life dangers, such as the COVID pandemic (Scrivner, Johnson, Kjeldgaard-Christiansen, & Clasen, Reference Scrivner, Johnson, Kjeldgaard-Christiansen and Clasen2020). Similarly, people who play violent video games have fewer nightmares, suggesting that videogame simulations actually fill that adaptive need for threat simulation (Bown & Gackenbach, Reference Bown, Gackenbach, Tettegah and Huang2016). In ancient human cultures, threat simulations were conveyed through storytelling. Storytelling is a universal human practice with ancient roots (Smith et al., Reference Smith, Schlaepfer, Major, Dyble, Page, Thompson and Astete2017) and it often involved musical instruments (Pellowski, Reference Pellowski1990). Perhaps storytellers were motivated to create sounds that would be similar to real-life signals but also clearly fictional, increasing the emotional impression of the stories and enabling listeners to rehearse the emotions of the tale in a safe, imaginary, and cooperative space.

Financial support

C.T. received funding from the European Union's Horizon 2020 research and innovation program under the Marie Skłodowska-Curie Grant Agreement (No. 835682). S.F. received funding from Swiss National Science Foundation (Grants Nos. SNSF PP00P1_157409/1 and PP00P1_183711/1).

Conflict of interest

None.

References

Arnal, L. H., Flinker, A., Kleinschmidt, A., Giraud, A. L., & Poeppel, D. (2015). Human screams occupy a privileged niche in the communication soundscape. Current Biology, 25(15), 2051–2056. https://doi.org/10.1016/j.cub.2015.06.043.CrossRef Google Scholar PubMed

Bown, J., & Gackenbach, J. (2016). Video games, nightmares, and emotional processing. In Tettegah, S. Y. & Huang, W. D. (Eds.), Emotions, technology, and digital games (pp. 3–14). Academic Press.CrossRef Google Scholar

Bryant, G. A. (2020). Evolution, structure, and functions of human laughter. In Floyd, K. & Weber, R. (Eds.), The handbook of communication science and biology (pp. 63–77). Routledge.CrossRef Google Scholar

Engelberg, J. W. M., & Gouzoules, H. (2019). The credibility of acted screams: Implications for emotional communication research. Quarterly Journal of Experimental Psychology, 72(8), 1889–1902. https://doi.org/10.1177/1747021818816307.CrossRef Google Scholar PubMed

Frühholz, S, & Schweinberger, S. (2020). Nonverbal auditory communication – evidence for integrated neural systems for voice signal production and perception. Progress in Neurobiology, 199, 101948.CrossRef Google Scholar PubMed

Frühholz, S., Trost, W., & Grandjean, D. (2014). The role of the medial temporal limbic system in processing emotions in voice and music. Progress in Neurobiology, 123, 1–17. https://doi.org/10.1016/j.pneurobio.2014.09.003.CrossRef Google Scholar PubMed

Frühholz, S., Trost, W., & Kotz, S. A. (2016). The sound of emotions – towards a unifying neural network perspective of affective sound processing. Neuroscience and Biobehavioral Reviews, 68, 1–15. doi:10.1016/j.neubiorev.2016.05.002.CrossRef Google Scholar PubMed

Juslin, P. N., & Laukka, P. (2003). Communication of emotions in vocal expression and music performance: Different channels, same code?. Psychological Bulletin, 129(5), 770–814. https://doi.org/10.1037/0033-2909.129.5.770.CrossRef Google Scholar PubMed

Li, P., & Yackle, K. (2017). Sighing. Current Biology, 27(3), R88–R89.CrossRef Google Scholar PubMed

Monelle, R. (2000). The sense of music: Semiotic essays. Princeton University Press.Google Scholar

Paquette, S., Takerkart, S., Saget, S., Peretz, I., & Belin, P. (2018). Cross-classification of musical and vocal emotions in the auditory cortex. Annals of the New York Academy of Sciences, 1423(1), 329–337. https://doi.org/10.1111/nyas.13666.CrossRef Google Scholar

Pellowski, A. (1990). The world of storytelling. H.W. Wilson.Google Scholar

Revonsuo, A. (2000). The reinterpretation of dreams: An evolutionary hypothesis of the function of dreaming. Behavioral and Brain Sciences, 23(6), 877–901.CrossRef Google Scholar PubMed

Scherer, K. R. (1995). Expression of emotion in voice and music. Journal of Voice, 9(3), 235–248. https://doi.org/10.1016/S0892-1997(05)80231-0.CrossRef Google Scholar PubMed

Schwartz, J. W., Engelberg, J. W., & Gouzoules, H. (2019). What is a scream? Acoustic characteristics of a human call type. The Journal of the Acoustical Society of America, 145(3), 1776–1776. https://doi.org/10.1121/1.5101500.CrossRef Google Scholar

Scrivner, C., Johnson, J. A., Kjeldgaard-Christiansen, J., & Clasen, M. (2020). Pandemic practice: Horror fans and morbidly curious individuals are more psychologically resilient during the COVID-19 pandemic. Personality and Individual Differences, 168, 110397.CrossRef Google Scholar PubMed

Smith, D., Schlaepfer, P., Major, K., Dyble, M., Page, A. E., Thompson, J., … Astete, L. (2017). Cooperation and the evolution of hunter-gatherer storytelling. Nature Communications, 8(1), 1–9.CrossRef Google Scholar PubMed

Stanovich, K. E., & West, R. F. (2000). Individual differences in reasoning: Implications for the rationality debate? Behavioral and Brain Sciences, 23(5), 645–665.CrossRef Google Scholar PubMed

Teigen, K. H. (2008). Is a sigh “just a sigh”? Sighs as emotional signals and responses to a difficult task. Scandinavian Journal of Psychology, 49(1), 49–57.CrossRef Google Scholar PubMed

Trevor, C., Arnal, L. H., & Frühholz, S. (2020). Terrifying film music mimics alarming acoustic feature of human screams. The Journal of the Acoustical Society of America, 147(6), EL540–EL545. https://doi.org/10.1121/10.0001459.CrossRef Google Scholar PubMed

Trevor, C., & Huron, D. (2018). Are humoresques humorous? On the similarity between laughter and staccato. Empirical Musicology Review, 13(1–2), 66. https://doi.org/10.18061/emr.v13i1-2.5608.CrossRef Google Scholar

Vassilakis, P. N., & Kendall, R. A. (2010). Psychoacoustic and cognitive aspects of auditory roughness: Definitions, models, and applications. Proceedings of Human Vision and Electronic Imaging XV, 7527, 1–7. https://doi.org/10.1117/12.845457.Google Scholar

Walton, K. L. (1990). Mimesis as make-believe: On the foundations of the representational arts. Harvard University Press.Google Scholar