Skip to main content Accessibility help
×
Hostname: page-component-745bb68f8f-hvd4g Total loading time: 0 Render date: 2025-02-05T20:45:34.205Z Has data issue: false hasContentIssue false

Part II - Performing Rhythm

Published online by Cambridge University Press:  18 September 2020

Russell Hartenberger
Affiliation:
University of Toronto
Ryan McClelland
Affiliation:
University of Toronto

Summary

Type
Chapter
Information
Publisher: Cambridge University Press
Print publication year: 2020

3 Visualizing the Rhythms of Performance

Alan Dodson

Traditional staff notation provides a quantized view of musical time: rhythm symbols place each note at a fixed position within a metric framework consisting of integer multiples and fractions of the beat. This familiar representation of rhythm conceals the temporal elasticity of music in performance, including the nuances of tempo rubato in Western art music as well as the distinctive rhythmic irregularities in other musical traditions, such as the unequal or “swung” eighth-note subdivisions of jazz and blues and the speech-like rhythms of hip hop and other genres of groove-based music. To promote a clearer understanding of such rhythmic practices, collectively known as expressive timing or microtiming, several new methods of visualizing rhythm have been proposed over the past century, both in the context of Western art music – the focus of this chapter – and in other contexts.

I begin with some historical background on rhythm notation and a discussion of some pioneering studies of expressive timing from the twentieth century. Next, I turn to scholarly literature published since 2000 that offers novel illustrations of expressive timing alongside sensitive remarks on the motion qualities of Western art music in performance. I conclude by proposing a new visualization strategy for recordings that deviate from the notational meter, with examples from piano recordings by Ignacy Paderewski, Guiomar Novaes, and Claude Debussy.

The figures and examples for this chapter are given on a supplementary webpage: www.rptm.ca/essays/vrp

Beginnings

The idea of showing proportional, integer-related durations through the shapes of the notes on the page can be traced back to the thirteenth-century treatise Ars cantus mensurabilis (“The Art of Measured Song”) by Franco of Cologne.1 This was a watershed moment in the history of Western music, because proportional notation freed composers from their former reliance on a limited set of stock rhythmic formulas, the rhythmic modes of the ars antiqua, and opened new possibilities for rhythmic flexibility, complexity, and innovation. Since the time of Franco, proportional rhythmic symbols have been used in all of the leading systems of music notation in the West, from the mensural notation of the late Middle Ages and Renaissance to the modern notation that emerged in the seventeenth century and remains in common use today.

There is no denying that proportional rhythmic notation presents a simplified image of musical time, because the durations in actual performances rarely form integer relationships with each other. Aware of this limitation, some scholars began seeking new, performance-sensitive representations of rhythm by the early twentieth century.2 Among these, the most influential was Carl E. Seashore, a pioneering figure in the field of music psychology who assembled a research team at the University of Iowa during the 1920s and 1930s to analyze commercial recordings and live performances. One of the main goals of Seashore’s work, summarized in his 1938 textbook on the psychology of music, was to demystify the artistic dimensions of performance, which he equated with “deviation from the fixed and the regular: from rigid pitch, uniform intensity, fixed rhythm, pure tone, and perfect harmony.”3 Seashore sought a practical visual representation of these artistic deviations, and his solution is a graphic format that he calls the “performance score.”4 Figure 3.1 is his performance score showing measure-level durations in two renditions of the beginning of Chopin’s Polonaise in A Major, Op. 40, No. 1, as performed in the lab by the concert pianist Harold Bauer.5

This graph maps the relationship between two aspects of musical time. The horizontal axis shows what we might call “score time”; it divides the performance into discrete increments based on the barlines in the score, a chain of units of time that are conceptually equal to each other.6 The vertical axis shows these units’ duration in “clock time,” which can be thought of as a continuous timescale, a free-flowing stream of musical time. In this type of graph, the changes in vertical position from measure to measure represent expressive timing. Duration is the inverse of tempo, so downward slope means acceleration and upward slope deceleration.

Bauer was asked to play through the entire passage twice with the same expression, and this figure is meant to show that he could do that rather well. The main expressive features here are a “phrase arch” – that is, an acceleration followed by a deceleration – within mm. 1–8, another phrase arch in mm. 9–16, and an acceleration all the way through mm. 17–24.

To give a clearer sense of what the performance score represents, Multimedia example 3.1 includes an excerpt from another recording of the A-major Polonaise from the 1930s, along with my own Seashore-style analysis. The recording is by Artur Rubinstein, who repeats mm. 1–8 as prescribed in the score but does not repeat mm. 9–24.7 To track Rubinstein’s recording, I located measure onsets in Sonic Visualiser, a widely used, open-source performance analysis program.8 Like Bauer’s performance in the lab, Rubinstein’s recording has a clear phrase arch in mm. 1–8 as well as a ritardando in m. 16, but in place of Bauer’s eight-bar phrase arch in mm. 9–16, Rubinstein gives a pair of four-bar phrase arches that reflect this material’s antecedent-consequent design. Another difference is that Rubinstein has a further phrase arch in mm. 17–24 when the opening material returns, in place of Bauer’s race-to-the-finish-line accelerando.

Some might object that this performance score misrepresents the musical experience, because our attention may very well be drawn to expressive features in the recording that are not shown in the graph. For example, we might be struck by certain rhythmic details within the measure, such as the elongated eighth notes and the compressed sixteenths, as well as certain details of dynamic accentuation, not to mention the wrong notes in the repeat of the opening phrase. Why aren’t these features shown in the graph? Seashore would not deny their importance or discourage us from exploring them empirically, but he places a premium on the clarity that can be obtained when variables are isolated. Because the graphs discussed above only show durations at the measure level, they draw our attention to phrase-level patterns that are subtle but nonetheless readily audible and, moreover, musically meaningful given their close correspondence to the phrase structure of the composition and their stability across repeated performances. For these reasons, I find it helpful to avoid thinking of these graphs as representations of the overall effect or “essence” of a performance. Instead, I prefer to think of them as lenses or filters that draw attention to certain expressive features that the analyst considers meaningful.

After Seashore’s retirement in 1946, a second wave of empirical performance analysis began to emerge in the 1960s and continued into the 1980s and 1990s.9 New technologies such as MIDI and the personal computer greatly facilitated this type of research. Most empirical studies from this period echo Seashore in equating performance expression with measurable deviations from exact regularity, and they often feature graphs that closely resemble Seashore’s performance scores. Innovation during this period lay mostly in the development of new theoretical models that map aspects of musical structure onto patterns gleaned from performance data. Among these generative models of performance, as they came to be known, the one cited most often in recent literature is Neil Todd’s computational model of phrase arching, the expressive device we observed in the timing graphs discussed above.10 The model represents phrase arching as sets of nested parabolic curves, and it encompasses both expressive timing and variations in loudness (Figure 3.2).11 The embedding depth of each curve is proportional to the unit’s hierarchical position in the grouping structure, which is itself modeled through the rule system given in Fred Lerdahl and Ray Jackendoff’s Generative Theory of Tonal Music.12 The model predicts that the changes in tempo and dynamics at section boundaries will be more extreme than those at phrase boundaries within a section, which will in turn be more extreme than those at subphrase boundaries within a phrase. As supporting evidence, Todd provides graphs of timing and dynamics from performances by professional pianists, with patterns similar to those predicted by the model (see the broken lines in Figure 3.2).

In a thoughtful review of Todd’s work, Luke Windsor and Eric Clarke compare human and computer-generated recordings of a Schubert Impromptu and conclude that the model does not provide a complete explanation of how pieces are performed but may nonetheless be useful as a general expressive baseline or norm against which the nuances of a particular performance can be interpreted.13 This could be described as a deductive (theory-driven) approach to performance analysis. Bruno Repp offers a complementary inductive (data-driven) approach in a series of corpus studies from the 1990s.14 Here models are extracted from large sets of performance data through statistical methods (mainly averaging and principal components) and are then used as frames of reference for the interpretation and comparison of individual recordings.

In addition to this work by music psychologists, some historical research involving empirical methods of performance analysis began to emerge in the 1990s. One especially noteworthy contribution in this vein is José Bowen’s 1996 article on tempo fluctuation in recordings of orchestral music from the Classical and Romantic eras.15 Bowen tracked the tempo by tapping along on a computer keyboard while listening to recordings, a more efficient but less accurate method than those used by music psychologists.16 Bowen presents the tempo data in a series of graphs, again mainly in the style of Seashore’s performance scores but now at several levels of scale, proceeding from entire movements to individual phrases. He uses this data to support some rather broad interpretive claims about historical trends in performance practice over the course of the twentieth century and about the performing styles of several well-known conductors. For example, he shows that the tempos within the first movement of Tchaikovsky’s Sixth Symphony have become more extreme over time (Figure 3.3),17 but he also points out that the earlier recordings tend to have wide tempo fluctuations within each section, citing as a prime example the Concertgebouw Orchestra’s 1937 recording conducted by Willem Mengelberg (Multimedia example 3.2).18 Within the latter recording, Bowen highlights a series of three rather extreme phrase arches, beginning in m. 100, as well as a high level of tempo volatility from the beginning of the exposition to the first climax at m. 38.

Another tapping study from the mid 1990s, an essay by Nicholas Cook on Wilhelm Furtwängler’s recordings of Beethoven’s Ninth Symphony, forged a connection between empirical performance analysis and the discourse of music theory: Cook shows several correspondences between Furtwängler’s interpretation and the analysis in Heinrich Schenker’s monograph on the Ninth Symphony, which Furtwängler knew well.19 This sort of disciplinary cross-fertilization was a hallmark of research by scholars associated with the Centre for the History and Analysis of Recorded Music (CHARM), of which Bowen and Cook were founding members. First established at the University of Southampton in 1994, within a decade CHARM grew into a multi-institutional enterprise that hosted a series of international interdisciplinary conferences during the first decade of this century.

Recent Innovations

Over the past twenty years, partly as an outcome of CHARM and other related initiatives, the empirical literature on rhythmic aspects of performance has continued to flourish. During this period, traditional timing graphs have continued to appear routinely in performance analysis contributions, for instance in my own work on prolongational boundaries and metric dissonance in recorded music, and in writings on performing style and expression by Daniel Leech-Wilkinson and Dorrotya Fabian.20 However, several new types of analytic figures have also been proposed, often in conjunction with observations about subtle qualities of motion that arise from interactions between performing nuances and aspects of grouping and meter. To illustrate this recent trend, I’ll now turn to some musical examples from five representative publications, all of which deal with recordings of nineteenth-century music for solo piano. (This repertory has probably been emphasized because piano note onsets are percussive and therefore relatively easy to locate in a sound file, and because tempo rubato tends to be more prominent in recordings of music from the nineteenth century as compared to other periods.)

In a 2012 essay on three recordings of the first movement of Beethoven’s “Moonlight” Sonata, Elaine Chew superimposes curved arrows on timing graphs to show what she calls “high-level phrase arcs” (Multimedia example 3.3).21 Chew’s conception of the phrase arch phenomenon differs fundamentally from Todd’s, because she does not invoke Lerdahl and Jackendoff’s rule system, whose input consists of compositional features such as melodic parallelism and cadence.22 Instead, her conception of the phrase rests entirely on expressive timing: for Chew, a “performed phrase” is simply a segment of the recording that begins and ends with minima in the timing graph.23 She compares the rhythmic effect of different recordings by considering the relationship between these performed phrases and the metric notation within the first fifteen measures, showing that Daniel Barenboim and Maurizio Pollini slow down at the barlines, whereas Artur Schnabel does so only at formal boundaries marked by cadences. Chew notes that prior to these cadence points, “the consistent ebb and flow across the bar lines creates a sense of continuity, perhaps a clue as to how [Schnabel] creates these long, long lines.”24 The arrow notation in Multimedia example 3.3a highlights the high-level performed phrases in Schnabel’s recording. In a follow-up study, Chew considers the formal implications of the high-level phrase arcs in other recordings. For example, she describes the differences between Schnabel’s and Pollini’s interpretations (Multimedia example 3.3b) in the following way:

While Schnabel’s [recording] draws a long line to the first modulation, to E major, Pollini’s long line is reserved for bars 5 through 15. This suggests that Pollini may have heard the first four bars as an introduction, after which the long line begins and stretches to the downbeat of bar 15, which is the cadence in b minor.25

Thus Chew’s annotated tempo graphs point not only to differences in performing style, but also to different ways of hearing the succession of formal functions in this passage.26

Phrasing hierarchy is also the subject of an article on recordings of Chopin’s Mazurkas by Mitchell Ohriner, who builds on the traditional timing graph in two ways.27 As a first approximation, he uses simple annotations to show how different performances resolve a grouping ambiguity in different ways: in Multimedia example 3.4, the third beat of m. 24 in the Mazurka in C Major, Op. 24, No. 2, can be heard either as (a) the ending of a group that begins in m. 21 or (b) the beginning of a group that ends in m. 28.28 Recordings by Frederic Chiu and Vladimir Ashkenazy tip the balance one way or the other. Furthermore, at the measure level (not shown here), Chiu accelerates in m. 25 but Ashkenazy does not, so there is also a difference in the salience of different levels of the grouping hierarchy in these two performances, as shown through the boldfacing in Multimedia example 3.4c.29

The second method involves using an algorithm to describe grouping similarities and differences within a large set of recordings at three levels of scale. As shown in Multimedia example 3.5, the segments of a timing graph are translated into durational contours, that is, ordered sets that represent the durations within the segment as integers from 0 (the shortest unit) to n–1 (the longest unit), where n is the number of units.30 Next, a contour reduction algorithm checks for group-final lengthening (GFL) – that is, a slow ending – at the 2-, 4-, and 8-measure levels, in reference to a hypothetical grouping structure of (2+2)+(2+2) measures.31 The algorithm omits the midsized duration within a rolling 3-unit window, and this process is repeated recursively until the contour cannot be reduced further. Only contours that reduce to a pattern of decelerating throughout (<01>), or accelerating and then decelerating (<102> or <201>) are considered GFL-reflective. The results are displayed visually as a set of boxes that are either thick or thin, indicating the presence or absence of GFL. Multimedia example 3.6 shows the results for 6 of the 29 recordings of the C-major Mazurka that Ohriner analyzes with this method.32

Because the reference structure is the same for all recordings, the second method is less sensitive to fine grouping boundary differences than the first, but it can nonetheless reveal more fundamental differences in the pacing of the recordings. For example, it shows that Brailowsky’s recording differs from both Chiu’s and Ashkenazy’s insofar as it lacks GFL at the four-bar level, giving it a markedly different feel from the other two. (Multimedia example 3.6 includes an excerpt from the Brailowsky recording.) The basic idea here is that the hypothetical (2+2)+(2+2) pattern exists in reality only to the extent that its constituent parts are performed in a GFL-reflective way. In this sense Ohriner’s approach (like Chew’s) offers a less deterministic representation of grouping than earlier generative approaches grounded in Lerdahl and Jackendoff’s rule system.

Meter rather than grouping forms the framework for a new visualization strategy proposed in a paper by Olivier Senn, Lorenz Kilchenmann, and Antoine Camp, who prepared a note-by-note durational analysis of the first four measures in Martha Argerich’s recording of Chopin’s Prelude in E Minor (Multimedia example 3.7).33 In this case, units at various levels of the metric hierarchy – measures, half-measures, beats, and beat subdivisions – are represented in both the horizontal and the vertical axis, resulting in a series of nested squares.34 The left and right hands are tracked separately, and asynchronies at points aligned in the score are shown as positive or negative integers in the middle of the graph, between the two streams of nested squares. Performance expression is multileveled, and it is very helpful to be able to survey several levels of activity at a single glance. This hierarchical representation is thus a significant advance over the traditional timing graph, which tracks durations on only one level of scale at a time, typically the measure or beat level.

Senn et al. point out some recurring patterns in the Argerich recording: the second half-measure is consistently longer than the first, there are consistent “melody leads” (i.e., the melody notes begin a few milliseconds before the chords), and there are beat- and subdivision-level ritardandos into the first three barlines as well as the midpoint of m. 4.35 The authors describe the overall effect of these patterns as follows: “Argerich seems to make a new effort to gain some speed with each new bar (hence the faster first halves), but this effort is lost [and] as the bar progresses, the tension relaxes and the music seems to stagnate.”36 From the loose fit between Argerich’s performed durations and the rhythmic notation, Senn and his co-authors offer this musical narrative-in-miniature as a speculative “inverse interpretation” that is presented “strictly from the listener’s point of view.”37 This is a preliminary study of modest scope, limited to four measures from one recording.38 However, it has the special distinction of having inspired several creative applications, in the form of a series of compositions by the American composer Richard Beaudoin. In these compositions, the durations in the score are derived from various nested-squares performance analyses, including the one in Multimedia example 3.7.39

Some other recent studies have explored phrase-level patterns of dynamics as well as expressive timing. Jörg Langner and Werner Goebl explore interaction between these two domains of performance expression through innovative “performance worm” animations involving a string of overlapping discs in a space defined by tempo on the horizontal axis and loudness on the vertical.40 Multimedia example 3.8 demonstrates this approach through an analysis of a recording of Chopin’s Etude in E Major, Op. 10, No. 3, by the pianist Mauricio Pollini.41 As noted above, Todd’s generative model suggests that both tempo and dynamics tend to increase and decrease in arch-shaped patterns over the course of a phrase. If the dynamic and tempo arches were congruent, then one would expect the performance worm to move diagonally most of the time, first to the northeast and then to the southwest. However, Langner and Goebl observe that in many recordings, such as the Pollini recording in Multimedia example 3.8, the tempo tends to increase before the dynamic level in the approach to a phrase’s climax, and counterclockwise patterns are prevalent overall.42 Though based on a relatively small sample of recordings, this observation suggests a possible refinement of our understanding of phrase arching. Follow-up studies have sought further recurring patterns of motion in the tempo-loudness space, including performer-specific patterns as well as generic patterns, through artificial intelligence models.43

Dynamic and temporal aspects of phrase arching are explored in a different way by Nicholas Cook in the chapter on recordings of Chopin’s Mazurkas that forms the centerpiece of his broad-ranging book on performance analysis, Beyond the Score: Music as Performance.44 Cook presents the data in the form of “scape plots” in which beat-level data are represented at the base of a triangle and adjacent beats’ values are averaged recursively at higher levels, as shown schematically in Figure 3.4.45 The loudness triangle is inverted and aligned with the timing triangle along the base, creating a diamond shape overall, and the numerical information is color-coded in such a way that flame-like patterns emerge at the loudest and slowest points in an excerpt. Multimedia example 3.9a is Cook’s analysis of a recording that he describes as a textbook example of phrase arching, namely Heinrich Neuhaus’s recording of Chopin’s Mazurka in C-sharp Minor, Op. 63, No. 3.46 Through graphs of several other recordings of this piece, Cook demonstrates that the technique of phrase arching, though prevalent since the Second World War (especially among Russian-trained pianists), was less widely used in the first half of the twentieth century, when a more rhetorical and improvisatory style of performance prevailed. In Ignaz Friedman’s recording (Multimedia example 3.9b), for instance, Cook hears a “focus on moment-to-moment expression” arising from features such as the unusually long opening upbeat and abrupt changes of tempo and articulation later in the excerpt.47 Thus Cook uses his flaming diamond graphs to support the argument that phrase arching is a historically and culturally specific practice, not a universal or “hard-wired” psychomotor phenomenon, as Todd’s model and other generative models seem to imply.

Despite their sharp differences in design, all of the recent examples I have surveyed help bring into focus musically meaningful patterns that might otherwise escape our notice, as well as the qualitative effects that these patterns help to generate. Like Seashore’s performance scores, then, these new types of graphs can be thought of as analytic lenses or filters that help us reach a clearer understanding of the music we are actually hearing, as distinct from the music we read on the page. Another way in which these new graphs resemble Seashore’s performance scores is that they use the notational meter – the beats and measures shown in the score – as a frame of reference: durations are tracked at the level of the notated beats, and measure numbers are shown either on the horizontal axis (Chew, Ohriner, Cook), at the top of the nested squares (Senn et al.), or on the face of the performance worm (Langner and Goebl). In my view, this is not problematic in the particular examples surveyed above, because these beats and measures can easily be heard and felt in these recordings. However, reliance on the metric notation limits the generalizability of these analytic approaches, because many other recordings do not conform to that notation – especially those by performers trained at the beginning of the twentieth century, a period when many musicians and critics railed against the “tyranny of the barline.”48 I consider the variations in meter within such non-literalist recordings to be a distinct type of “artistic deviation from the regular” – in other words, a special type of performance expression.

Visualizing Metric Variation in Performance

Let’s consider two versions the opening of Chopin’s Mazurka in A Minor, Op. 17, No. 4, one recorded by Ignacy Paderewski in 1923 and the other by Guiomar Novaes in 1954.49 According to the score (Multimedia example 3.10a), this excerpt consists of eight bars in triple meter, grouped as 4+4.50 Timing graphs of the two recordings are very similar (Multimedia example 3.10b), and the corresponding timescape graphs are nearly identical (Multimedia example 3.10c).51 However, these illustrations conceal some fundamental differences between the recordings in the domain of experiential meter, by which I mean the meter felt by the listener, as opposed to the meter represented in the score.52 To get at these differences, I will borrow some concepts and analytic symbols from Christopher Hasty’s Meter as Rhythm, which casts meter not as a structure or schematic framework but instead as a dynamic, emergent aspect of the listening experience.53 Hasty represents experiential meter through static images, but I will instead use animations in an effort to represent more directly the temporal qualities that the theory describes, an approach suggested in John Roeder’s review of Meter as Rhythm.54 I created Multimedia example 3.11–3.15 using Adobe Animate, a multimedia computer animation program.

For Hasty, the basis of the experience of meter is projection, the feeling that once a duration has been articulated, it will immediately be reproduced: after hearing two sounds, we expect a third sound at a specific future moment (Multimedia example 3.11a).55 A complex projection occurs when a long projection is coordinated with two or more shorter projections (Multimedia example 3.11b),56 and within a complex projection, the dominant beginning (|) launches a projection that remains active for more than one beat, while a continuation (\) initiates a shorter projection between two dominant beginnings.57

One of the benefits of Hasty’s analytic notation, as compared to traditional metric symbols like time signatures and barlines, is that it allows for more subtlety and flexibility in tracking changes of meter. Hasty defines three types of metric change relevant to the excerpts discussed below: denial occurs when a timespan does not fulfill its projected duration and instead launches a new projection that is considerably shorter or longer than the one before it (Multimedia example 3.12a),58 deferral happens when an extra continuation postpones a dominant beginning and therefore alters the metric type, for instance by shifting from duple to triple meter (Multimedia example 3.12b),59 and interruption occurs when the ending of a projected duration is pre-empted by a new, early beginning (Multimedia example 3.12c).60

Hasty’s own analyses center on compositional applications of these processes of metric change, mainly in music from the twentieth century, and he uses scores rather than recordings as his objects of analysis. I often hear these same metric processes when listening to recordings of earlier, common-practice repertoire, including the Paderewski and Novaes excerpts discussed above, to which I now return.61

Multimedia example 3.13a shows the variations in meter that I hear in the Paderewski excerpt.62 Up to the boundary of the introduction and theme, I hear a projection of triple meter without anacrusis. This triple meter is so well established that I feel a fifth downbeat during the sustained chord that begins in the fourth experiential measure, but the mazurka theme enters before this fifth measure is completed – a clear case of interruption. A further complication occurs during the first three beats of the theme, which Paderewski plays as weak–strong–weak. This can be heard either as a syncopation or an anacrusis in its local context, but once the theme’s second downbeat is articulated the ambiguity resolves in favor of syncopation. From that point on, I hear an unambiguous triple meter until the end of the excerpt.

Multimedia example 3.13b maps the hypermeter in Paderewski’s recording, which is equally flexible. Although a potential for duple hypermeter can be sensed throughout the excerpt, this potential is fully realized only toward the end.63 Duple hypermeter is ubiquitous in Romantic piano music, so we might reasonably expect it at the outset of the recording. Instead, the second hyperdownbeat is deferred until the sustained chord at the end of the introduction, which gives the fourth experiential measure a much stronger sense of “beginning” than the third, and suggests that the recording begins with triple hypermeter. This effect is enhanced by the increased tempo and rhythmic activity leading up to the sustained chord, which bring a sense of anacrusis, of into-the-beginning. However, the projection of this triple hypermeter is interrupted by the entry of the mazurka theme, which itself projects the duple hypermeter expected from the outset.

In Novaes’s recording, I hear a fundamentally different metric process involving a shift from duple to triple meter at the surface level (Multimedia example 3.14a). Duple meter is projected throughout the introduction, and the triplet figure near the end of the introduction sounds like a quarter-note triplet, not an eighth-note triplet as in Paderewski’s recording and in the score. This quarter-note triplet foreshadows a shift from duple to triple meter at the beginning of the theme, and the ritardando that surrounds the triplet helps smooth out this metric transition. In conjunction with this ritardando, a shift from duple to triple hypermeter can also be discerned in the introduction (Multimedia example 3.14b). The accents on the first and fifth chords of the introduction project duple hypermeter, and the triplet figure defers the next hyperdownbeat until the sustained chord. Then, as in Paderewski’s recording, this triple hypermeter is interrupted by the entry of the mazurka theme, which itself projects duple hypermeter throughout.

The Paderewski and Novaes example show some ways in which an investigation of experiential meter may shed light on distinctive and meaningful patterns that would be obscured in an analysis framed by the notational meter. To demonstrate this approach further, I will now turn to a more metrically intricate passage from Debussy’s 1913 piano roll recording of his piece D’un cahier d’esquisses (“From a Sketchbook”) as reissued on CD.64 The passage includes a cadenza that features two gong-like bass notes, each of which is followed by a rapid ascending passage and a short melodic fragment. The cadenza is preceded by a short transition and followed by a coda.65 Multimedia example 3.15a maps the metric fluctuations that I hear in this excerpt.

A clear and continuous tactus can be felt throughout the excerpt, but the underlying (measure-level) projections are considerably less clear and continuous: a slow duple meter is established at the outset, but two processes of metric change, namely, denial and deferral, can be traced within the cadenza. Because of the slow tempo, these processes do not bring as much tension as the interruptions and deferrals in the Paderewski and Novaes recordings. Instead, they bring a subtle ebb and flow of metric determinacy over the course of the excerpt.

The melodic fragment after the cadenza’s initial gong stroke includes an extra continuation (“weak beat”) that defers the next downbeat until the second gong stroke. This establishes a potential triple meter, but when the melodic fragment returns soon after the second gong stroke, its rhythm is altered in such a way that the gesture now encompasses four beats instead of three. The result is that the tentative triple meter recedes from our field of awareness,66 and the referential duple meter is restored.

By the end of the excerpt, the tempo has become slow enough to undermine the duple meter, which seems to dissipate shortly after the coda begins. However, there is a feeling of rhythmic continuity between the tactus pulses of the cadenza and the triple measures of the coda. In other words, there seems to be a transition from a slow duple to a moderate triple meter toward the end of the excerpt, similar to what happens in the Novaes excerpt discussed above.

The score and recording are again quite remote from each other in this case. The cadenza is mostly unmeasured in the score (Multimedia example 3.15b), and one of its main focal points in the recording, the second gong stroke, is altogether absent.67 A few beats later, at the beginning of the coda, the score indicates a change from 68 to 64 meter, as well as a tactus shift (dotted quarter = quarter), indicating that the coda should proceed at only one-third the tempo of the previous 68 measures – a dramatic discontinuity in comparison to the seamless connection heard in the recording.

These animations provide a new way to visualize and interpret the nuances of rhythm and timing in recordings that lack a consistent metric framework. As such, they fill a gap in the literature surveyed in Parts I and II of this chapter. Further applications might explore recordings of Gregorian chant and other types of monophonic singing in free rhythm, as well as free-flowing instrumental music like the unmeasured keyboard preludes of the French baroque. These are just a few of seemingly endless possibilities for research in performance analysis, a field that will surely continue to grow and develop as new technologies become available in the years ahead – thereby helping to further expand our appreciation and understanding of the rhythms of performance.

Figures and Multimedia examples for Chapter 3

These figures appear on the website www.rptm.ca/essays/vrp

  1. 3.1 Timing in the opening of Chopin’s Polonaise in A Major, Op. 40, No. 1, performed by Harold Bauer (Seashore, Psychology of Music)

  2. 3.2 Graphs of human and computer-generated performance expression:

    1. (a) Timing in an excerpt from Haydn’s Sonata in E-flat Major, Hob. XVI/49 (Todd, “A Computational Model of Rubato”)

    2. (b) Loudness in an excerpt from Chopin’s Prelude in F-sharp Minor, Op. 28, No. 8 (Todd, “The Dynamics of Dynamics”)

  3. 3.3 Average tempos in various recordings of the first movement of Tchaikovsky’s Symphony No. 6 in B Minor, Op. 74 (Bowen, “Tempo, Duration, and Flexibility”)

  4. 3.4 Generalized scape plot design (Sapp, “Computational Methods”)

These multimedia examples appear on the website www.rptm.ca/essays/vrp

  1. 3.1 Timing in the opening of Chopin’s Polonaise in A Major, Op. 40, No. 1, performed by Artur Rubinstein

  2. 3.2 Timing in the first movement of Tchaikovsky’s Symphony No. 6 in B Minor, Op. 74, performed by the Concertgebouw Orchestra conducted by Willem Mengelberg (Bowen, “Tempo, Duration, and Flexibility”)

  3. 3.3 Timing and high-level phrase arcs in the opening of Beethoven’s “Moonlight” Sonata, Op. 27, No. 2

    1. (a) performed by Artur Schnabel (Chew, “About Time”)

    2. (b) performed by Artur Schnabel (top half) and Maurizio Pollini (bottom half) (Chew, “From Sound to Structure”)

  4. 3.4 Timing and grouping in an excerpt from Chopin’s Mazurka in C Major, Op. 24, No. 2 (Ohriner, “Grouping Hierarchy and Trajectories of Pacing”)

    1. (a) performed by Frederic Chiu

    2. (b) performed by Vladimir Ashkenazy

    3. (c) effects of performance strategies upon grouping structure

  5. 3.5 Durational contour in Chopin’s Mazurka in C-sharp Minor, Op. 63, No. 3, performed by Stanislav Bunin (Ohriner, “Grouping Hierarchy and Trajectories of Pacing”)

  6. 3.6 Group-final lengthening in six recordings of Chopin’s Mazurka in C Major, Op. 24, No. 2 (Ohriner, “Grouping Hierarchy and Trajectories of Pacing”)

  7. 3.7 Timing and notational meter in the opening of Chopin’s Prelude in B Minor, Op. 28, No. 4, performed by Martha Argerich (Senn et al., “Expressive Timing”)

  8. 3.8 Timing and dynamics in Chopin’s Etude in E Major, Op. 10, No. 3, performed by Maurizio Pollini (Langner and Goebl, “Visualizing Expressive Performance”)

  9. 3.9 Timing and dynamics in the opening of Chopin’s Mazurka in C-sharp Minor, Op. 63, No. 3 (Cook, Beyond the Score)

    1. (a) performed by Heinrich Neuhaus

    2. (b) performed by Ignaz Friedman

  10. 3.10 Timing and notational meter in the opening of Chopin’s Mazurka in A Minor, Op. 17, No. 4

    1. (a) score

    2. (b) timing graphs of two recordings, performed by Ignacy Paderewski and Guiomar Novaes

    3. (c) scape plots of the two recordings

  11. 3.11 Animations for projection, as described in Hasty’s Meter as Rhythm

    1. (a) simple

    2. (b) complex

  12. 3.12 Three types of metric change discussed in Hasty’s Meter as Rhythm

    1. (a) denial

    2. (b) deferral

    3. (c) interruption

  13. 3.13 Experiential meter in the opening of Chopin’s Mazurka in A Minor, Op. 17, No. 4, performed by Ignacy Paderewski

    1. (a) preliminary analysis (two levels)

    2. (b) full analysis (three levels)

  14. 3.14 Experiential meter in the opening of Chopin’s Mazurka in A Minor, Op. 17, No. 4, performed by Guiomar Novaes

    1. (a) preliminary analysis (two levels)

    2. (b) full analysis (three levels)

  15. 3.15 Experiential and notational meter in the cadenza from Debussy’s D’un cahier d’esquisses

    1. (a) performed by the composer

    2. (b) score

4 A Percussionist Understands Rhythm in Five Essays of Exactly 1,000 Words Each Not Including Titles and Subtitles

Steven Schick

Rhythm as a relational language that reflects the tension between an event and its surrounding temporal context. There are no complex rhythms. The difficulty of maintaining rhythms whose ratios feature large denominators.

Imagine that you are sitting on a grassy slope near a cliff rising above the sea. In the distance, an approaching cloud formation threatens rain. You wait, lost in thought. The first raindrop surprises you. How do you react?

Does the first drop feel cold or heavy? Do you feel at one with nature, or are you more concerned with the future of your cashmere sweater? Do you feel inspired? Annoyed? Or, do you shake your head in exasperation and mutter, “That drop is late!”

The last question almost certainly doesn’t come to mind. Concepts like early or late – the lingua franca of rhythm – rely not just on a single event, but upon the relationship between that event and an external, often communal, temporal context. Temporal context, whether it is sensed, conducted, or intoned on a metronome, establishes an expectation in timing such that we can say whether a note (or raindrop) is early, late, or right on time; downbeat, upbeat, or syncopate. In particular, those terms of rhythmic art describe the tension inherent in expectation and make the difference between playing a rhythm and merely articulating a point in time.

So, the mastery of rhythm – if mastery is the right word for something that is primarily felt rather than measured – lies in the intelligent management of the tension inherent in expectation. This is a truly complex feat, involving ears, mind, and muscles in a feedback loop that is both prognosticative (Is this the right moment to play?) but also retrospective (Was I early or late, and do I need to adjust so the next note will arrive when I want it to?). If playing even a single note on time is this complex, then this musician refuses to accept a term like complex rhythm (since all relationships and therefore all rhythms are complex). And though I have been tasked to comment on complex contemporary rhythms in these pages, I will begin by observing that there is no qualitative difference in complexity between playing triplets (a ratio of 3:2 faster than the unity value of the surrounding temporal context) and a seemingly more difficult rhythm like 7:5.

So, instead of focusing on words like complexity, let’s examine the quality of tension between an event and its surrounding temporal context. At its most basic level, this is unproblematic. A note that arrives on time or a succession of notes that adheres exactly to an ongoing pulse can be thought of as “unity,” or 1:1. Set your metronome on 100 and play along at the same speed. You are at unity. A succession of notes that moves 50 percent faster than unity reaches a speed of 3:2, or 150 beats per minute. In the relational language of rhythm, we call them triplets. A ratio of 4:3 produces a speed of a bit more than 133, but most musicians would know this better as a sequence of dotted notes. Broadly speaking, the higher the denominator of the fraction when reduced to its simplest form, the greater the relational distance from unity values and the greater the amount of temporal (and therefore rhythmic) tension produced. The point here is not to mystify the rhythms we know by parsing them as ratios – please continue to think of triplets as such rather than as the polyrhythm 3:2 – but to demystify them. If everything is a ratio, then, in principle, 1:1 is not fundamentally more complex than 15:7, but simply describes a different quality of tension relative to a given unity value.

Or you might wish to think in harmonic terms: a triplet has the same relationship to rhythmic unity that a perfect fifth has to its root. The sequence of dotted rhythms has the same ratio as a perfect fourth. This might be useful if, the next time you struggle with a rhythm like 15:8, you think of it as a just-tuned major seventh – nearly but not quite at unity.

“Tuning rhythms” can be a useful strategy because arriving early to a downbeat is like playing sharp. Simply play the rhythm a little “flatter” (or later, if you will) and you’ll be right. And, like intonation, there are many versions of correctness. A just-tuned major third above C will be fourteen cents lower than a tempered major third. Both are thirds, but they behave differently in context. A just-tuned third feels settled to me, while a high seventh – a favorite for leading tones everywhere – seems skittish, unsatisfied. Triplets, quintuplets, and backbeats have many of the same variations of behavior, though in the rhythmic space, we think of these small variations in rhythmic behavior not as intonation but rather as “feel.” Imagine these distinctions as the difference between a Charlie Watts backbeat resting firmly in the pocket of a groove and an edgy, spring-loaded David Lang rhythm.

Understanding rhythms as ratios – and therefore as the creation and management of rhythmic tension – provides a useful strategy in performance. A performer who sees a 7:5 in a musical score might start by conceiving of it as 1.4 times faster than unity speed. (Simply divide 7 by 5 on your calculator.) If you already know how to play triplets (1.5 times faster than unity value) and you can manage dotted eighth notes (1.3 times faster), then 7:5 slides neatly in between.

Here the difficulty is not so much finding but maintaining the rhythm. It is fairly easy to see why. If you play 1:1, every note you play aligns with the external template and is completely supported by the rhythmic pulse you are counting to yourself. A value of 3:2 means that you align with the unity pulse every other note – still well supported. If you play 7:5, that means you align only every five notes; 13:7 aligns only every seven notes, and so on. The longer the cycle, the more chance there is to drift between points of alignment. And, for a musician wishing to explore these kinds of ratios, the more important it becomes to internalize cycles of varying lengths.

Learning in cycles. The pan-rhythmic spectrum. The transaction between the firmness of cycles and the fluidity of rhythms.

Ratios provide a simple means to understand the speed of a rhythm. But they are less useful in actual musical performance, unless relative speed is the only criterion. See Ben Johnston’s Knocking Piece (1962), for example. However, in much contemporary music, rhythmic speed aligns with or rubs against a temporal cycle.

The relationship between cycle (the extent to which the underlying external temporal context can be seen to have recurring patterns) and any given musical event that inflects it (the placement of a note on strong or weak beats) is what most people think of as rhythm. Your hard-partying cousin raises a fist during his wedding dance because a musical event (Van Halen power chord) aligns with a nodal point in the musical cycle (end of the solo; beginning of the verse). The coincidence is powerful and synesthetic: alignment between rhythmic event and cycle produces both a (potentially dangerous) muscular response and a zap of euphoria.

Powerful experiences in rhythm release powerful emotions. But no one is moved by the correct execution of 7:5. We are moved because the perception of cycles is deeply embedded in the human psyche. And, when we hear or play a rhythm that excites, transgresses, or clarifies a cycle, strong emotion often results. In a moment of profound connection with my father, I remember walking with him out onto the plowed black earth of our Iowa farm, quickly sniff the warming spring air, and in one of the greatest examples of rhythmic improvisation I have seen, say, “It’s time to plant.” He was like a great jazz musician with his metronome set on 1525,600. Human beings, migrating birds, and animals with memory all feel deeply the long cycles of sun and moon; of mating season and endocrine system. We know that the good movies usually come out in December, and we understand the inevitability of D major at the end of Beethoven’s Ninth Symphony. We intuit the cyclical architecture of tragedy when Anna Karenina meets Vronsky at a train station after someone has thrown herself to the tracks and then, several hundreds of pages later, is herself crushed by a train.

These are powerfully felt cycles. But unarticulated cycles, like all balanced systems, create no heat. To make art, cycles must be articulated by events, which parse them on a human scale. Note that large planting/harvest cycles consist of multiple mini-cycles of cultivation, detasseling, and irrigation. D major returns many times in sub-articulations of Beethoven’s Ninth before we arrive at the final cadence. Articulation of large cycles through deliberate rhythmic interventions is the trade of all good artists and farmers.

It’s a complex process, because even identical points of time function very differently as rhythm depending on the quality of a given musical event. Take the thousand-plus downbeats in our Beethoven symphony. On one level, they’re equal – each is theoretically the strongest beat of the measure. But a downbeat at an important change of harmony, or another marked by a timpani fortissimo (note how often these are simultaneous) is not just a different sound than a less profiled downbeat; it is also a different rhythm. Musical texture, sonic weight, harmony, and intonation comprise a non-temporal component of rhythm. And, since a rhythmic event is a point of inflection within a temporal cycle, variations in the vibrancy of that inflection create not just different sounds but also, in essence, different rhythms. The resulting “pan-rhythmic” spectrum, whereby texture also functions as rhythm, rhythm as intonation, polyphony as harmony, and so forth, extrapolates the relational tension between cycle and event to include sound, impact, and emotion.

To practice hearing cycles and not just points-in-time, examine the first measure of Brian Ferneyhough’s Bone Alphabet (1992). An upper line consisting of 64th notes and dotted 32nd notes provides a duple template – cycle, if you will – against which is heard a syncopated line at 1012 of the duple speed. Two learning strategies come to mind: (1) create a composite grid on which all notes of both rhythmic strands can be found (see the discussion below of verticality) or (2) reconceive the lower line at a tempo which is 1012 of the basic speed. If the duple line is at MM54, then the 10:12 line is at 45. But neither a flattened composite that results from the first strategy, nor an unproblematized restatement of rhythm as tempo captures any of the relational tension between a pre-existing rhythmic cycle and an activation of that cycle through rhythmic events.

Try first hearing the repeated 10:12 phrase as the cycle of two dotted eighths. Set your metronome on 36, the speed of the dotted eighth, and then sing the line against it. As you do so, sense not just the speed of the rhythm but also where it feels grounded as downbeat within the cycle and where it is forwardly directed, like an upbeat. You will begin to hear not just speed but also the tension of a rhythm against its cycle. Now play only the first beats of the two cycles (eighths and dotted eighths) and hear how the asymmetrically overlaid cycles create temporal tension even before you add the details of the rhythms themselves. Try to retain this underlying sense of cycle when you play the entire rhythm in final performance.

Two interstitial topics: We use the term cycle because of its profound extra-musical associations. However, we’ll now refer to cycles when they are expressed in musical time as meter.

Second, the relationship between cycle and rhythm is fundamentally transactional: the cycle is fixed and repetitive while the rhythms within it are malleable and interchangeable. Cycles don’t change to accommodate rhythms; it’s the reverse. To explore, I am giving this chapter a rigid temporal architecture – five essays of exactly 1,000 words each – within which to express a fluid set of ideas. Weighing whether to discard something in order to add something else, I feel the tension of transaction and see that this project, like so much in life, is fundamentally rhythmic.

Anchoring and surging in polyrhythmic lines. Cyclical and non-cyclical polyrhythms. The beauties of guessing. A solution in behavior.

Every rhythm is a polyrhythm. Even a simple pulse is tied to an underlying sense of time and creates a polyrhythm of 1:1. With more interesting polyrhythmic lines – ones with higher denominators – the principle is the same. And, whether points of coincidence between rhythm and underlying pulse are proximate (1:1, in which every note aligns with its underlying pulse) or distant (10:12 which aligns every 12 beats), every polyrhythm functions both vertically and horizontally. Verticality anchors the polyrhythm to its underlying pulse; horizontality describes temporal surges within the rhythmic line and creates tension within the line.

When two rhythms occupy the same period of time – what I’ll call a cyclical polyrhythm since both rhythms are anchored to the same cycle – verticality is straightforward. In Example 4.1, the indication 10:12 means that twelve 64th notes (or three 16ths) are divided simultaneously into divisions of 12 (in the upper line) and 10 (in the lower line) subdivisions. To locate both 10:12 lines, multiply 10 and 12 to create a grid of 120. Then make two groupings of the 120 points: one every 10 units and another every 12, dividing the 120 into 12 and 10 equal parts, respectively. Every note in both rhythms can be found on this grid. Note that in a grid this large, I number the beats starting with zero, so that major subdivisions fall on 10, 20, 30, and so on, rather than 11, 21, 31.

4.1 Ferneyhough, Bone Alphabet, m. 1.

Used with kind permission of C. F. Peters Corp., Henmar Press

Some polyrhythmic lines are not multiple divisions of the same period. See Example 4.2, from Bone Alphabet for a polyrhythm that does not have identical beginning and ending points. By contrast to Example 4.1, this is a non-cyclical polyrhythm. The lower 6:7 line begins its second cycle on the eighth 64th note of the measure, while the upper line begins its second cycle on the ninth 64th note. Without aligned cycles, the simple grid described above does not work. One could design a very large grid in which each line relates to a third, unstated, line – in this case a 64th note subdivision. But the grid would be so large as to be unusable. In a further complication, the 4:3, a nested polyrhythm within the 6:7, adds another layer to the creation of a grid.

4.2 Bone Alphabet, m. 2.

Used with kind permission of C. F. Peters Corp., Henmar Press

The solutions here are effective but not exacting. First, I view the nested 4:3 rhythm not as a new rhythm, but as a surge in a preexisting one. Play the 6:7 rhythm as a straightforward sextuplet at tempo 46 (67 of 54), and then play a simple 4 over the last three notes. It should feel like an inflection of the sextuplet – a surge.

The next step is the correlation of the upper and lower lines – 6:7 and triplets. A global grid that reconciles the two is not possible, so I start by guessing (which is a vastly underutilized strategy in calculating rhythms!). I start with the more detailed line, in this case 6:7, and guess where the triplets might fall. A workable “first draft” of vertical alignments comes quickly. Then, I fine-tune by ear: I toggle mentally between the two lines as I practice the composite and test, by ear, whether the triplets sound accurate, and whether the 6:7 rhythm still has the right kind of surge.

Does guessing feel too irrational an approach for such a seemingly rational problem? For me, precisely how a guess is constructed is of critical importance. If my guess is based upon a secure sense of the rightness of each individual line, and if it accounts for meter in such a way that the tension with a basic pulse(s) (here 54 and 46) inheres in the polyrhythm, then it works very well. A guess like this is how we play the correct rhythm at the beginning of Beethoven’s Fifth Symphony, or how we might play the groove at the top of “Ticket to Ride.” But as Ronald Reagan said, “Trust and verify.” Make a recording or ask for help from a colleague to determine how well you have guessed.

Sometimes the ebbs and flows of nested polyrhythms feel more like variations of behavior than musical performance. Note Example 4.3, measure 7, also from Bone Alphabet. The 18 measure is now at tempo 46 – a metric modulation prepared by the 6:7 polyrhythm in measure 2. The outermost polyrhythm subdivides the measure into six parts with a speed of six times faster than the eighth note (thus MM276). The second polyrhythm layer takes the last four sextuple values and divides them into three, slowing their speed to MM207. Practically, this is not unmanageable: simply count two beats at MM276 and switch for three beats to 207. The quintuplets at MM207 produces a final tempo of MM1035 for the individual notes. Percussionists can test their hand speed by playing 32nd notes at tempo 130. That’s very close to the limits of playability, especially given that those notes cover the entirety of the seven-instrument set-up.

4.3 Bone Alphabet, m. 7.

Used with kind permission of C. F. Peters Corp., Henmar Press

But there are problems. My conversion of a triply nested polyrhythm into tempo tells you how fast to play, but it doesn’t help you coordinate the lower nested polyrhythm, a fascinating simultaneity of 8:7 and 7:8. And it runs contrary to my repeated insinuations that rhythm is always expressed by a tension between rhythm and an underlying pulse. And, with three nested polyrhythms, two of which increase the speed of the basic pulse, and a third (the sextuple marking) that slows the pulse, what reasonably should we consider as the basic pulse: 46? 207? 1035?

So, let’s try this as behavior. The sextuple subdivision – the one that slows the pulse – might feel grounded, pesante, restrained. Perhaps this is how you play the very first upper staff mfz note. By contrast, the fast quintuple subdivisions might feel mercurial, light, or capricious. Remember the panrhythmic spectrum. Rhythm is more than simple location in time; it’s behavior, transaction, and tension. A deft solution is always one in which rhythmic and metric parameters are supported in performance by involving texture, weight, and timbre.

Rephrasing Rhythm

At the 60 percent point of this essay, I wonder what I have offered to our conversation about rhythm. We’ve spoken of rhythm in transactional terms as the tension between an underlying pulse, cycle, or meter and its surface inflections. We touched on the panrhythmic spectrum, whereby rhythm is expressed in both temporal and non-temporal terms. We engaged in some problem solving with cyclical and non-cyclical polyrhythms, and discussed the temporal ebb and flow embedded within nested polyrhythms. But surely you could have figured these things out for yourself!

And, while this is important prefatory material, none of it touches the fundamental concept that rhythm is the musical product of the interface between mind, body, and sound. If that’s a definition so broad as to be useless, I take comfort from centuries of musical thinkers who have felt similarly, from Pythagoras, who expressed rhythm as universal and interdependent harmonies, to Karlheinz Stockhausen, whose seminal comments in Vier Kriterien der Elektronischen Musik (1973) speak to the relationship between rhythm and tone and annotate one of his most memorable musical moments: the dizzying seven-octave descent of a complex tone and its ultimate dissolution into rhythm in the extraordinary Kontakte (1959).

What is clear from Pythagoras, Stockhausen, and others is that rhythm is a physical state of musical material, standing in relation to tone, form, and texture just as ice is a physical state of water related to vapor and liquid water. For many mid-century modernists, searching not just for an answer but for the answer, treating rhythm as an inseparable component of an inclusive musical continuum was the brave new world. Their vision was a unified temporal field consisting of regular or irregular oscillations, perceived at the upper end of frequency as pitch and at the lower end as form. Rhythm falls in between. They proposed that serial technique, the great equalizer in the realm of pitch, could be applied to rhythm (as time-point technique in the language of East Coast Americans) and to form (as Moment Form in Stockhausen’s nomenclature.)

At its best, rhythmic serial technique provided the mechanics of permutation, in which variations were rooted in a single source code that created consistency. If all rhythms were related to the same set, then they were, by definition, related to each other. (If A=B and B=C, then A=C.) For this listener, this creates a powerful sense of organicity and inevitability in Olivier Messiaen’s modal approach to rhythm, but in lesser composers often produces an unappealing uniformity. As tantalizing as a grand unified theory of rhythm might seem, it often fails for the same reason it’s not possible to swim in steam. The fundamental material may be the same, but a change of state necessitates a change in usage.

So, we are left with a koan: rhythm consists of the same stuff as pitch and form, and comprises with them the pitch-form continuum. But boundaries of rhythm are delimited by perception, which firmly distinguishes it from pitch and form. Take a pitch and gradually slow the speed of the oscillations until we hear discrete impulses. We call this rhythm. And, getting slower yet, soon we can no longer tell whether or not the oscillations are equal. We call this form. On the fast end of the spectrum, our rhythmic sense is limited by the speed of cognition and on the slower end by the capacity of memory.

If rhythm is a property of human perception, then Pythagoras’s search for the cosmic origins of music was partly in vain. This also means that our delight at Gérard Grisey’s musical rendering of the regular rhythms of distant pulsars in Le noir de l’étoile (1990) celebrates not the universality of pulse in the cosmos but the miraculous ability of a human listener to track patterns in time. It turns out that the music of the spheres is really music of the ears.

Most rhythmic patterns fall on the slow side of the spectrum between pitch and form. Even the fastest moments in virtuosic piece like Bone Alphabet fall short of the range of audible tone. (A stroke speed of 1035, on the border of playability, is a meager 17.25 cycles per second, not usually audible to humans.) On the slow end of the spectrum, we can track a regular pulse up to a period of around eight seconds (in an utterly unscientific self-study). Here, questions of expectation and foreshadowing, repetition and reformulation emerge. This is the poetry of rhythm: Have we heard this before? When do we expect the next impulse? And, depending on whether this satisfies or derails expectations, what emotional or artistic impact results?

So, rhythm – or at least our perception of it – is ultimately a quality of memory. This works in the aesthetic as well as psycho-acoustic realm. The reason a reassigned downbeat in a Beethoven Scherzo (see practically every symphony!) surprises us lies in memory. We recognize and remember the downbeat and then are surprised when it is reassigned. (Beethoven’s method adds stress to a weak beat and when he is sure we perceive that as the downbeat, he shows us the true downbeat.) If we did not remember – and consequently develop an expectation – we could not be surprised.

The language of memory – corporeality, aurality, and the creation of and deviation from pattern – is also the language of rhythm. So “rephrasing” rhythm as an aspect of memory leads us gratefully away from the calculations of nested polyrhythms and the anxiety that even very fine musicians feel when faced with thorny non-cyclical polyrhythms. Even though I have made thousands of such calculations and still see the value of accurate solutions to complicated rhythmic problems, I argue that rephrasing rhythm as memory aligns us, as performers, with the mindset of listeners. It informs the late pieces of Morton Feldman, where forgetting is poignant, and governs the vocal/instrumental works of Roger Reynolds and Georges Aperghis. Each contains “complex rhythms,” but viewing rhythm as memory, each offers music on a human scale and with human rewards.

Rhythm as Memory

In the last decade of his life, Morton Feldman composed the longest and most evocative pieces of his career. Musicians are advised to know all three of the late trios for flutes, piano and percussion, but to this performer his For Philip Guston (1984) is the most beautiful. On the level of rhythm, it is also his most complex. Measures occupy equal physical space, but because the meters are different in each part, vertical alignment rarely equals simultaneity. Without significant calculations, the performers frequently do not know with what notes in the other parts they are to align.

4.4 Feldman, opening of For Philip Guston.

Morton Feldman For Philip Guston|für Flöte, Schlagzeug und Klavier.

© Copyright 1985 by Universal Edition (London) Ltd., London/UE17967, used with kind permission

In the age-old trade-off between reading and memory – think of the amount of music and literature committed to memory in the Ancient World due to the lack of written texts – if you cannot track points of alignment by reading the score, you must necessarily remember them. Thus, maintaining the correct rhythmic relationship with your partners in Guston means that tempo must be internalized (memorized as an inner click-track) and rhythm converted scrupulously to physicality (memorized as corporeal impulse). For the percussionist, the 332 note has a shorter stroke than the 38. We know that stroke length varies in response to dynamics and the surface quality of the instrument to be played. But in For Philip Guston, where the dynamics are fixed and low, and the instruments, aside from the chimes, do not require radically different stroke types, the corporeal memory of striking is a nearly exact map of rhythm.

In Michael Gordon’s XY (1998), also from the percussion repertory, corporeal memory works similarly in the service of rhythmic accuracy. If you are having trouble with your 6:5, try holding the stick playing quintuplets at a height that is 65 greater than the sextuplet stick. Then play with the same stroke speed. Other instrumentalists and vocalists will devise their own versions of these mechanical strategies.

On the level of form, we engage a different sort of rhythmic memory. Major points of arrival in tonal forms are so far apart that composers tag them with memorable characteristics – a burst of instrumental color or a memorable theme. However, in For Philip Guston, such markers are largely absent, and with a duration of nearly five hours, even the performers can forget where they are in the form. In Feldman, the poetry of remembering is contradictory: on the local level the execution of rhythm is a matter of remembering the speeds of notes, but on a global level the received sense of form is often the product of forgetting.

Note the “irrational meters” in Josh Levine’s Four Places, Many More Times (2011). By irrational, we refer to measures that contain subdivisions that do not add up exactly to the denominator of the time signature. Measure 134 in Example 4.5 is four quintuplets in length. Measure 138 is 23 of a half note. Ferneyhough and many others would name these measures as a 410 measure and 26 measure, respectively. (There are ten eighth-note quintuplets in a measure; four of them make a 410 measure. A quarter-note triplet is a “sixth note” – thus a 26 measure.) Conducting this difficult passage, I find that subdividing is less useful than simply remembering the speed of the duple, triple, and quintuple subdivisions and accessing them through corporeal memory.

4.5 Levine, from Four Places, Many More Times.

Used with kind permission of the composer, Josh Levine

Here and There (2019), a Roger Reynolds composition for percussion solo with text by Samuel Beckett, also relies on memory to stabilize rhythm. The work consists of three types of material: monologues for voice alone, arias where the performer speaks and plays simultaneously, and purely instrumental episodes. Reynolds doesn’t represent Beckett’s text in notated rhythms or meters – here the cadence of speaking provides the rhythmic template. Conversely, the instrumental passages are written in meter, and in these moments Reynolds ingeniously provides light cursive writing below rhythms. These words (unspoken in performance) are drawn from the Beckett text, and provide the rhythmic architecture of the instrumental passages. In order to remember the rhythms, just remember the text! (See Example 4.6.)

4.6 Reynolds, from Here and There.

Used with kind permission of C. F. Peters Corp., Henmar Press

An ideal performance of Here and There melds words with music in a fertile in-between world (zwischen immer und nie, “between always and never,” to use the words of novelist André Aciman). So rhythmic sense, like everything in the piece, is rooted in the memory of both words and sounds. In practice, I found I could quickly solidify the rhythm of the text, which then served as a platform not just for consistency of delivery but also as a rhythmic template for the rest of the piece. Text became the “memory palace” of Here and There: a place where the linkage between spoken word and instrumental sounds is stabilized by a shared temporal memory.

Here and There was a recent confirmation, but we intuit that many rhythmic problems are really memory problems. You can play 16th notes at MM60 without the need for a calculator because you have done so repeatedly. You remember not only how fast they are, but also the quality of tension between them and the underlying pulse. You might have difficulty with 13:5, because you have comparatively little experience with that rhythm. You have not yet memorized it. In its many guises, memory is an organizing principle: in an ethical life (see St. Thomas) or as the way we know what to buy at the store (see Hermann Ebbinghaus’s studies in the degradation of short term-memory). But memory achieves a kind of apex moment in the execution of rhythm in which corporeal, aural, mechanical, and even visual modes of memory collude.

Temporal tension directs the utility of both memory and rhythmic sense toward the future. Memory is not about the past any more than rhythm is about the present. Both require that we metabolize past and present time to give direction to the future. Whether that future is remembering a wedding toast or the impetus to lift a mallet (or bow) to play a note in a nested quintuplet is merely a matter of application. The goal in both is felicity of expression.

5 A Different Kind of Virtuosity

Russell Hartenberger

My first encounter with what has come to be known as minimalist music was at a rehearsal of Steve Reich’s iconic composition, Drumming, in his downtown New York City loft in the early spring of 1971. Reich was still composing the piece and was teaching it to the assembled musicians by rote. Two pianists, a woodwind player, and Reich – the only one of the four with percussion training – were playing on a line of eight stand-mounted bongo drums and striking them with wooden timbale sticks. Normally, drummers play on one pair of tightly tuned bongos with their hands and they hold the drums between their knees, so I was surprised to see four pairs of bongos, tuned to precise pitches, being played in this manner. However, my surprise turned to curiosity, and even a touch of bewilderment, when I watched and heard what they were playing on these drums.

Two players built up a rhythmic pattern one attack at a time, and once a full, coherent pattern was established, chaos ensued. Out of the confusion of seemingly wild and random stick attacks an intriguing composite pattern emerged. When this new rhythmic combination was established, a third player entered playing patterns that created melodic fragments on four pitched drums. Reich, who was the only person not playing bongos, began singing similar melodic phrases into a microphone placed near the drums using vocables that imitated the attack sound of sticks on bongos.

I was intrigued by this music, to say the least, and agreed to join the nascent Steve Reich and Musicians ensemble. At this point in my musical life I had completed two degrees in classical percussion performance and was enrolled in the Ph.D. program in World Music at Wesleyan University. My classical music training had not prepared me for the new concepts that I began to experience in the Reich ensemble, but I soon noticed similarities between Reich’s musical ideas and the African, Indian, and Indonesian music I was studying at Wesleyan. I realized that in order to perform this new musical style successfully, I would have to merge my facility in classical music performance with the skills that I was beginning to develop in my world music lessons; in fact, my definition of virtuosity evolved to include proficiency in these techniques. As a result, my view of virtuosic performance in pulse-based music grew to incorporate pulse, time feel/inner pulse, repetition, endurance, concentration, metric/perceptual ambiguity, rhythmic expressivity, and an enhanced sense of ensemble. In this chapter, I will describe how each of these components is essential in developing virtuosity in the performance of pulse-based music.

Pulse

An element that has become a distinctive structural component in Western classical music in the past fifty years is pulse – specifically, pulse independent of meter. Terry Riley’s In C (1964), the composition that jump-started the minimalist movement, is performed with a pianist pulsing on the instrument’s top two Cs throughout the piece. In C is made up of fifty-three modules of varying length through which performers move at their own pace. Robert Carl, in his book Terry Riley’s In C, explains that the “major stumbling block” in early rehearsals for the piece “was rhythm; as soon as the divergence of modules began, it became difficult to maintain a common tempo or metric reference point, and the work fell apart.” Steve Reich was a participant in these rehearsals and Carl quotes Reich as saying, “once a drummer always a drummer, I said we kind of need a drummer here, but since drums would be inappropriate, what about use the pianos; so Jeanie [Brechan] played some high Cs just to keep us together, and Terry said, ‘Let’s give it a try’ or something like that, and we tried it and voila everyone was together.” Carl concludes, “And so the Pulse was born.”1

Reich began including a pulse in his own compositions, and other composers followed suit. To delineate the pulse, Reich used various instruments: maracas in Four Organs (1970) and later in Tehillim (1981); a pulse clave in Music for Pieces of Wood (1973); combinations of pulse played on marimbas, xylophones, pianos, strings, winds, voices, and maracas in Music for 18 Musicians (1976); and even the handles of percussion mallets tapped together in portions of Sextet (1984) and The Desert Music (1984). Reich continues to employ a pulse in his compositions, both explicitly and implicitly, including the appropriately named Pulse written in 2015. Two other prominent composers who began writing pulse-based music in the 1960s and 1970s are Philip Glass, who used repeated arpeggiated figures and an additive rhythmic process in his early compositions to outline a steady pulse, and John Adams, who used pulsing, repetitive cell structures in his solo piano work Phrygian Gates (1977) and placed the pulse in the wood block in his orchestral work Short Ride on a Fast Machine (1986).

Classical musicians are not generally trained to play strict pulse-based music. Orchestral players, for example, develop an association with pulse from the historical development of the repertoire and learn to adjust regular attacks in music according to conductors’ motions, the amorphous attacks of entire sections of strings, the varied articulation of wind and brass players, and the more precise attacks of harp, piano, and percussion instruments. Although In C was written for a large chamber ensemble rather than a symphony orchestra, the performers face the same problem of creating a unified approach to attack placement and ensemble coherence. However, musicians who perform In C have the additional issue of achieving this amid a web of overlapping rhythmic patterns. The person who plays the upper two Cs on the piano keyboard throughout In C must keep the pulse as steady as possible while being swayed by other players as they wind their way through the work’s fifty-three modules. This responsibility is not unlike that of a jazz drummer and bass player working together to outline a rhythmic grid while soloists improvise freely but with a constant sense of the pulse. The pianist in the Riley work has no one else with whom to maintain a steady pulse and must rely on some internal mechanism to keep playing regularly for an hour or more.

Time Feel/Inner Pulse

At a rehearsal of Steve Reich’s Six Pianos (1973) for a 2007 concert in Toronto, pianist Gregory Oh, who was playing the piece for the first time, remarked to percussionist Bob Becker, who had played it many times with the Reich ensemble, “How can you play with such steadiness but appear so motionless?” Becker replied, “Because I have a sense of inner pulse.” Inner pulse is something we can all develop and is an ability that is essential in the performance of pulse-based music. Becker was trained as a classical musician and like most percussionists has played forms of pulse-based music in various non-orchestral formats. His sense of inner pulse was heightened, however, by his experiences with West African drumming and North Indian tabla drumming.

West African drumming ensemble music is structured around a cyclic rhythmic pattern played on an iron bell. The twelve-unit cycle of the most commonly used bell patterns can be felt with a steady pulse of 1, 2, 3, 4, 6, or 12, although the drummers, singers, and dancers generally perform with a pulse sense of four within the twelve-unit cycle in order to stay together and have a unified feel. Ghanaian master drummer Abraham Adzenyah, the West African drum teacher at Wesleyan with whom both Becker and I studied, used the terms hidden beat and invisible conductor to describe the performance subtleties that are required to play this music correctly. The hidden beat is the inner pulse that is felt communally by all the performers, and the invisible conductor (one of my favorite descriptive phrases) is the time feel sense that is generated in the music.

Tabla drummers of North India, as well as their counterparts in South India, use an additive, or sometimes subtractive, system of rhythmic construction in creating complex mathematical rhythmic patterns. These rhythms are played within a tala cycle that establishes a structural framework for the music. The inner pulse for these drummers is an inviolable sense of the tala cycle, and their rhythms are placed within and across the cycles. Philip Glass first experienced Indian music when he worked with Ravi Shankar on a soundtrack for the film Chappaqua in Paris in the mid-1960s; he later visited India and studied with tabla master Alla Rakha. This introduction to a new way to view rhythm had a significant impact on Glass’s early musical output. Although he did not adopt the concept of tala, Glass did incorporate the idea of cyclic structure in his music. His systematic use of additive rhythms is most clearly seen in 1+1 (1968) and Two Pages (1969). In the score of 1+1, Glass wrote two rhythmic units: (a) two sixteenth notes and an eighth note, and (b) an eighth note. In his instructions in the score, he stated that “1+1 is realized by combining the above two units in continuous, regular arithmetic progressions.” In Two Pages, Glass wrote two, three, and four eighth-note groupings with varying numbers of repeats. Wes York, in an analysis of Two Pages in his essay “Form and Process” (1981), writes, “Ultimately, the structure of Two Pages can be understood as first, the exposition and juxtaposition of two sets of opposing processes, and then, the coordination of all shapes which both emerge from, and reflect back on, those processes. … With respect to the various processes at work one finds two types which are responsible for creating motion and change within the composition. One of these is a subtractive process; the other is additive.”2

In Les Moutons de Panurge (1969), composer Frederic Rzewski uses the additive/subtractive process to the extreme. The piece begins with any number of musicians playing the notated score in a sequence of 1, 1-2, 1-2-3, 1-2-3-4, and so on until reaching note 65. They repeat the complete cycle of 65 notes, then begin subtracting notes from the beginning: 2 through 65, 3 through 65, 4 through 65, and so on until reaching 65 alone. Even though Rzewski allows for errors on the part of the musicians in sequencing the notes, he is clear that a pulse must be established and maintained by all players.

In the works of Glass and Rzewski described above, there is no explicit pulse played by any instrument and the success of the performance depends on a sense of pulse felt by the performers. While expertise in non-Western music is not an imperative for Western musicians attempting to play these compositions, a familiarity with the additive process provides a level of comfort in keeping a steady pulse while navigating the requirements of the scores. As with Becker in his performance of Six Pianos, experience with non-Western music and the sense of inner pulse in the midst of a quilt-work of pulse options helps Western-trained classical musicians play pulse-based music with confidence.

Jazz is also primarily a pulse-based music, and great jazz musicians have this same sense and control of inner pulse. Drummer Kenny Clarke was influential in creating the bebop form of jazz. He changed the style of jazz drumming by moving the basic cymbal pattern from the hi-hat cymbal to the ride cymbal, thus freeing his left hand to play fills on the snare drum. He then played syncopated accents on the bass drum instead of delineating each beat of the meter, thereby creating a flow to the music. Prior to Clarke’s innovations, jazz drummers played the bass drum on every beat so the other musicians could hear where the time was in each measure. Clarke’s random bass drum bombs, as they were called, and his ride cymbal rhythm, obscured the sense of barline and created a feeling of longer phrases stretching over several measures at a time. Clarke was able to create this flow by keeping a sense of time in his head when he played. In an interview with Helen Oakley Dance in 1977, Clarke remarked that musicians in his band complained and said, “Kenny keeps breakin’ up the time. Why doesn’t he keep four beats on the bass drum?” Clarke replied, “If you’re playing, the tempo should be in your head. Don’t depend on me. Depend on yourself. Because if you’re playing music, the tempo you’re playing in is in your head.”3

Steve Reich has often said he admired Clarke’s time feel and that he tries to achieve this same feel in his own music. In a conversation with me in 2003, Reich explained, “That feeling of time and time sense is getting it ‘right’ which might show up on an oscilloscope as ‘slightly wrong.’ A lot of very good players lack that magic because they are very concentrated on being right … some people have that quality of magic in their playing and others just don’t – but you know it when you hear it.”

Time feel is an essential ingredient in the performance and appreciation of jazz as well as in many kinds of pulse-based music. A performer’s ability to control attack placement in relation to a steady pulse becomes an expressive element in the music. When an ensemble has a unified time feel, it can create the lilting sense of “magic time” that Reich admires.

Repetition

In her book titled On Repeat: How Music Plays the Mind, Elizabeth Hellmuth Margulis writes, “In a famous essay, [Gilles] Deleuze references [David] Hume to the effect that ‘repetition changes nothing in the object repeated, but does change something in the mind which contemplates it.’ Deleuze identifies repetition as a phenomenon well-suited to exposing the elements that the mechanisms of perception bring to an experience over and above the elements that literally exist in the world.” Margulis continues, “Since two iterations are never precisely repetitions in their deepest essence – they’re composed of different atoms or occur at different time points – it is perception that abstracts both a relationship of shared identity and a relationship of difference. At a minimum, a repeated element will sound different from its initial presentation by virtue of coming later and having been heard before.”4

In the music of Adams, Glass, Reich, Riley, and Rzewski, repeating the same pattern for a long period of time can be perceptually interesting for the player, but it is also fraught with challenges. For the performer, nothing should change in the pattern that is to be repeated because it is the musician’s job to present a soundscape that allows something to change in the mind of the listener who contemplates the music. The performer is the facilitator, and as such must find a way to control muscle memory in order to articulate each repetition similarly, yet keep mental control.

In West African drumming, musicians who play the drum and bell parts that form the accompaniment to the master drum, singers, and dancers repeat their patterns for extremely long stretches of time. At a harvest festival celebration in a Ga village near Accra during the summer I spent in Ghana in 1971, I witnessed two bell players playing the same interlocking pattern throughout the night (although the two original players were relieved by others as the hours passed). But even under normal performance circumstances, a Ghanaian drummer might repeat the same pattern for an hour or more.

Abraham Adzenyah instructs his students to move their bodies in order to transmit the right feeling while playing repeated patterns on the drums. He also teaches them to hear checkpoints in the music so they know how to recover if their mental control falters and they stray from their patterns. These same instructions apply to the performance of repetitive, pulse-based Western music. The performers must first repeat the assigned pattern until muscle memory begins to take over. Body movements, in this case, cannot be excessive, but subtle motions can be utilized to facilitate muscle memory. Once the pattern is secure, the players can expand their sonic perception to hear connections between the patterns they are playing and other patterns that they hear; in particular, the players can locate points of coincidence in the composite pattern relationships. These checkpoints add security by indicating where to get back on track if a mistake is made.

Early minimalist music moves from event to event, so performers are relieved of the necessity of counting numbers of repeats; they have to know the overall structure of the piece and then pay attention as the process unfolds. This frees their minds from the restriction of keeping track of the number of repeats and allows them to become immersed in the musical details of the performance style. The performers learn to play their patterns as evenly as possible to enable listeners to hear a seemingly unchanging stasis in different ways. In other words, the listeners’ minds are allowed, and in fact encouraged, to hear something change.

Endurance

This repetition of the same pattern for several minutes creates an endurance problem for the performers. String players are accustomed to playing almost continuously throughout an hour-long symphony, chamber piece, or solo work. However, in this conventional kind of music they vary their musical patterns, dynamics, articulation, and other musical expressions. Shem Guibbory, violinist in the early rehearsals and performances of Music for 18 Musicians, said:

The violin repertoire is nothing if not virtuosic. It’s virtuosic in terms of quantity of notes, the variety of expression, vastness of registers - full four octaves - articulation with the bow, double stops, scores of issues. But with [Reich’s] music … the requirements were for repetition. …To make my violin part happen within the group … and with the power that you guys in the percussion could deliver, I had to develop an incredible efficiency of motion and the ability to let my spirit sustain it throughout the entire work.5

Audrey Wright, a violinist commenting on her preparation to play Music for 18 Musicians for the first time, said she did not realize “how physically and psychologically demanding it would be, nor how rewarding. With my violinist mentality, I originally scanned the score thinking, ‘OK, this is long. But playing some sustained fourths here, a few pages of repeated eighth notes there, and catching a bass clarinet cue once in a while is a piece of cake compared to a Mahler symphony.’ … As it turns out, it is not a piece of cake.” Wright continued:

One of the greatest challenges as a violinist playing Music for 18 Musicians is the physical endurance required of playing repeated eighth notes, ostinato figures, and sustained double-stops. The repeated eighth notes, which can last for up to roughly ten minutes, are like a marathon for the bow arm. These eighth notes are written as double-stops rather than single notes, so it is not only a challenge to produce the same clear tone on down- and up-bows when playing up to tempo, but to keep that tone consistent throughout the dynamic crescendos and decrescendos which occur over and over like waves. In order to work up bow stamina and consistency for this, I practiced scales and double-stop scales with this stroke for the week leading up to the first rehearsal with an overall goal of continually releasing tension from my right shoulder, elbow, and hand. In this way, I was able to build up the strength I’d need in my bow arm to play this music without muscling through and forcing the sound.6

Other string players, wind players, and vocalists have commented on the need to prepare for the physical demands of Reich’s music. Even the seemingly simple maraca parts in Reich’s compositions create endurance issues. In Four Organs, Music for 18 Musicians, and Tehillim, the maraca player is the pulse-keeper in the ensemble for long periods of time and must play strongly with the maracas placed high in the air in order to produce the desired volume. Gary Schall, who was the regular maraca player in the Reich ensemble, said that prior to performances of these pieces he practiced maracas thirty minutes a day for a month in order to develop stamina.

In playing Reich’s music and West African drumming, I use something I call energy-shifting, a technique I developed from practicing yoga postures and doing breathing exercises. I find that it is necessary to have a certain amount of tension while playing in order to produce a strong, centered sound, especially when I have to generate this sound with consistency over long periods of time. In order to prevent tension buildup, I keep up the necessary energy by shifting the tension around in my body. For example, if I feel tension mounting in my wrists, I concentrate on releasing the tightness by focusing my breathing in that area and allowing my arms or another part of my body to absorb the tension. By practicing this technique, I developed the ability to energy-shift quickly and easily.

Musicians often make small changes in technique to help their endurance. Stephanie Chua, a pianist in a 2018 performance of Reich’s Six Pianos in Toronto, told me that to help with relaxation she uses fingerings that force her to change positions in the middle of a pattern. Sometimes it is a very small physical change like Stephanie made that enables performers to endure long periods of repetition of the same pattern and thus heighten their ability to concentrate on musicality.

Concentration

Repeating a pattern for long periods of time requires physical adaptation, but the hypnotic state it induces in listeners creates a concentration problem for performers. I often find myself succumbing to a trance-like state while playing In C, Music for 18 Musicians, Drumming, and other pulse-based compositions. In order to enjoy this sensation and still maintain my sense of equilibrium, I use techniques I learned while playing in West African drum ensembles. First, I determine the placement of the general “one” of the ensemble or the beginning of the most audible phrase, and I place my part accordingly. I then relate the pattern I am playing to other patterns that are easily discernible. Finally, I find checkpoints in the music that are easy to detect so I can return to my proper rhythmic alignment if I happen to stray off course. When these elements are in place, I feel secure enough to allow my mind to enjoy the mesmerizing aspects of the music. I rely on my muscle memory to perform the many repetitions of my pattern, but I periodically check in mentally to make sure I am playing correctly.

Commenting on his concerns in performing Reich’s Six Pianos, Gregory Oh said, “The challenge for the performer is managing two states of being: one must be conscious of the notes, the precision of the notes and the time and pacing and fitting in, while simultaneously suppressing any thoughts of control or ego. Rather than ‘Just Do It,’ it’s more like ‘Just Let It Happen.’ The biggest danger in this piece is getting in one’s own way.”7

To keep from “getting in one’s own way,” some performers may find that practices such as yoga, tai chi, or meditation are helpful in developing the ability to concentrate. These disciplines train their practitioners to block out unnecessary thoughts and focus on breathing and staying grounded. However, it is important for the performers to maintain awareness of the form of the compositions. It is incumbent on the musicians to hear and understand the progression of events in order to move from one section to the next in proper sequence. All these factors are important in maintaining concentration when performing music that frequently shifts the player’s sense of perception.

Metric/Perceptual Ambiguity

In his performing directions that accompany the score to In C, Riley states, “The group should aim to merge into a unison at least once or twice during the performance. At the same time, if the players seem to be consistently too much in the same alignment of a pattern, they should try shifting their alignment by an eighth note or quarter note with what’s going on in the rest of the ensemble.” This shift of eighth-note alignment occurs early in the piece in modules 4 and 5. Each of the modules consists of three ascending eighth notes, E, F, G, but module 4 is an eighth rest followed by the three eighth notes and module 5 begins with the three eighth notes followed by an eighth rest. In his instructions, Riley further encourages this ambiguity when he says, “Each pattern can be played in unison or canonically in any alignment with itself or with its neighboring patterns. One of the joys of IN C is the interaction of the players in polyrhythmic combinations that spontaneously arise between patterns. Some quite fantastic shapes will arise and disintegrate as the group moves through the piece when it is properly played.” Riley’s intentional pulse displacement allows the listener (and performer) to hear the same composite patterns differently.

A rhythm by itself can be intriguing, but when it is heard against a pulse or another rhythm it can become metrically ambiguous. The metric implications of standard notation in Western music, such as strong impulse at the beginning of a measure, are replaced by options for the listener. Rhythmic groupings and pulse placement are created by the listener, allowing the same music to be heard differently as the listener’s mind changes its perceptual vantage point. David Locke, author of the chapter on rhythm in West African music in this volume, coined the term Gestalt flip to describe the sensation of hearing a rhythmic pattern one way in relation to a pulse and then changing perception of the same pattern to hear it a new way in relation to a different pulse.

Reich’s use of canons played on like instruments creating interlocking composite patterns makes it problematic for the musicians to tell who is playing the parts that they hear. This is especially true in Drumming when players facing each other across bongo drums see their sticks moving in conjunction with their playing partner but can have trouble determining whose sticks are making which sounds. Leslie Tilley, in her chapter on Balinese gamelan in this book, describes a similar effect in Balinese gamelan playing when she writes, “Many non-Balinese musicians are first drawn to Balinese music … because of a distinctly Balinese brand of interlocking, where a melody or rhythm is seamlessly shared between two or more performers such that the resultant composite is a single, smooth strand of music. One often cannot discern which musician has performed which note in an interlocking passage; the perceptual effect is of a group of musicians each playing the entire passage in perfect synchrony, much faster than humanly possible.”8

In the amadinda xylophone music of Uganda, musicians play separate patterns on either side of a twelve-note xylophone, alternating their attacks. To achieve the virtuosity required, both players feel their pulses as downbeats. In Music for 18 Musicians, an on-beat and off-beat pulse is played for most of the composition by pianos, marimbas, and xylophones. Philip Bush, who played in the Reich ensemble for many performances of Music for 18 Musicians, was the pianist who played the off-beat pulse. He told me that in order to play the pulse consistently, he had to (Gestalt) flip his sense of pulse so that he felt that his off-beat pulse was on the beat. Consequently, Bush heard the entire composition in a parallel universe from most other people.

In Indian music, rhythmic sequences are overlaid across tala cycles and inserted within tala cycles. The juxtaposition of the irregularity of these rhythms against the regularity of a cycle creates metric ambiguity. Glass uses the process of additive rhythms to create an unexpected conflict with the usual regularity of expectation in Western music. The metric intrigue in all these genres creates perceptual ambiguity for the listener that is first unsettling, but then becomes aesthetically satisfying.

Rhythmic Expressivity

In his book Repeating Ourselves: American Minimal Music as Cultural Practice, Robert Fink writes that in the early days of minimalism, critics such as Donal Henahan and Harold Schonberg (in the New York Times), Samuel Lipman, and Christopher Lasch described the music as a kind of social pathology.9 Pwyll ap Siôn, in his article “Moving Forward, Looking Back,” notes that “German critics also seized upon other aspects [of Reich’s music], including what they identified as the music’s mimetic representation of industrial production and machine-like processes. Worse still, some went further and accused Reich of creating a form of musical fascism that ‘suppressed social criticism and manipulated the listener’s emotions.’”10

One of the implications of this criticism was that minimalist composers were dictating every aspect of the music and that the performers had no room for expression or creativity. The reality within the composer-based ensembles of Riley, Glass, Reich, and others was that the performers not only had input into the composition of the music, but they also established the expressiveness and feel of this pulse-based music.

Riley spent much of his early career as a pianist playing ragtime, blues, and jazz. He also visited Morocco where he heard traditional maqamat music. All these types of music use improvisation and Riley’s music allows for the sense of improvisation that characterizes these various musical styles. In six paragraphs of instructions that accompany In C, Riley outlines ways the performers can improvise their way through the composition.

Glass’s early ensemble was made up primarily of electric keyboards, woodwinds, strings, and voices. The nature of the attack of these instruments and voices gives Glass’s music a less precise rhythmic feel, but still imbues the music with a time feel that is different from that of more traditional classical music.

Reich taught Drumming to the percussionists in his ensemble by rote, demonstrating each part until the players learned the patterns and where the patterns fit into the ensemble. Once the percussionists, all of whom were classically trained, learned the music and had an idea of Reich’s stylistic approach, they developed a rhythmic nuance, time feel, and sense of centered sound that became the identifying characteristic of Reich’s music. This rhythmic expression had its basis in classical training but added elements from other pulse-based music including music of Africa, India, Indonesia, and jazz. As Reich’s compositional output increased to include strings, winds, and additional vocalists, the other musicians imitated the percussionists’ sound and feel. A detailed description of the early rehearsals and development of the performance style for Drumming and other early Reich works can be found in my book Performance Practice in the Music of Steve Reich.11

Attack placement is an essential element in giving feeling to any music, but it is of critical importance in playing rhythms expressively in pulse-based music. Musicians often talk about playing behind the beat, in the middle of the beat, or ahead of the beat, and some musicians instinctively place their attacks in one of these ways, giving their performance a certain style or character. Other musicians are able to alter their attack placements according to the musical expression they wish to convey. Regardless of which of these ways the interpretation of attack is deployed by musicians who are performing pulse-based music, the aim should be to have a unified sense of feel from the entire ensemble.

Sense of Ensemble

When the minimalist composers’ ensembles were created, they rehearsed compositions for long periods of time before the premieres. Rehearsals of Drumming, for example, took place weekly for nearly a year prior to the first performances. The lengthy rehearsal periods gave the musicians an opportunity to develop the techniques that resulted in a distinctive sound with a unified time feel in the music. The structure of the music and the inner communication that was required to perform it helped create a bonding experience in the ensembles and create a sense of purpose and community. Steven Schick, in his book The Percussionist’s Art, says:

Beyond pointing to a model of cultural coexistence, Drumming also demonstrates a new way of musical interaction within a chamber ensemble. This is simple to describe: no one leads Drumming. In fact, nowhere else in the entire chamber music repertoire for percussion is there an example of such a mutually dependent and communally reinforced musical structure. Stewardship of the piece is a group concern, progressing as one player after another completes his or her specific task(s) from building up to phasing to playing resultant patterns. … The great parable of this music – that the health and vitality of the whole is tied to the health and vitality of the smallest of its parts – requires the presence of human beings who need each other and who make space for each other.12

Robert Carl describes the sense of ensemble in Terry Riley’s composition this way:

In C provides a delicate balance between the individual and the group. … It demands of its players a high degree of individual responsibility. No matter how many performers participate, they must listen carefully to one another for the performance to have any chance of success. … The music is the result of a group decision, but each entity retains its separate character and autonomy. … Indeed, one can look at the piece as an exercise in anarchy, though of the most benign and constructive form. … But In C is also very much a product of community. That act of listening implies that all the players devote themselves to the greater good of the piece, that they not only listen to their interaction with immediate neighbors but also hear the influence of their actions on the total work. One must listen out to the edges of the piece as one plays and adjust decision-making to the amorphous but real will of the collective.13

The ensembles that were created by Riley, Reich, and Glass in the 1960s and 1970s brought a new approach to the performance of contemporary music and resulted in tightly knit groups that were well rehearsed. The composers and musicians combined to develop a special sound and feel to the compositions that allowed the musicians to have a sense of ownership as important contributors to the final product. The sense of community that Schick and Carl mention helped the performers and composers establish the techniques that formed the basis of the new virtuosity.

Conclusion

The techniques I have described are not, individually, exclusive to the performance of pulse-based music, but collectively they form a performance practice that requires a different kind of virtuosity than has been a part of the traditional training of a Western classical musician. Now, more than fifty years since the premiere of In C, second and third generations of musicians are assimilating these techniques into their musical development, and gradually this different kind of virtuosity will become standard practice for musicians who have an interest in rhythm and a desire to play pulse-based music.

6 Conducting Rhythm

David Robertson

The great conducting teacher Hans Swarowsky told his students at Vienna’s Academy of Music and Performing Arts that a conductor has only three jobs: start the piece, make any changes within it, and finish it. Indeed, certain elements of time keeping would seem to render that task in conducting rather simple.

At its most basic level, a conductor’s gestures convey, through a repeated, patterned series of pulses with the hands and arms, a visual counterpart to the rhythmic structure of the score that the musicians hear as they play. While this information can be helpful to those who are playing, it is often crucial to those in a group who may be looking at bars of rest, as it helps them confirm the passage of musical time, including when they may be called upon to rejoin the musical discourse.

Here is the question anyone who sees a conductor is likely to ask: How do the gestures of a conductor help musicians stay in place? In the vast majority of cases, musicians usually have in front of their eyes only their own lines to play in a piece for ensemble or orchestra. Thus, at the first reading of a piece, players start creating in their minds a virtual score that resides in their memory. This virtual score is constructed from what they hear in relation to what they play and to a conductor’s clear, consistent indication of the beats in the piece. This process, usually done in rehearsal, helps the players assess securely whether the other parts they hear are before, after, or coincide with what they play. For the conductor, determining what will be most important in gestural terms has to do with allowing the players to make the best, most accurate virtual score in their minds. The better musicians understand what their colleagues are doing, the more flexibility and interpretative liberty they obtain.

In scores before the early 1800s, the rhythmic organization for most Western concert music did not really require such a time beater; performances could be led from the first violin chair or from the keyboard. While some notable exceptions existed, Beethoven’s Fifth Symphony being a good example, to be played properly, most compositions had few major changes that would need a conductor.

However, early into the new century composers such as Berlioz, Schumann, and Mendelssohn were writing works that enjoyed the possibilities in expanding the size of larger instrumental groups, combined with a less strictly organized rhythmic pulse. As a well-known example, performing the opening measures of Berlioz’s Symphonie Fantastique without a conductor’s help in the shaping of how all the players interpret time’s flow would be a major challenge. It is certainly possible, but a good deal of rehearsal will be consumed in an activity that is vastly simpler with a good conductor. And this is before any consideration of the musicians being in an acoustic where transparent hearing is not possible.

Such demanding scores led to the rise of conducting as a profession during the nineteenth century. Romantic composers’ interests in explorations of varying moods, often within a single piece of music, meant that a conductor could be called upon to negotiate quite a number of changes in a short period of time. As an example, the second movement of Rimsky-Korsakov’s Scheherazade begins with a small cadenza for solo violin and harp, followed by a recitative-like melody for bassoon and divided bass section. Rimsky-Korsakov continues to develop this material while adding new ideas expressed in different meters and speeds.

Before the nineteenth century, rubato was a way of using rhythm where two parts could float in freedom as long as they came back together at some point. The classic rule of thumb was that, for a pianist, the right hand steals time (forward or backward) without the left hand being aware of anything amiss. Starting with the Romantic composers, rubato might be viewed as a structural device, that is to say, within-phrase tempo variation could be applied as a way of changing or influencing the arc of the piece on a larger scale. This kind of rubato might not be connected within a bar or bars, but rather used to delineate one part in a work that has great tempo freedom in contrast with other parts that are more strict. The second movement of Tchaikovsky’s Fifth Symphony is a good example, as the composer sets up a first section where melodic material and even the opening harmony respond to the tempo of Andante Cantabile, con alcuna licenza by being flexible even before the many printed indications of animando, ritenuto, sostenuto, animato, and the like. This is contrasted by a second section where a tempo indication of Moderato con anima and an unwavering series of regular syncopations in the accompaniment make it clear that rubato is to be put aside. Indeed, this section ushers back in the motto of the work, itself of a rhythmic quality unrelenting in its steadiness. In this way, for Tchaikovsky, the desired tempo indications that include rubato actually serve a formal as well as an expressive purpose.

Nineteenth-century opera was a major driver in the importance of the conductor’s role. The addition of ever larger groups of soloists, choruses, and dancers created a compelling need for the conductor to be able to present the passage of time and its changes visually to the performers. The dramatic narrative began to evolve away from opera as a collection of scenes that might contain widely different expressive material presented with quite wide-ranging musical means. While some earlier composers such as Mozart had been expanding numbers in their operas for expressive purposes (the finales of acts two and four of The Marriage of Figaro, or the finale of Act One of The Magic Flute come to mind), the division between separate numbers as a way to articulate a story becomes much less rigid. Opera composition transforms from a set of scenes into a flowing narrative with wide ranging musical means to express the many moods and situations succeeding each other without interruption. The wink in Swarowsky’s eye is evident when one considers just how challenging “making any changes” could be.

At the dawn of the twentieth century, the influence of rhythm’s importance in Western art music was being felt through an increasing number of challenging scores. Composers were using syncopation, complex meter changes, and polyrhythms in their experiments with freeing up musical time. Dance choreography was also pushing against the traditional boundaries set by traditional ballet. While Stravinsky’s Rite of Spring is held up as the poster child for this newer, complex use of rhythm, works such as Debussy’s Jeux, Schoenberg’s Five Pieces for Orchestra, Webern’s Six Pieces for Orchestra, Ravel’s Daphnis and Chloe, Ives’s Three Places in New England, and Grainger’s The Warriors all demand a very organized mind and clear hand to be played well.

After the First World War, the number and variety of rhythmic explorations continued to increase. The influence and incorporation of jazz elements, ethnomusicological research into both European and non-European folk musics, as well as the type of experimental ideas put forward in Henry Cowell’s book New Musical Resources, which imagines a future of superimposed speeds and time signatures, led to a perplexing set of challenges that might face a conductor when encountering a new score.

Following 1945, with the continued experimental ideas of the composers who came of age during this period such as Boulez, Maderna, Stockhausen, Berio, Cage, Nono, Carter, Xenakis, and Birtwistle, among many others, conductors might be forgiven for approaching a new score with a certain amount of trepidation. The technical demands of works by these composers might require a unique rhythmic approach for every new opus. What is one to do, for example, at the end of Carter’s Double Concerto where the two different groups play in 44 and 98 simultaneously? Personally, in preparing this passage, I worked for several months to beat four equal beats with my left hand while hearing all the music in the 98 group in my head, followed by conducting three equal beats with my right hand while hearing all the 44 musical material in my head. The trick was then to put these two beat patterns together simultaneously and flexibly (no mean feat in a bilaterally symmetrical body!) while imagining the whole in one’s inner ear. As a small footnote, it also meant making a copy of all the pages of the score and putting them on one enormous sheet of music so that I didn’t have to turn pages!

With the rhythmic choices available to composers today, it is often helpful for the conductor to have a few rules of thumb to help make decisions when encountering a new score for the first time. In a practical sense, the rhythm of most scores will be roughly divided, from the conductor’s point of view, into two categories: (1) the use of a pulse with a certain degree of regularity that is subdivided into various groupings, as in Modulations by Gérard Grisey or (2) a small duration used as the basis for combinations of different bar lengths, as seen throughout Tehillim by Steve Reich. The reality, however, is quite often a combination of these two approaches in addition to any tempo flexibility that may be required. Hence a piece might require a conductor to beat what appear to be simple time signatures, but within those bars some players are playing seven notes in the time of three while others may have a syncopated rhythm of five against four. This is the sort of approach one might find in a score by György Ligeti. In another score, there could be combinations of successive bars with alternating lengths of 716, 38, 24, 1116, and so on, as seen in works by Australian composer Brett Dean.

A number of composers have developed quite sophisticated uses of tempo changes in close succession and of superimposed tempi to give the effect of music that moves at several speeds at once. Elliott Carter’s metric modulation is one example in which an underlying subdivision of, say, five sixteenth notes on one side of a barline equaling four triplet eighth notes on the other, keeps the change of speed in a mathematical relationship. Karlheinz Stockhausen developed a scale of tempo that would be parallel to a scale of tones by dividing the speeds of 60 and 120 beats per minute into twelve discrete gradations, equivalent to the twelve semitones within an octave. This allows him in his composition Inori, for example, to have a tempo that switches up or down with an immediacy equivalent to the change of pitch. This is why, in that score, the three beats in a bar for the conductor may require three quite different speeds. It also accounts for the raised eyebrows of musicians when encountering 63.5 as a metronome marking.

Thomas Adès uses a system of metric time signatures to denote the different speeds of notes and bars in his scores that at first glance may appear confusing. In traditional notation we use a fraction such as 54 to indicate that the length of that bar is five quarter notes long. Each one of those quarter notes could be combined or subdivided to make for larger or smaller units of half notes, eighths, or sixteenths within the bar. But those units also could be parsed differently. If we combine the last two quarters into a half note and then subdivide that into triplets, we have a bar of three normal quarters followed by three triplet quarters. Adès used the value and speed of the last three notes to write them out as “sixth” notes (notes whose speed is a relation of six in the time of four) and our so our familiar 54 could be written out as 34+36. This may require some explanation in rehearsal when players are confronted with a series of times signatures of 16, 24, 38, 24, 26, 44 as found in the third movement of Adès’s Asyla. It is, however, simply a very practical way to notate the idea of metric modulation on a smaller time scale and becomes intuitive quickly.

Sometimes the requirement is to stay exactly synchronized with an electronic component, as in Désintégrations by Tristan Murail, or a visual element, as in Three Tales by Steve Reich. This degree of precision and alignment with other clock sources necessitates the use of a click track in which a series of audible clicks is fed to the conductor via headphones. Indeed, it is not unusual in the realm of soundtrack recordings for film that an entire group of musicians has a click given to them directly on headphones while they play. In this studio setting, it might appear that the conductor no longer needs to fulfill the duties of a timekeeper, but it glosses over an important fact. Conductors work with their own bodies to create a personal language in which they are able to express the quality and feeling of the rhythms to be played while they are also providing the regular pattern of the time signature and speed of the tempo. This is, thankfully, not something that the click will ever accomplish on its own and is an essential part of unifying any performance of music, with or without a click provided.

Thus, even when it seems like one might simply replace the timekeeper with a machine of some sort, the conductor’s job returns, in a very real sense, back to Professor Swarowsky’s adage and demonstrates how, ironically in a purely aural medium, a conductor visually embodies time.

Footnotes

3 Visualizing the Rhythms of Performance

4 A Percussionist Understands Rhythm in Five Essays of Exactly 1,000 Words Each Not Including Titles and Subtitles

5 A Different Kind of Virtuosity

6 Conducting Rhythm

Figure 0

4.4 Feldman, opening of For Philip Guston.Morton Feldman For Philip Guston|für Flöte, Schlagzeug und Klavier.

© Copyright 1985 by Universal Edition (London) Ltd., London/UE17967, used with kind permission
Figure 1

4.5 Levine, from Four Places, Many More Times.

Used with kind permission of the composer, Josh Levine
Figure 2

4.6 Reynolds, from Here and There.

Used with kind permission of C. F. Peters Corp., Henmar Press

Save book to Kindle

To save this book to your Kindle, first ensure no-reply@cambridge.org is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.

Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

Find out more about the Kindle Personal Document Service.

Available formats
×

Save book to Dropbox

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.

Available formats
×

Save book to Google Drive

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.

Available formats
×