1. Introduction
Human animals – and no other – build fires and wheels, diagnose each other's illnesses, communicate using symbols, navigate with maps, risk their lives for ideals, collaborate with each other, explain the world in terms of hypothetical causes, punish strangers for breaking rules, imagine impossible scenarios, and teach each other how to do all of the above. At first blush, it might appear obvious that human minds are qualitatively different from those of every other animal on the planet. Ever since Darwin, however, the dominant tendency in comparative cognitive psychology has been to emphasize the continuity between human and nonhuman minds and to downplay the differences as “one of degree and not of kind” (Darwin Reference Darwin1871). Particularly in the last quarter century, many prominent comparative researchers have claimed that the traditional hallmarks of human cognition – for example, complex tool use, grammatically structured language, causal-logical reasoning, mental state attribution, metacognition, analogical inferences, mental time travel, culture, and so on – are not nearly as unique as we once thought (see, e.g., Bekoff et al. Reference Bekoff, Allen and Burghardt2002; Call Reference Call, Hurley and Nudds2006; Clayton et al. Reference Clayton, Bussey and Dickinson2003; de Waal & Tyack Reference de Waal and Tyack2003; Matsuzawa Reference Matsuzawa2001; Pepperberg Reference Pepperberg2002; Rendell & Whitehead Reference Rendell and Whitehead2001; Savage-Rumbaugh et al. Reference Savage-Rumbaugh, Shanker and Taylor1998; Smith et al. Reference Smith, Shields and Washburn2003; Tomasello et al. Reference Tomasello, Call and Hare2003a). Pepperberg (Reference Pepperberg, Hurley and Nudds2005, p. 469) aptly sums up the comparative consensus as follows: “for over 35 years, researchers have been demonstrating through tests both in the field and in the laboratory that the capacities of nonhuman animals to solve complex problems form a continuum with those of humans.”
Of course, many scholars continue to claim that there is something qualitatively different about at least some human faculties, particularly those associated with language and a representational theory of mind (see, e.g., Bermudez Reference Bermúdez2003; Carruthers Reference Carruthers2002; Donald Reference Donald2001; Mithen Reference Mithen1996; Premack Reference Premack2007; Suddendorf & Corballis Reference Suddendorf and Corballis2007a). Nearly everyone agrees that there is something uniquely human about our ability to represent and reason about our own and others' mental states (e.g., Tomasello et al. Reference Tomasello, Carpenter, Call, Behne and Moll2005). And most linguists and psycho-linguists argue that there is a fundamental discontinuity between human and nonhuman forms of communication (e.g., Chomsky Reference Chomsky1980; Jackendoff Reference Jackendoff2002; Pinker Reference Pinker1994). But the trend among comparative researchers is to construe the uniquely human aspect of these faculties in increasingly narrow terms. Hauser et al. (Reference Hauser, Chomsky and Fitch2002a), for example, continue to claim that grammatically structured languages are unique to the human species, but suggest that the only component of the human language faculty that is, in fact, uniquely human is the computational mechanism of recursion. The rest of our “conceptual-intentional” system, they argue, differs from that of nonhuman animals only in “quantity rather than kind” (Hauser et al. Reference Hauser, Chomsky and Fitch2002a, p. 1573). Similarly, Tomasello and Rakoczy (Reference Tomasello and Rakoczy2003, p. 121) argue that the ability to participate in cultural activities with shared goals and intentions is uniquely human, but claim that the cognitive skills of a human child born on a desert island and somehow magically kept alive by itself until adulthood “would not differ very much – perhaps a little, but not very much” from the cognitive skills of other great apes (see also Tomasello et al. Reference Tomasello, Call and Hare2003a; Tomasello et al. Reference Tomasello, Carpenter, Call, Behne and Moll2005).
Notwithstanding the broad comparative consensus arrayed against us, the hypothesis we will be proposing in the present paper is that Darwin was mistaken: The profound biological continuity between human and nonhuman animals masks an equally profound functional discontinuity between the human and nonhuman mind.Footnote 1 Indeed, we will argue that the functional discontinuity between human and nonhuman minds pervades nearly every domain of cognition – from reasoning about spatial relations to deceiving conspecifics – and runs much deeper than even the spectacular scaffolding provided by language or culture alone can explain.
At the same time, we know from Darwin's more well-grounded principles that there are no unbridgeable gaps in evolution. Therefore, one of the most important challenges confronting cognitive scientists of all stripes, in our view, is to explain how the manifest functional discontinuity between extant human and nonhuman minds could have evolved in a biologically plausible manner.
The first – and probably most important – step in answering this question is to clearly identify the similarities and the dissimilarities between human and nonhuman cognition from a purely functional point of view. We therefore spend the bulk of the paper reexamining the evidence for “human-like” cognitive abilities among nonhuman animals at a functional level, before speculating as to how these processes might be implemented. We cover a wide variety of domains, species, and experimental protocols – ranging from spatial relations and mental state reasoning in the lab to dominance relations and transitive inferences in the wild. Across all these disparate cases, a consistent pattern emerges: Although there is a profound similarity between human and nonhuman animals' abilities to learn about and act on the perceptual relations between events, properties, and objects in the world, only humans appear capable of reinterpreting the higher-order relation between these perceptual relations in a structurally systematic and inferentially productive fashion. In particular, only humans form general categories based on structural rather than perceptual criteria, find analogies between perceptually disparate relations, draw inferences based on the hierarchical or logical relation between relations, cognize the abstract functional role played by constituents in a relation as distinct from the constituents' perceptual characteristics, or postulate relations involving unobservable causes such as mental states and hypothetical physical forces. There is not simply a consistent absence of evidence for any of these higher-order relational operations in nonhuman animals; there is compelling evidence of an absence.
In the last part of the article, we argue for the representational-level implications of our analysis. Povinelli and colleagues have previously proposed that humans alone are able to “reinterpret” the world in terms of unobservable, hypothetical entities such as mental states and causal forces and that our ability to do so relies on a unique representational system that has been grafted onto the cognitive architecture we inherited from our nonhuman ancestors (Povinelli Reference Povinelli2000; Reference Povinelli2004; Povinelli & Giambrone Reference Povinelli and Giambrone2001; Povinelli & Preuss Reference Povinelli and Preuss1995; Povinelli & Vonk Reference Povinelli and Vonk2003; Reference Povinelli and Vonk2004; Vonk & Povinelli Reference Vonk, Povinelli, Zentall and Wasserman2006). Independently, Holyoak, Hummel, and colleagues have argued that the ability to reason about higher-order relations in a structurally systematic and inferentially productive fashion is a defining feature of the human mind and requires the distinctive representational capabilities of a “biological symbol system” (Holyoak & Hummel Reference Holyoak, Hummel, Dietrich and Markman2000; Reference Holyoak, Hummel, Gentner, Holyoak and Kokinov2001; Hummel & Holyoak Reference Hummel and Holyoak1997; Reference Hummel, Holyoak and Gattis2001; Reference Hummel and Holyoak2003; Kroger et al. Reference Kroger, Holyoak and Hummel2004; Robin & Holyoak Reference Robin, Holyoak and Gazzaniga1995). Herein we combine, revise, and substantially expand on the hypotheses proposed by these two research groups.
We argue that most of the salient functional discontinuities between human and nonhuman minds – including our species' unique linguistic, mentalistic, cultural, logical, and causal reasoning abilities – result in part from the difference in degree to which human and nonhuman cognitive architectures are able to approximate the higher-order, systematic, relational capabilities of a physical symbol system (Newell Reference Newell1980; Newell & Simon Reference Newell and Simon1976). Although human and nonhuman animals share many similar cognitive mechanisms, our relational reinterpretation hypothesis (RR) is that only human animals possess the representational processes necessary for systematically reinterpreting first-order perceptual relations in terms of higher-order, role-governed relational structures akin to those found in a physical symbol system (PSS). We conclude by suggesting that recent advances in symbolic-connectionist models of cognition provide one possible explanation for how our species' unique ability to approximate the higher-order relational capabilities of a physical symbol system might have been grafted onto the proto-symbolic cognitive architecture we inherited from our nonhuman ancestors in a biologically plausible manner.
2. Similarity
2.1. Perceptual versus relational similarity
We begin our review of the similarities and differences between human and nonhuman cognition with what William James (Reference James1890/1950) called “the very keel and backbone of our thinking”: sameness. The ability to evaluate the perceptual similarity between stimuli is clearly the sine qua non of biological cognition, subserving nearly every cognitive process from stimulus generalization and Pavlovian conditioning to object recognition, categorization, and inductive reasoning. Humans, however, are not limited to evaluating the similarity between objects based on perceptual regularities alone. Humans not only recognize when two physical stimuli are perceptually similar, they can also recognize that two ideas, two mental states, two grammatical constructions, or two causal-logical relations are similar as well. Even preschool-age children understand that the relation between a bird and its nest is similar to the relation between a dog and its doghouse despite the fact that there is little “surface” or “object” similarity between the relations' constituents (Goswami & Brown Reference Goswami and Brown1989; Reference Goswami and Brown1990). Indeed, as numerous researchers have shown, the propensity to evaluate the similarity between states of affairs based on the causal-logical and structural characteristics of the underlying relations rather than on their shared perceptual features appears quite early and spontaneously in all normal humans – as early as 2–5 years of age, depending on the domain and complexity of the task (Gentner Reference Gentner1977; Goswami Reference Goswami, Gentner, Holyoak and Kokinov2001; Halford Reference Halford1993; Holyoak et al. Reference Holyoak, Junn and Billman1984; Namy & Gentner Reference Namy and Gentner2002; Rattermann & Gentner Reference Rattermann and Gentner1998a; Richland et al. Reference Richland, Morrison and Holyoak2006).
In short, there appear to be at least two kinds of similarity judgments at work in human thought: judgments of perceptual similarity based on the relation between observed features of stimuli; and judgments of non-perceptual relational similarity based on logical, functional, and/or structural similarities between relations and systematic correspondences between the abstract roles that elements play in those relations (Gentner Reference Gentner1983; Gick & Holyoak Reference Gick and Holyoak1980; Reference Gick and Holyoak1983; Goswami Reference Goswami, Gentner, Holyoak and Kokinov2001; Markman & Gentner Reference Markman and Gentner2000). The question we are interested in here is whether or not there is any evidence for non-perceptual relational similarity judgments in nonhuman animals as well.
2.2. Same-different relations
Among comparative researchers, the most widely replicated test of relational concept learning over the last quarter century has been the simultaneous same-different (S/D) task, in which the subject is trained to respond one way if two simultaneously presented stimuli are the same and to respond a different way if the two stimuli are different. In the purportedly more challenging relational match-to-sample (RMTS) task, the subject must select the choice display in which the perceptual similarity among elements in the display is the same as the perceptual similarity among elements in the sample stimulus. For example, presented with a pair of identical objects, AA, as a sample stimulus, the subject should select BB rather than CD; presented with a pair of dissimilar objects, EF, as the sample stimulus, the subject should select GH rather than JJ (see Thompson & Oden Reference Thompson and Oden2000 for a seminal discussion).
Although Premack (Reference Premack1983a; Reference Premack1983b) initially reported that only language-trained chimpanzees passed S/D and RMTS tasks, success on two-item S/D tasks has since been demonstrated in parrots (Pepperberg Reference Pepperberg1987), dolphins (Herman et al. Reference Herman, Pack, Morrel-Samuels, Roitblat, Herman and Nachtigall1993b; Mercado et al. Reference Mercado, Killebrew, Pack, Macha and Herman2000), baboons (Bovet & Vauclair Reference Bovet and Vauclair2001), and pigeons (Blaisdell & Cook Reference Blaisdell and Cook2005; Katz & Wright Reference Katz and Wright2006), among others. Thompson et al. (Reference Thompson, Oden and Boysen1997) showed that language-naive chimpanzees with some exposure to token-based symbol systems are able to pass a two-item RMTS task (cf. Premack Reference Premack and Weiskrantz1988). Vonk (Reference Vonk2003) has reported that three orangutans and one gorilla were able to pass a complex two-item RMTS task without any explicit symbol or language training at all. Fagot et al. (Reference Fagot, Wasserman and Young2001) have shown that language-naïve baboons can pass an RMTS task involving arrays of elements (see discussion below); and Cook and Wasserman (in press) have reported successful results on an array-based RMTS task with pigeons. So passing S/D and RMTS tasks does not appear to be limited to language-trained apes or even primates.
Regardless of which nonhuman species are capable of passing S/D and RMTS tasks, the more critical and largely overlooked point is this: Both of these experimental protocols lack the power, even in principle, of demonstrating that a subject cognizes sameness and difference as abstract, relational concepts which are (1) independent of any particular source of stimulus control, and (2) available to serve in a variety of further higher-order inferences in a systematic fashion. A functional decomposition of the S/D and RMTS protocols reveals that the minimum cognitive capabilities necessary to pass these tests are much more modest.
The fundamental problem is that the same-different relation at stake in the classic S/D task can be reduced to a continuous, analog estimate of the degree of perceptual variability between the elements in each display. Halford et al. (Reference Halford, Wilson and Phillips1998a) refer to this type of cognitive trick as “conceptual chunking.” Chunking reduces the complexity of processing a relation at the cost of losing the original structure and components of the relation itself, but suffices when the task does not require the structure of the relation itself to be taken into account. A cognizer could pass a classic S/D task by calculating an analog estimate of the variability between items in the sample display and then employ a simple conditional discrimination to select the appropriate behavioral response to this chunked result. Hence, success on an S/D task may imply that a subject can generalize a rule-like discrimination beyond any particular feature in the training stimuli; but it cannot be taken as evidence that the subject has understood sameness and difference as structured relations that are mutually exclusive or that can be freely generalized beyond the modality-specific rule the subject used in a particular learning context.
The same deflationary functional analysis applies, mutatis mutandis, to the RMTS task. The apparent relational complexity of the RMTS task can be significantly reduced by segmenting the task into separate chunked operations that are evaluated sequentially. First, the subject can evaluate the variability within the first-order relations by chunking them into analog variables. Second, the subject can employ a straightforward conditional discrimination to select the appropriate choice display: for example, <if the variability of the sample display is low, select the choice display with a low variability>. Although this may qualify as a “higher-order” operation, it does not qualify as a higher-order relational operation since the constituent structures of the first-order relations are no longer relevant or available to the higher-order process (see again, Halford et al. Reference Halford, Wilson and Phillips1998a). At best, the RMTS task demonstrates that nonhuman animals can select the choice display that has the same degree of between-item variability as the sample display. But the task says nothing about nonhuman animals' ability to evaluate the non-perceptual relational similarity between those relations.
The preceding functional decomposition of the S/D and RMTS tasks is not merely a hypothetical possibility. There is now good experimental evidence that chunking and segmentation are precisely the tactics that nonhuman animals employ when they succeed at S/D and RMTS tasks. Wasserman and colleagues, for example, have shown that both pigeons and baboons have much less difficulty passing S/D tasks when there are 16 items in each set than when there are only 2 items in each set (Wasserman et al. Reference Wasserman, Young and Fagot2001; Young & Wasserman Reference Young and Wasserman1997). Wasserman et al. showed that a simple measure of item variability, based on Shannon and Weaver's (Reference Shannon and Weaver1949) measure of informational entropy, nicely captures the functional pattern of nonhuman subjects' discriminations across a variety of experimental conditions (reviewed in Wasserman et al. (Reference Wasserman, Young and Cook2004). Nonhuman animals' performance on S/D tasks differs markedly from the categorical, logical distinction that humans make between sameness and difference. Human subjects' responses to S/D tasks are also influenced by the degree of variability in the stimuli (Castro et al. Reference Castro, Young and Wasserman2007; Young & Wasserman Reference Young and Wasserman2001); but most human subjects exhibit a categorical distinction between displays with no item variability (i.e., same) and those with any item variability at all (i.e., different).
An analogous discontinuity between human and nonhuman judgments of similarity has also been documented on RMTS tasks. Fagot et al. (Reference Fagot, Wasserman and Young2001) presented two adult baboons and two adult human subjects with an RMTS task using arrays of 16 visual icons that were either all alike or all different. Both baboon and human subjects learned to pass the RMTS test and successfully generalized to novel sets of stimuli. When the authors reduced the number of items in the sample set from 16 to 2 icons, the difference between the two species, however, was notable. The impact on the human subjects' responses was insignificant. The baboons' performance, however, fell to chance on different trials, whereas their performance on same trials remained unchanged. This markedly asymmetric effect is exactly what one would expect if the baboons were discriminating between second-order same and different relations by comparing the amount of variability (e.g., entropy) in the two displays. That is, same trials with 2 icons continue to yield zero entropy, but different trials now yield a small entropy value that is more difficult to discriminate from zero.
Entropy is certainly not the only factor modulating nonhuman subjects' judgments of sameness and difference. Stimulus oddity as well as spatial organization and degree of similarity also play an important role (see Cook & Wasserman (Reference Cook, Wasserman, Wasserman and Zentall2006 for an important review). Vonk (Reference Vonk2003) has shown that language-naive apes can judge variability along specific perceptual dimensions (e.g., color rather than size or shape). And Bovet and Vauclair (Reference Bovet and Vauclair2001) have shown that baboons can pass a “conceptual” S/D task in which pairs of objects are to be treated as same if they share a similar learning history or biological significance (e.g., objects-I-have-eaten vs. objects-I-have-not-eaten). These results demonstrate that nonhuman animals – and not just language-trained chimpanzees – are capable of learning novel, sophisticated, rule-governed discriminations that generalize beyond any specific perceptual cue. But in all of the results reported to date, the relevant discriminations are bound to a particular source of stimulus control (e.g., entropy, oddity, edibility). There is no evidence that nonhuman animals understand what “sameness” in one task has in common with “sameness” in another. For example, after passing a “perceptual” S/D task and having been trained to categorize objects as either “food” or “not food,” the baboons in Bovet and Vauclair's (2001) study nevertheless required an average of 14,576 additional trials on the “conceptual” S/D task before their responses were correct 80% of the time on trials involving novel pairs of objects.
The available evidence therefore suggests that the formative discontinuity in same-different reasoning lies not between monkeys and apes, as Thompson and Oden (Reference Thompson and Oden2000) proposed, but between nonhumans and humans. Chimpanzees and other nonhuman apes can pass RMTS tasks with only 2 items in the sample display (e.g., Thompson et al. Reference Thompson, Oden and Boysen1997; Vonk Reference Vonk2003). Baboons can pass RMTS tasks with as few as 3–4 items in each sample (Fagot et al. Reference Fagot, Wasserman and Young2001); and pigeons can pass RMTS tasks with 16 items in each sample (Cook & Wasserman, in press). The difference between the performance of language-naive pigeons and language-trained chimps on these tasks often comes down to a question of the number of items in each set and the number of trials necessary to reach criterion. As Katz and colleagues point out (see Katz & Wright Reference Katz and Wright2006; Katz et al. Reference Katz, Wright and Bachevalier2002), this strongly suggests that there is a difference in degree between various nonhuman species' sensitivity to similarity discriminations (influenced by training regimen), not a difference in kind between their conceptual abilities to predicate same-different relations.
The performance of human subjects, on the other hand, contrasts sharply with the performance of all other animal species. Humans manifest an abrupt, categorical distinction between displays in which there is no variability and displays in which there is any variability at all (Cook & Wasserman Reference Cook, Wasserman, Wasserman and Zentall2006; Wasserman et al. Reference Wasserman, Young and Cook2004). More importantly, contra Castro et al. (Reference Castro, Young and Wasserman2007), we believe that human subjects possess a qualitatively distinct system for reinterpreting sameness and difference in a logical and abstract fashion that generalizes beyond any particular source of stimulus control. In short, even with respect to the most basic and ubiquitous of all cognitive phenomena – judgments of similarity – there is already a distinctive seam between human and nonhuman minds.
2.3. Analogical relations
Premack (Reference Premack1983a, p. 357) suggested that the RMTS task is an implicit form of analogy and claimed that “animals that can make same/different judgments should be able to do analogies.” Indeed, it is still widely accepted that the ability to pass an RMTS task is the “cognitive primitive” for analogical reasoning (see, e.g., Thompson & Oden Reference Thompson and Oden2000, p. 378). We disagree. While recognizing perceptual similarities is certainly a necessary condition for making analogical inferences (inter alia), there is a qualitative difference between the kind of cognitive processes necessary to pass an S/D or RMTS task and the kind of cognitive processes necessary to reason in an analogical fashion. The relations at issue in S/D and RMTS tasks are based solely on the perceptual features of the constituents; and the constituents play undifferentiated and symmetrical roles in those relations (e.g., two objects are symmetrically either the same or different).
Most true analogies, on the other hand, are based on relations in which the constituents play asymmetrical, causal-logical roles (e.g., the role that John plays in forming the relation, John loves Mary, is not equivalent to the role that Mary plays, perhaps to John's dismay). Furthermore, genuine analogical inferences are made by finding systematic structural similarities between perceptually disparate relations, allowing the cognizer to draw novel inferences about the target domain independently from the perceptual similarity between the relations' constituents (Gentner Reference Gentner1983; Gentner & Markman Reference Gentner and Markman1997; Holyoak & Thagard Reference Holyoak and Thagard1995). Accordingly, analogical relations sensu stricto cannot be reduced via chunking and segmentation, but require the cognizer to evaluate the abstract, higher-order relations at stake in a structurally systematic and inferentially productive fashion.
Analogical reasoning is a fundamental and ubiquitous aspect of human thought. It is at the core of creative problem solving, scientific heuristics, causal reasoning, and poetic metaphor (Gentner Reference Gentner, Gentner and Goldin-Meadow2003; Gentner et al. Reference Gentner, Holyoak and Kokinov2001; Holyoak & Thagard Reference Holyoak and Thagard1995; Reference Holyoak and Thagard1997; Lien & Cheng Reference Lien and Cheng2000). And it is also central to the more prosaic ways that typical human children learn about the world and each other (Goswami Reference Goswami1992; Reference Goswami, Gentner, Holyoak and Kokinov2001; Halford Reference Halford1993; Holyoak et al. Reference Holyoak, Junn and Billman1984). To date, however, the only evidence that any nonhuman animal is capable of analogical reasoning sensu stricto comes from the unreplicated feats of a single chimpanzee, Sarah, reported more than 25 years ago by Gillan et al. (Reference Gillan, Premack and Woodruff1981). Sarah reportedly constructed and completed two distinct kinds of analogies. The first was based on judging whether or not two geometric relationships were the same or different (e.g., large blue triangle is to small blue triangle as large yellow crescent is to small yellow crescent). The second was based on judging the similarity between two “functional” relationships (e.g., padlock is to key as tin can is to can opener). Gillan et al. (Reference Gillan, Premack and Woodruff1981) reported that Sarah was successful on both tests.
Savage-Rumbaugh was the first to point out that Sarah's performance on the geometric version of the original tests could have been the result of a simple, feature-matching heuristic (cited by Oden et al. (Reference Oden, Thompson, Premack, Gentner, Holyoak and Kokinov2001). In response, Oden et al. (Reference Oden, Thompson, Premack, Gentner, Holyoak and Kokinov2001) followed up Gillan et al.'s original experiment on geometric analogies with a series of more carefully constructed tests designed to flesh out Sarah's actual cognitive strategy. These new experiments used geometric forms that varied along one or more featural dimensions (e.g., size, color, shape, and/or fill). After extensive testing, Oden et al. showed that Sarah was actually tracking the number of within-pair featural differences rather than the kind of relation between pairs of figures. For example, whereas a human would see a color plus a shape change as differing from a size plus a fill change, Sarah saw these two transformations as equivalent because they both entailed two featural changes.
Oden et al. (Reference Oden, Thompson, Premack, Gentner, Holyoak and Kokinov2001) argued that this strategy still demonstrates Sarah's ability to reason about the “relation between relations.” But there is a profound difference between the feature-based heuristic Sarah apparently adopted and the role-based structural operations that are the basis of analogical inference sensu stricto. To be sure, keeping track of the number of within-pair featural changes certainly requires quite sophisticated representational processes. But the fact that Sarah apparently ignored the structure of the relation between pairs of figures suggests that she represented any featural change as an undifferentiated chunk for the purposes of this task. Therefore, her strategy on this task appears to be computationally equivalent to the kind of chunking and segmentation strategies other nonhuman primates use to solve RMTS tasks. According to Oden et al.'s (2001) own analysis, Sarah failed to demonstrate a systematic sensitivity to the higher-order structural relation between relations. It is this systematic sensitivity to higher-order structural relations which is, as Gentner (Reference Gentner1983) has long argued, the hallmark of analogical reasoning in humans.
Therefore, the claim that nonhuman animals are capable of analogical inferences rests solely on Sarah's performance in the test of functional analogies reported by Gillan et al. (Reference Gillan, Premack and Woodruff1981). There are many reasons to be skeptical of these results as well. For one, Sarah's performance on these analogies has never been replicated either by Sarah herself or by any other nonhuman subject. Second, of the two experiments (3A and 3B) devoted to functional analogies, the authors themselves admit that the first, 3A, is open to an alternative feature-based account. Furthermore, the second experiment, 3B, did not require Sarah to complete or construct analogies. It merely required her to respond to the relation between two pairs of objects with one of two plastic tokens that her experimenters interpreted as meaning same and different. Sarah's extensive prior exposure to the objects used in this experiment, however, makes it very difficult to judge how she learned to cognize the relation between these objects (e.g., how exactly did Sarah understand that the relation between “torn cloth” and “needle and thread” is the same as the relation between “marked, torn paper” and “tape”?). Indeed, the authors themselves admit that Sarah's “unique experimental history” may have contributed to her success on these tasks (Gillan et al. (Reference Gillan, Premack and Woodruff1981, p. 11).
In short, what is sorely needed is a more extensive series of tests, like those carried out by Oden et al. (Reference Oden, Thompson, Premack, Gentner, Holyoak and Kokinov2001), to systematically tease apart the salient parameters in Sarah's cognitive strategy. Until then, Sarah's remarkable and unreplicated success on experiment 3B as reported by Gillan et al. (Reference Gillan, Premack and Woodruff1981) constitutes thin support for claiming that nonhuman animals are capable of analogical reasoning.
3. Rules
One of the hallmarks of human cognition is our ability to freely generalize abstract relational operations to novel cases beyond the scope in which the relation was originally learned (see Marcus Reference Marcus2001 for a lucid exposition). It is widely recognized, for example, that the ability to freely generalize relational operations over role-based variables is a necessary condition for using human languages (Gomez & Gerken Reference Gomez and Gerken2000). Furthermore, experiments in artificial grammar learning (AGL) have shown that human subjects' ability to learn and generalize abstract relations over role-based abstractions is not limited to natural languages (e.g., Altmann et al. Reference Altmann, Dienes and Goode1995; Gomez Reference Gomez1997; Marcus et al. Reference Marcus, Vijayan, Bandi Rao and Vishton1999; Reber Reference Reber1967). Although it is quite controversial how the human cognitive architecture performs these rule-like feats (see, e.g., Marcus Reference Marcus1999; McClelland & Plaut Reference McClelland and Plaut1999; Seidenberg & Elman Reference Seidenberg and Elman1999), the fact that human subjects manifest these rule-like generalizations is “undisputed” (Perruchet & Pacton Reference Perruchet and Pacton2006). The question we want to focus on here is whether or not this undisputable behavioral “fact” also holds for nonhuman animals.
To date, the strongest positive evidence that nonhuman animals are able to generalize novel rules in a systematic fashion comes from an experiment with tamarin monkeys (Hauser et al. Reference Hauser, Weiss and Marcus2002b), which replicated an AGL experiment that Marcus et al. (Reference Marcus, Vijayan, Bandi Rao and Vishton1999) had previously performed on 7-month-old children. In this “ga ti ga” protocol, subjects were habituated to sequences of nonsense syllables in one of two patterns (e.g., AAB vs. ABB). Following habituation, the subjects were presented with test sequences drawn from an entirely novel set of syllables. Some of the test sequences followed the grammatical pattern presented during habituation and some did not. Hauser et al. (Reference Hauser, Weiss and Marcus2002b) showed that tamarin monkeys, like human children, were more likely to dishabituate to the novel, “ungrammatical” pattern.
In our view, the claim that this experiment provides evidence for “rule learning” in a nonhuman species is not entirely unfounded; but it needs to be carefully qualified, as the kind of rules that tamarin monkeys learned in this experiment is qualitatively different from the kind of rules that is characteristic of human language and thought. Many early AGL experiments failed to distinguish between tasks that required subjects to learn perceptually bound relations from tasks that required subjects to learn non-perceptual structural relations over role-based variables (for a critical review, see Redington & Chater (Reference Redington and Chater1996). Tunney and Altmann (Reference Tunney and Altmann1999), for example, point out that there are at least two forms of sequential dependencies that might be learned in an AGL experiment: “repeating” dependencies in which the occurrence of an element in one position determines the occurrence of the same element in a subsequent position, and “nonrepeating” dependencies in which the occurrence of an element in one position determines the occurrence of a different element in a subsequent position. Repeating elements share a higher-order perceptual regularity (i.e., perceptual similarity), whereas purely structural dependencies between non-repeating elements do not. Therefore, sensitivity to sequential dependencies between repeating elements does not necessarily imply sensitivity to sequential dependencies between nonrepeating elements. Indeed, Tunney and Altmann (Reference Tunney and Altmann2001) demonstrate that adult human subjects appear to have distinct and dissociable mechanisms for learning each kind of dependency. At best, Hauser et al.'s (2002b) results demonstrate that tamarin monkeys possess the ability to learn repeating, perceptually based dependencies.
Similarly, Gomez and Gerken (Reference Gomez and Gerken2000) distinguish between “pattern-based” and “category-based” rules. In the former case, the rule is abstracted from the sequence of perceptual relations between elements in a given array of training stimuli; in the latter case, the rule is based on the structural relation between abstract functional roles. The AAB and ABB patterns learned by tamarin monkeys in Hauser et al.'s (2002b) study are an example of the former, pattern-based type of rule; the noun-verb-noun pattern learned by human language users is an example of the latter, role-based type of rule. Both kinds of operations may qualify as “rule-like” in the sense that they generalize a given relation beyond the feature set on which it was originally trained. But it is role-based (i.e., “algebraic”) rules, as Marcus (Reference Marcus2001) points out, that are the hallmarks of human thought and language. To date, there is no evidence for this kind of rule learning in any nonhuman animal.
4. Higher-order spatial relations
All normal adult humans are capable of using allocentric representations of spatial relations and of reasoning about the higher-order relation between spatial relations at different scales. The ubiquity of maps, diagrams, graphs, gestures, and artificial spatial representations of all sorts in human culture speaks for itself. Indeed, by the age of 3, all normal humans are able to reason about the higher-order relation between small-scale artificial spatial models and large-scale spatial relations in the real world (see Gattis Reference Gattis2005 for a review). DeLoache (Reference DeLoache2004) has argued that this ability represents a crucial step in children's progress towards becoming “symbol minded.” The question at hand is whether there is any evidence that nonhuman animals can reason about the higher-order relation between spatial relations in a similar fashion.
The best evidence to date for higher-order spatial reasoning in a nonhuman animal comes from the work of Kuhlmeier and colleagues (Kuhlmeier & Boysen Reference Kuhlmeier and Boysen2001; Reference Kuhlmeier and Boysen2002; Kuhlmeier et al. Reference Kuhlmeier, Boysen and Mukobi1999). Kuhlmeier et al. (Reference Kuhlmeier, Boysen and Mukobi1999) first instructed seven captive chimpanzees to associate the miniature and the full-sized versions of four distinct objects by drawing their attention to the association “verbally and gesturally” (p. 397). After this initial training, the chimpanzees watched as the experimenter hid a miniature can of soda behind a miniature version of one of the four objects within a 1:7 scale model of a full-sized room or outdoor enclosure. Then the chimpanzees were given the opportunity to find the real can of soda in the adjacent full-sized space. When the chimpanzees were tested on a version of the task in which they were rewarded only if they retrieved the can of soda on the first search attempt (Kuhlmeier & Boysen Reference Kuhlmeier and Boysen2001), six out of the seven subjects performed above chance.
These results demonstrate that chimpanzees are able to learn to associate two objects (the real object and its miniature) that are highly similar perceptually and to locate a reward based on this association. But this is a far cry from being able to reason about the higher-order relation between a scale model and its real-world referent. Indeed, Kuhlmeier et al. (Reference Kuhlmeier, Boysen and Mukobi1999, p. 397) reported that one chimpanzee was able to locate the food rewards simply upon being shown the miniature version of the hiding place without referring to the scale model at all. In short, this first protocol did not require the chimpanzees to reason about the higher-order spatial relation between the scale model and full-sized room. A simple, learned association between two arbitrary cues sufficed.
In a follow-up experiment designed to eliminate purely associative cues, Kuhlmeier and Boysen (Reference Kuhlmeier and Boysen2002) varied the congruency of the color, shape, or position of the miniatures relative to the full-sized version of the hiding site. As a group, the chimpanzees were successful when positional cues were absent. However, when all the hiding sites were visually identical and the correct one had to be found based on its relative location within the scale model alone, only two of the seven chimpanzees performed above chance.
It is clear from these results that reasoning in terms of relative spatial locations alone is significantly more difficult for chimpanzees than is reasoning in terms of object-based cues alone. But it must be noted that even the successful performance of two out of the seven subjects does not demonstrate higher-order relational abilities, since the four locations in which the hiding sites were placed remained constant across all of these experiments (Kuhlmeier, personal communication). Hence, it is impossible to know whether the two successful chimpanzees were reasoning on the basis of a general, systematic understanding of the analogy between spatial locations in the scale model and spatial locations in the outdoor enclosure, or whether, more modestly, they had simply learned over the course of their long experimental history with this particular protocol to associate a particular location in the scale model with a particular location in the enclosure.
It remains to be seen whether chimpanzees, or any other nonhuman animal, could succeed in this protocol if the hiding sites were randomly relocated on each trial. In the meantime, there is a conspicuous absence of evidence that any nonhuman animal can reason about scale models, maps, or higher-order spatial relations in a human-like fashion.
5. Transitive inference
Ever since Piaget (Reference Piaget1928; Reference Piaget1955), the ability to make systematic inferences about unobserved transitive relations has been taken as a litmus test of logical-relational reasoning (but see Wright (Reference Wright2001). For example, told that “Bill is taller than Charles” and “Abe is taller than Bill,” human children can infer that “Abe is taller than Charles” without being given any information about the absolute heights of Abe, Bill, or Charles (Halford (Reference Halford1984). Over the last quarter century, comparative researchers have persistently claimed that nonhuman animals are capable of making transitive inferences in a purely logical-relational fashion, as well. Upon closer examination of the evidence, however, it becomes apparent that the kinds of transitive inferences that are made by nonhuman animals do not require a systematic, domain-general logical-relational competence, but rather, can be made using much more prosaic, domain-specific, and egocentric information-processing mechanisms.
5.1. Transitive choices in the lab
For many decades now, the classic comparative test of transitive inference has been a nonverbal five-item task developed by Bryant and Trabasso (Reference Bryant and Trabasso1971) in which subjects are incrementally trained on pairs of stimuli (i.e., A+B-, B+C-, C+D-, D+E-) and then tested on non-adjacent untrained pairs. The discriminative relation between the stimuli used in most of these studies is not, in fact, transitive; it is the subjects' choices that become transitive as a result of the pattern of differential reinforcement: that is, repeated reinforcement of the choice of A over B and of B over C eventually leads to the subject preferring A over C. As Halford et al. (Reference Halford, Wilson and Phillips1998b) pointed out, a subject's preferences can become transitive through incremental reinforcement without there being a transitive relation between the underlying task elements themselves, and therefore without requiring the subject to understand anything about transitivity as a logical property. Indeed, many researchers have shown that successfully selecting B over D in the traditional five-item incremental protocol can be achieved using purely associative operations (De Lillo et al. Reference DeLillo, Floreano and Antinucci2001; Wynne (Reference Wynne1995).
To be sure, reinforcement history cannot be the whole story, as Lazareva et al. (Reference Lazareva, Smirnova, Bagozkaja, Zorina, Rayevsky and Wasserman2004) have recently demonstrated. Lazareva et al. (Reference Lazareva, Smirnova, Bagozkaja, Zorina, Rayevsky and Wasserman2004) trained eight hooded crows in a clever variation on Bryant and Trabasso's five-item protocol. Five colored cards were used to represent the elements in the series, A through E. The color on one side of the card served as the choice stimulus, and a circle of the same color on the underside of the card served as the post-choice feedback stimulus. The crows were asked to choose one of two simultaneously presented cards. Importantly, the colored circles on the underside of the cards were displayed to the crows only after they had selected one of the two choice stimuli. The crows were divided into two experimental groups. In the ordered-feedback group, the diameter of the circles associated with the choice stimuli became progressively smaller from A to E. In the constant-feedback group, the diameter of the feedback circles did not change. After initial training, Lazareva et al. (Reference Lazareva, Smirnova, Bagozkaja, Zorina, Rayevsky and Wasserman2004) overexposed both groups of crows to D+E- pairings. Under traditional associative models, massive overexposure to D+E- pairings should lead to preferentially selecting D over B. Nevertheless, the crows in the ordered-feedback group selected B over D in the BD pairings, whereas the crows in the constant-feedback group either chose at random or preferred D over B.
Lazareva et al.'s (2004) results show that reinforcement history alone cannot account for the emergence of choice transitivity among nonhuman animals. Moreover, we agree with Lazareva et al. (Reference Lazareva, Smirnova, Bagozkaja, Zorina, Rayevsky and Wasserman2004) that these results are consistent with some kind of “spatial representation” hypothesis (Gillan Reference Gillan1981). But what is not often noted by comparative researchers is that evidence for an integrated representation of an ordered series is not in and of itself evidence for transitive reasoning or relational integration in a logical-deductive sense. There is more to making logically underpinned transitive inferences than constructing an ordered representation of one's choices.
As Lazareva et al. (Reference Lazareva, Smirnova, Bagozkaja, Zorina, Rayevsky and Wasserman2004) themselves point out, in order to claim evidence for logically underpinned transitive inferences, one must show that the organism can, in fact, distinguish between transitive and non-transitive relations and that it makes its choices on the basis of this logical relation independently of other non-logical factors such as reinforcement history and training regime (see also Halford et al. Reference Halford, Wilson and Phillips1998a; Wright Reference Wright2001). The results reported by Lazareva et al. (Reference Lazareva, Smirnova, Bagozkaja, Zorina, Rayevsky and Wasserman2004) do not provide evidence for either of these criteria.
In a follow-up experiment, Lazareva and Wasserman (Reference Lazareva and Wasserman2006) showed that pigeons select B over D stimuli in the same protocol employed by Lazareva et al. (Reference Lazareva, Smirnova, Bagozkaja, Zorina, Rayevsky and Wasserman2004) even when the size of the post-choice cues is constant – which demonstrates that the transitive perceptual relation between the post-choice cues is not, in fact, computationally necessary for successfully passing this particular protocol. It is unclear why crows – but not pigeons – were unable to pass the test in the constant-feedback condition. There are many possible explanations. For example, Lazareva et al. (Reference Lazareva, Smirnova, Bagozkaja, Zorina, Rayevsky and Wasserman2004) did not rule out the possibility that it was simply the variability between post-choice cues that encouraged the crows' successful responses rather than their transitivity per se. In any case, in order to warrant the claim that the crows were reasoning on the basis of the logical relation between post-choice stimuli independently of other non-logical factors, it would be necessary to show that the crows could systematically generalize to novel stimuli on a first trial basis: For example, trained to associate a novel choice stimulus, X, with a colored circle of a given diameter, could the crows correctly choose between X and any stimulus from the set, A through E, on a first-trial basis in a systematic manner? To date, there is no evidence that crows, or any other nonhuman animal, could pass such a test.
5.2. Transitive inferences in the wild
Many researchers have argued that animals' full transitive reasoning capabilities are most likely to manifest themselves in inferences involving social relations (e.g., Bond et al. Reference Bond, Kamil and Balda2003; Grosenick et al. Reference Grosenick, Clement and Fernald2007; Kamil Reference Kamil2004; Paz et al. Reference Paz, Bond, Kamil and Balda2004). Much of the early fieldwork focused on nonhuman primates (see Tomasello & Call Reference Tomasello and Call1997 for a review). The strongest evidence to date for transitive social inferences in a nonhuman animal comes not from primates, however, but from birds (see review by Kamil Reference Kamil2004) and fish (see Grosenick et al. Reference Grosenick, Clement and Fernald2007). Paz et al. (Reference Paz, Bond, Kamil and Balda2004), for example, showed that male pinyon jays can anticipate their own subordinance relation to a stranger after having witnessed the stranger win a series of confrontations with a familiar but dominant conspecific. Similarly, Grosenick et al. (Reference Grosenick, Clement and Fernald2007) allowed territorial A. burtoni male fish to observe pairwise fights between five rivals (i.e., AB, BC, CD, DE), with the outcomes implying a dominance ordering of A>B>C>D>E. When subsequently given a choice between B and D, observers preferred to spend more time adjacent to D rather than B.
Results such as these demonstrate that the ability to keep track of the dominance relations between tertiary dyads is not limited to nonhuman primates or even to mammals (cf. Tomasello & Call (Reference Tomasello and Call1997). Furthermore, fish and birds, in addition to nonhuman primates, can apparently use this information to make rational (i.e., ecologically adaptive) choices about how to respond to potential rivals (see also Bergman et al. (Reference Bergman, Beehner, Cheney and Seyfarth2003; Bond et al. Reference Bond, Kamil and Balda2003; Hogue et al. Reference Hogue, Beaugrand and Lague1996; Silk Reference Silk1999). The accumulated evidence therefore rules out a traditional associative explanation and strongly supports a more complex, information-processing account of how nonhuman animals keep track of and respond to dominance relations among conspecifics.
But none of the available comparative evidence suggests that nonhuman animals are able to process transitive inferences in a systematic or logical fashion, even in the social domain. The experiments reported by Paz et al. (Reference Paz, Bond, Kamil and Balda2004) and Grosenick et al. (Reference Grosenick, Clement and Fernald2007) provide evidence for only one particular kind of transitive inference: an inference from watching a series of agonistic interactions between conspecifics to an egocentric prediction about how to respond to a potentially dominant rival. Neither experiment provides any evidence that these subjects would also be able to systematically predict the relation between unobserved third-party dyads or could use their own interactions with a conspecific to predict that conspecific's relation to other rivals – let alone answer the kind of omni-directional queries of which humans are manifestly capable: For example, what individuals are dominant to B? What is the relation between C and A? Is A dominant to C to a greater or lesser extent than B is dominant to C? (Goodwin & Johnson-Laird Reference Goodwin and Johnson-Laird2005; Halford et al. Reference Halford, Wilson and Phillips1998a).
In short, whereas at least some nonhuman animals clearly are able to make transitive inferences about their own relation to potential rivals to a degree that rules out purely associative learning mechanisms, the comparative evidence accumulated to date is nevertheless consistent with the hypothesis that nonhuman animals' understanding of transitive relations is punctate, egocentric, non-logical, and context-specific.
6. Hierarchical relations
Being able to process recursive operations over hierarchical relations is unarguably a key prerequisite for using a human language (Hauser et al. Reference Hauser, Chomsky and Fitch2002a). And most normal human children are capable of reasoning about hierarchical class relations in a systematic and combinatorial fashion by the age of five (Andrews & Halford Reference Andrews and Halford2002; cf. Inhelder & Piaget Reference Inhelder and Piaget1964). Given the ubiquity and importance of hierarchical relations in human thought, the lack of any similar ability in nonhuman animals would therefore constitute a marked discontinuity between human and nonhuman minds.
6.1. Seriated cups and hierarchical reasoning
A number of comparative researchers have reinterpreted the behavior of nonhuman animals in hierarchical terms (e.g., Byrne & Russon Reference Byrne and Russon1998; Greenfield Reference Greenfield1991; Matsuzawa Reference Matsuzawa, McGrew, Marchant and Nishida1996). In each of these cases, however, there is no evidence that the nonhuman animals themselves cognized the task in hierarchical terms or employed hierarchically structured mental representations to do so. The most widely cited case of hierarchical reasoning among nonhuman animals, for example, has come from experiments involving seriated cups. It has been claimed that “subassembly” (i.e., combining two or more cups as a subunit with one or more other cups) requires the subject to represent these nested relations in a combinatorial and “reversible” fashion (Greenfield Reference Greenfield1991; Westergaard & Suomi Reference Westergaard and Suomi1994). Indeed, Greenfield (Reference Greenfield1991) argued that children's ability to nest cups develops in parallel with their ability to employ hierarchical phonological and grammatical constructions, and therefore, that the ability of nonhuman primates to seriate cups is the precursor to comprehending hierarchical grammars (see Matsuzawa Reference Matsuzawa, McGrew, Marchant and Nishida1996 for claims of a similar “isomorphism” between tool and symbol use).
But is it actually necessary to cognize hierarchically structured relations in order to assemble nested cups? To date, Johnson-Pynn, Fragaszy, and colleagues have provided the most convincing evidence that a nonhuman animal can use subassembly to assemble seriated cups (Fragaszy et al. Reference Fragaszy, Galloway, Johnson-Pynn and Brakke2002; Johnson-Pynn & Fragaszy Reference Johnson-Pynn and Fragaszy2001; Johnson-Pynn et al. Reference Johnson-Pynn, Fragaszy, Hirsh, Brakke and Greenfield1999). Yet, Johnson-Pynn and Fragaszy themselves dispute the claim that this behavior requires hierarchical relational operations of the kind suggested by Greenfield (Reference Greenfield1991).
Fragaszy et al. (Reference Fragaszy, Galloway, Johnson-Pynn and Brakke2002), for example, presented seriated cups to adult capuchin monkeys, chimpanzees, and 11-, 16-, and 21-month-old children. Children of all three ages created five-cup sets less consistently than the nonhuman subjects did, and they were rarely able to place a sixth cup into a seriated set. Bizarrely, at least for a purely relational interpretation of the results, monkeys were more successful than either apes or human children on the more challenging six-cup trials, yet were also the most inefficient (in terms of number of moves) of the three populations.
Fragaszy et al.'s (2002) explanation for these anomalous results is quite sensible (see also Fragaszy & Cummins-Sebree Reference Fragaszy and Cummins-Sebree2005): They hypothesize that the seriation task does not, in fact, require the subject to reason about combinatorial, hierarchical relations per se, but depends more simply on situated, embodied sensory-motor skills that are experientially, rather than conceptually, driven. Apes and monkeys do better than children because they are more physically adept than 11- to 21-month-old children are – not because they have a more sophisticated representation of the combinatorial and hierarchical relations involved. Although subassembly may be a more physically “complex” strategy than other methods of seriation, it does not necessarily require the subject to cognize the spatial-physical relations involved as hierarchical; and therefore there is no reason to claim an isomorphism between the embodied manipulation of nested cups and the cognitive manipulation of symbolic-relational representations (cf. Greenfield Reference Greenfield1991; Matsuzawa Reference Matsuzawa, McGrew, Marchant and Nishida1996).
6.2. Hierarchical relations in the wild
The strongest evidence to date in support of the claim that nonhuman animals can reason about hierarchically structured relations in the social domain comes from Bergman et al.'s (2003) study of free-ranging baboons. Bergman et al. designed an elegant playback experiment in which female baboons heard a sequence of recorded calls mimicking a fight between two other females. Mock agonistic confrontations were created by playing the “threat-grunt” of one individual followed by the subordinate screams of another. On separate days, the same subject heard one of three different call sequences: (1) an anomalous sequence mimicking a rank reversal between members of the same matrilineal family (i.e., sisters, mothers, daughters, or nieces); (2) an anomalous sequence mimicking a between-family rank reversal (i.e., between members of two different matrilineal families in which one of the families is dominant to the other); or (3) a control sequence replicating an existing dominant-subordinate relationship (i.e., no rank reversal) using between-family or within-family dyads. As predicted, there was a significant difference in the focal subjects' responses to the three different kinds of call sequences. Subjects looked longest at between-family rank reversals. There was no significant difference between within-family reversals and no-reversal control sequences. According to Bergman et al., the reason the baboons responded more strongly to between-family rank reversals than within-family sequences is because the baboons recognized that the former imply a superordinate reorganization of matrilineal subgroups. Bergman et al. (Reference Bergman, Beehner, Cheney and Seyfarth2003, p. 1236) conclude: “Our results suggest that baboons organize their companions into a hierarchical, rule-governed structure based simultaneously on kinship and rank” (see also Seyfarth et al. Reference Seyfarth, Cheney and Bergman2005).
In our view, the evidence reported by Bergman et al. (Reference Bergman, Beehner, Cheney and Seyfarth2003) does not support this conclusion. Even if baboons do make a categorical distinction between kin and non-kin dyads based on interaction history, familiarity, spatial proximity, phenotypic cues, or some other observable regularity (see Silk Reference Silk2002a for a review of the possibilities), this does not necessarily mean that they represent the entire matrilineal social structure as an integrated relational schema in which non-kin relations are logically superordinate to between-kin relations. As Bergman et al. (Reference Bergman, Beehner, Cheney and Seyfarth2003) themselves point out, between-family rank reversals are much more disruptive to baboon social life than within-family rank reversals. Therefore, Bergman et al.'s (Reference Bergman, Beehner, Cheney and Seyfarth2003) results are consistent with the hypothesis that female baboons have learned that rank reversals among non-kin are more salient (i.e., associated with greater social turmoil and personal risk) than are within-kin rank reversals occurring in someone else's family (notably, Bergman et al. did not test rank reversals within the focal subject's own family). While baboons clearly recognize particular conspecifics' vocalizations and represent dominance and kin relations in a combinatorial manner, there is nothing in Bergman et al.'s data that remotely suggests a higher-order, hierarchical relation among these representations.
Once again, there is not simply an absence of evidence; there is evidence of an absence. Bergman et al. (Reference Bergman, Beehner, Cheney and Seyfarth2003) note that the subjects' responses to apparent rank reversals were unrelated to the rank distance separating the two signalers: that is, subjects paid as much attention to mock rank reversals involving closely ranked opponents as those involving more distantly ranked opponents. Bergman et al. use this fact to rebut the hypothesis that the baboons were responding more strongly to between-family rank reversals simply because the individuals involved had more disparate ranks. However, the data cut both ways: If the baboons did cognize the relation between female conspecifics as an integrated matrilineal dominance hierarchy, ceteris paribus, they should have been more surprised at a rank reversal between a very low ranking and a very high ranking individual than by a rank reversal between two individuals of adjacent ranks. Ironically, Bergman et al.'s results provide some of the strongest evidence to date that female baboons do not, in fact, cognize the structure of their conspecifics' matrilineal social relationships in a systematic or hierarchical fashion.
7. Causal relations
There is ample evidence that traditional associationist models are inadequate to account for nonhuman causal cognition; but the available comparative evidence also suggests that there is a critical and qualitative difference between the ways that human and nonhuman animals reason about causal relations (see Penn & Povinelli Reference Penn and Povinelli2007a for a more extensive review and discussion). Humans explicitly reason in terms of unobservable and/or hidden causes (Hagmayer & Waldmann Reference Hagmayer and Waldmann2004; Kushnir et al. 2005; Saxe et al. Reference Saxe, Tenenbaum and Carey2005), distinguish between “genuine” and “spurious” causes (Lien & Cheng Reference Lien and Cheng2000), reason diagnostically from effects to their possible causes (Waldmann & Holyoak Reference Waldmann and Holyoak1992), and plan their own interventions in a quasi-experimental fashion to elucidate ambiguous causal relations (Hagmayer et al. Reference Hagmayer, Sloman, Lagnado, Waldmann, Gopnik and Schulz2007). Numerous researchers have argued that normal humans – not just scientists or philosophers – form “intuitive theories” or “mental models” about the unobservable principles and causal forces that shape relations in a specific domain (e.g., Carey Reference Carey1985; Gopnik & Meltzoff Reference Gopnik and Meltzoff1997; Keil Reference Keil1989; Murphy & Medin Reference Murphy and Medin1985). These tacit systems of higher-order relations at various levels of generality modulate how human subjects judge and discover novel relations within those domains by a process akin to analogical inference (Goldvarg & Johnson-Laird Reference Goldvarg and Johnson-Laird2001; Lee & Holyoak Reference Lee, Holyoak, McNamara and Trafton2007; Lien & Cheng Reference Lien and Cheng2000; Tenenbaum et al. Reference Tenenbaum, Griffiths, Niyogi, Gopnik and Schulz2007). In short, the ability to reason about higher-order, analogical relations in a systematic and productive fashion appears to be an integral aspect of human causal cognition.
In stark contrast to the human case, there is no compelling evidence that nonhuman animals form tacit theories about the unobservable causal mechanisms at work in the world, seek out explanations for anomalous causal relations, reason diagnostically about unobserved causes, or distinguish between genuine and spurious causal relations on the basis of their prior knowledge of abstract causal mechanisms.Footnote 2 Indeed, there is consistent evidence of an absence across a variety of protocols (see, e.g., Penn & Povinelli Reference Penn and Povinelli2007a; Povinelli Reference Povinelli2000; Povinelli & Dunphy-Lelii Reference Povinelli and Dunphy-Lelii2001; Visalberghi & Tomasello Reference Visalberghi and Tomasello1998).
A variety of nonhuman animal species – and certainly not primates alone (Emery & Clayton Reference Emery and Clayton2004b) – are able to construct and use tools in a flexible and adaptive fashion. But a series of seminal experiments, initiated by Visalberghi and colleagues (see Visalberghi & Limongelli Reference Visalberghi, Limongelli, Russon, Bard and Parker1996 for a review), provides a particularly compelling example of how nonhuman animals' remarkable use of tools nevertheless belies a fundamental discontinuity with our human understanding of causal relations.
Visalberghi and Limongelli (Reference Visalberghi and Limongelli1994) tested capuchin monkeys' ability to retrieve a piece of food placed inside a transparent tube using a straight stick. In the middle of the tube, there was a highly visible hole with a small transparent cup attached. If the subject pushed the food over the hole, the food fell into the cup and was inaccessible (“trap-down” condition). Visalberghi and Limongelli (Reference Visalberghi and Limongelli1994) tested four capuchin monkeys to see whether they would understand that they needed to push the food out the end of the tube away from the hole. After about 90 trials, only one out of the four capuchin monkeys learned to push the food away from the hole, and even this one learned the correct behavior through trial and error. Worse, once the experimenters rotated the tube so that the trap hole was now facing up and causally irrelevant (“trap-up” condition), the one successful capuchin still persisted in treating the hole as if it needed to be avoided – making it obvious that even this subject misunderstood the causal relation between the trap hole and the retrieval of the reward.
Povinelli (Reference Povinelli2000) and colleagues subsequently replicated Visalberghi's trap-tube protocol with seven chimpanzees. Povinelli performed the experiments once when the chimpanzees were juveniles (5 to 6 years old) and again when they were young adults (10 years old). Three out of the seven chimps learned to solve the trap-down version of the task as adults, with one chimp, Megan, learning to solve the task within 100 trials. However, none of the chimps showed any evidence of distinguishing between the trap-up and trap-down versions of the task. By way of comparison, it should be noted that children as young as 3 years of age successfully solve the trap-tube task after only a few trials (see Limongelli et al. Reference Limongelli, Boysen and Visalberghi1995).
Recently, Mulcahy and Call (Reference Mulcahy and Call2006b) tested ten great apes on a version of the trap-tube task that allowed subjects to choose whether to pull or push the reward through the tube. Three out of the ten subjects learned to avoid the trap when pulling rather than pushing. However, the majority of subjects still failed the task. Indeed, even the three successful subjects took an average of 44 trials to achieve above-chance performance, and then continued to fail Visalberghi and Limongelli's (1994) push-only version of the task. Therefore, these latest results seem to confirm two earlier hypotheses: (1) nonhuman apes are more adept at pulling than pushing in tool-use tasks such as these (see, e.g., Povinelli (Reference Povinelli2000, Ch. 5); and (2) nonhuman primates' causal knowledge is tightly coupled to specific task parameters and bodily movements: in particular, they do not appear to grasp the abstract, analogical similarity between perceptually disparate but functionally equivalent tasks (Penn & Povinelli Reference Penn and Povinelli2007a; Povinelli Reference Povinelli2000; Visalberghi & Tomasello Reference Visalberghi and Tomasello1998).
Nonhuman primates are not the only animals that seem to be incapable of cognizing the general causal principles at issue in the trap-tube task. Seed et al. (Reference Seed, Tebbich, Emery and Clayton2006) recently presented eight rooks with a clever modification to Visalberghi's trap-tube task in which each tube contained two traps, one which was functional and one which was not. Seven out of eight rooks rapidly learned to pull the food away from the functional trap and successfully transferred this solution to a novel but perceptually similar version of the task. Nevertheless, when presented with transfer tasks in which the visual cues that were associated with success in the initial tasks were absent or confounded, only one of the seven subjects passed. In a follow-up experiment (Tebbich et al. (Reference Tebbich, Seed, Emery and Clayton2007), none of the rooks passed the transfer tasks.
Seed et al.'s (2006) results add to the growing evidence that corvids are quite adept at using stick-like tools (see, e.g., Weir & Kacelnik (Reference Weir and Kacelnik2007). But as Seed et al. (Reference Seed, Tebbich, Emery and Clayton2006) point out, these results also suggest that rooks share a common cognitive limitation with nonhuman primates: they do not understand “unobservable causal properties” such as gravity and support; nor do they reason about the higher-order relation between causal relations in an analogical or theory-like fashion. Instead, rooks, like other nonhuman animals, appear to solve tool-use problems based on evolved, domain-specific expectations about what perceptual features are likely to be most salient in a given context and a general ability to reason about the causal relation between observable contingencies in a flexible, goal-directed but task-specific fashion (see also Penn & Povinelli Reference Penn and Povinelli2007a).
8. Theory of mind
Nonhuman animals certainly manifest many sophisticated social-cognitive abilities. But having a theory of mind (ToM) sensu Premack and Woodruff (Reference Premack and Woodruff1978) means something more specific than being a socially savvy animal: it means being able to impute unobservable, contentful mental states to other agents and then to reason in a theory-like fashion about the causal relation between these unobservable mental states and the agents' subsequent behavior (see Penn & Povinelli Reference Penn and Povinelli2007b for a more extensive discussion of this point). Of course, theory-like inferences are not the only way in which a cognizer might reason about other agents' mental states (see Carruthers & Smith Reference Carruthers and Smith1996 for a review of the possibilities). Mentalistic simulation, for example, provides an alternative and popular explanation. However, all but the most radical simulation-oriented theories do not deny that humans represent causal relations involving other agents' unobservable mental states. They simply propose an alternative, analogical mechanism for how humans do so.
Whiten (Reference Whiten, Carruthers and Smith1996; Reference Whiten and Sperber2000) has proposed another, influential hypothesis about how nonhuman apes (and young children) might represent the mental states of their conspecifics without relying on theory-like metarepresentations. Whiten proposed that nonhuman apes use “intervening variables” to stand in for generalizations about the causal role played by a given mental state in a set of disparate behavioral patterns. For example, a chimpanzee that encodes the observable patterns “X saw Y put food in bin A,” “X hid food in bin A,” and “X sees Y glancing at bin A” as members of the same abstract equivalence class could be said, on Whiten's account, to recognize that “X knows food is in bin A” and, therefore, be capable of “explicit mindreading” (Whiten Reference Whiten, Carruthers and Smith1996).
Notice that Whiten's example of “explicit mindreading” is a textbook example of analogical reasoning: Whiten's hypothetical chimpanzee must infer a systematic higher-order relation among disparate behavioral patterns that have nothing in common other than a shared but unobservable causal mechanism: that is, what X “knows.” If this is an “intervening variable,” it is an intervening variable that requires reasoning about the higher-order, role-governed relational similarity between perceptually disparate causal relations in order to be produced.
We believe Whiten is right in this sense: If a nonhuman animal were capable of inferring that these disparate behavioral patterns were actually instances of the same superordinate causal relation, then the animal would surely have demonstrated that it possessed a ToM and the ability to reason analogically, as well. There is, however, no such evidence on offer. Indeed, until recently, there has been a fragile consensus that nonhuman animals lack anything even remotely resembling a ToM (Cheney & Seyfarth Reference Cheney, Seyfarth and Peterson1998; Heyes Reference Heyes1998; Tomasello & Call Reference Tomasello and Call1997; Visalberghi & Tomasello Reference Visalberghi and Tomasello1998).
A few years ago, however, Hare et al. (Reference Hare, Call, Agnetta and Tomasello2000; Reference Hare, Call and Tomasello2001) reported “breakthrough” evidence that chimpanzees do, in fact, reason about certain psychological states in their conspecifics (see, particularly, Tomasello et al. Reference Tomasello, Call and Hare2003a; Reference Tomasello, Call and Hare2003b). And since then, there have been a flurry of similar claims on behalf of corvids and monkeys based on similar protocols (Bugnyar & Heinrich Reference Bugnyar and Heinrich2005; Reference Bugnyar and Heinrich2006; Dally et al. Reference Dally, Emery and Clayton2006; Emery & Clayton Reference Emery and Clayton2001; in press; Flombaum & Santos Reference Flombaum and Santos2005; Santos et al. Reference Santos, Nissen and Ferrugia2006). Because Povinelli and colleagues have provided detailed critiques of Hare et al.'s (Reference Hare, Call, Agnetta and Tomasello2000; Reference Hare, Call and Tomasello2001) protocol and results elsewhere (see Penn & Povinelli Reference Penn and Povinelli2007b; Povinelli Reference Povinelli2004; Povinelli & Vonk Reference Povinelli and Vonk2003; Reference Povinelli and Vonk2004), here we will focus on the best available evidence for a ToM system among non-primates. As will become apparent, our original critique of Hare et al.'s (Reference Hare, Call, Agnetta and Tomasello2000; Reference Hare, Call and Tomasello2001) protocol applies, mutatis mutandis, to the new claims being made on behalf of corvids, as well.
The best evidence for a ToM system in a non-primate comes from the work of Emery, Clayton and colleagues (Emery & Clayton Reference Emery and Clayton2001; Reference Emery and Clayton2004b; in press). Dally et al. (Reference Dally, Emery and Clayton2006), for example, had scrub-jays cache food items under one of four conditions: (1) in the presence of a dominant conspecific, (2) in the presence of a subordinate, (3) in the presence of the storer's preferred partner, or (4) in private. The storers were allowed to cache the food in two trays, one nearer and one farther away from the observer, and then they were allowed to recover their caches in private three hours later. Dally et al. (Reference Dally, Emery and Clayton2006) showed that birds that had stored food in the presence of a dominant or subordinate competitor tended to re-cache food predominantly from the near tray, and that the proportion of food that was re-cached was greatest for birds that had stored food in the presence of a dominant competitor. In a follow-up experiment, scrub-jays were given the chance to cache successively in two trays, each in view of a different observer. After three hours, storers were allowed to recover their caches. Dally et al. (Reference Dally, Emery and Clayton2006) reported that significantly more food caches were re-cached when a previous observer was present than when the storers retrieved their caches in private or in view of a control bird that had not witnessed the original caching. Furthermore, if a previous observer was present, storers tended to re-cache from the tray that the previous observer had actually observed.
Results such as these leave no doubt that corvids are remarkably intelligent creatures, able to keep track of the social context of specific past events, as well as the what, when, and where information associated with those events (Clayton et al. Reference Clayton, Griffiths, Emery and Dickinson2001). But nothing in the results reported to date suggests that corvids actually reason about their conspecifics' mental states – or even understand that their conspecifics have mental states at all – as distinct from their conspecifics' past and occurrent behaviors and the subjects' own knowledge of past and current states of affairs (Penn & Povinelli Reference Penn and Povinelli2007b; Povinelli et al. Reference Povinelli, Bering and Giambrone2000; Povinelli & Vonk Reference Povinelli and Vonk2003; Reference Povinelli and Vonk2004).Footnote 3
In the case of Dally et al.'s (2006) experiment, for example, it suffices for the subjects to keep track of which competitor was present during which caching event and to formulate strategies on the basis of observable features of the task alone: for example, <Re-cache food if a competitor has oriented towards it in the past>, <Try to cache food in sites that are farther away from potential competitors>, <Attempt to pilfer food if the competitor that cached it is not present>, and so on. Since none of the protocols required the subjects to reason in terms of the specific contents of the competitor's epistemic mental states, the additional inference that the subjects acted the way they did because they understood that <The competitor knows where the food is located> does no additional cognitive or explanatory work. This additional mentalistic claim merely satisfies our all-too-human need to posit an explicit, conscious, propositional reason for the birds' behaviors. But it is obvious that animals – including humans – do not necessarily need to “know” why they are acting the way they are acting in order for a behavior to be flexible, effective, and (biologically) rational (see lucid discussions by Heyes & Papineau Reference Heyes, Papineau, Nudds and Hurley2006; Kacelnik Reference Kacelnik, Hurley and Nudds2006).
Indeed, many of the same researchers who claim evidence for ToM abilities in corvids explicitly acknowledge that an explanation based on responding to observed cues alone would be sufficient to account for the existing data. Dally et al. (Reference Dally, Emery and Clayton2006, p. 1665), for example, point out that scrub-jays' ability to keep track of which competitors have observed which cache sites “need not require a human-like ‘theory of mind’ in terms of unobservable mental states, but […] may result from behavioral predispositions in combination with specific learning algorithms or from reasoning about future risk.” Similarly, Bugnyar and Heinrich (Reference Bugnyar and Heinrich2006, p. 374) acknowledge that a representation of “states in the physical world” and “responses to subtle behavioral cues given by the competitor” would be sufficient to explain the available evidence concerning the manipulative behaviors of ravens – as well, we would add, as all the other comparative evidence claiming to show ToM-like abilities in nonhuman animals to date (for examples of the kind of protocols that could, in principle, provide evidence for a ToM system in a nonhuman animal, see Penn & Povinelli Reference Penn and Povinelli2007b).
9. Explaining the discontinuity
Up to this point in the article, we have focused solely on showing that there is, in fact, a pervasive functional discontinuity between human and nonhuman minds, and that this discontinuity is located specifically in the way that human and nonhuman animals reason about relations. Now we turn to the daunting question of how to account for this pervasive discontinuity. Let us first consider the three most influential hypotheses that have been proposed in recent years.
9.1. The massive modularity hypothesis
A “modular” explanation for the evolution of human cognition is popular among many evolutionary-minded theorists (e.g., Barkow et al. Reference Barkow, Cosmides and Tooby1992). Certainly, many central cognitive processes – including almost all of the cognitive mechanisms we share with nonhuman animals – are at least moderately modular once the notion of modularity has been defined in a purely functional sense (see Barrett Reference Barrett and Kurzban2006). But the modular story alone does not provide a satisfying explanation for the disparity between human and nonhuman minds.
As we have seen in our review of the comparative evidence, the pattern of similarities and differences between human and nonhuman relational reasoning is remarkably consistent across every domain of cognition, from same-different reasoning and spatial relations to tool use and ToM. Therefore, it seems highly implausible that the disparities in each domain are the result of independent, module-specific adaptations. It seems much more likely (not to mention, parsimonious) that a common set of specializations – perhaps in some more general “supermodule” – is responsible for augmenting the relational capabilities of all of the cognitive modules we inherited from our nonhuman ancestors. Unfortunately, the two most popular supermodules that have been proposed to date – ToM and language – do not do a good job of accounting for the comparative evidence.
9.2. The ToM hypothesis
A number of comparative researchers believe that the discontinuity between human and nonhuman minds can be traced back to some limitation in nonhuman animals' social-cognitive abilities (e.g., Cheney & Seyfarth Reference Cheney, Seyfarth and Peterson1998; Terrace Reference Terrace, Terrace and Metcalfe2005a; Tomasello et al. Reference Tomasello, Carpenter, Call, Behne and Moll2005). Although we certainly agree that nonhuman animals do not appear to possess anything remotely resembling a ToM, the hypothesis that some aspect of our ToM alone is responsible for the disparity between human and nonhuman cognition seems difficult to sustain. For example, it is very hard to see how a discontinuity in social-cognitive abilities alone could explain the profound differences between human and nonhuman animals' abilities to reason about causal relations in the physical world or nonhuman animals' inability to reason about higher-order spatial relations. Even Tomasello and his colleagues have admitted that trying to explain all the differences between human and nonhuman cognition in terms of a difference in ToM skills is “highly speculative” at best (Tomasello & Call Reference Tomasello and Call1997, p. 418). Indeed, in a different context, Tomasello has himself argued (e.g., Tomasello Reference Tomasello2000) that human language learners rely on cognitive capacities – such as analogical reasoning and abstract rule learning – that are independent from ToM and absent in nonhuman animals. So while our ability to participate in collaborative activities and to take each others' mental states into account may be a distinctive feature of the human lineage, it is clearly not the only or even the most basic one.
9.3. The language-only hypothesis
The oldest and still most popular explanation for the wide-ranging disparity between human and nonhuman animals' cognitive abilities is language (for recent examples of this venerable argument, see Bermudez Reference Bermúdez2003; Carruthers Reference Carruthers2002; Clark Reference Clark2006). Dennett (Reference Dennett1996, p. 17) described the extreme version of this hypothesis in characteristically pithy terms: “Perhaps the kind of mind you get when you add language to it is so different from the kind of mind you can have without language that calling them both minds is a mistake.”
To be sure, language clearly plays an enormous and crucial role in subserving the differences between human and nonhuman cognition. But we believe that language alone is not sufficient to account for the discontinuity between human and nonhuman minds. In order to make our case, we need to distinguish between three distinct versions of the language-only hypothesis: (1) that verbalized (or imaged) natural language sentences are responsible for the disparity between human and nonhuman cognition; (2) that some aspect of our internal “language faculty” is responsible for the disparity; and (3) that the communicative and/or cognitive function of language served as the prime mover in the evolution of the uniquely human features of the human mind.
9.3.1. Are natural language sentences what makes the human mind human?
Natural language tokens clearly play an enormous role in “extending” and even in “rewiring” the human mind (Bermudez Reference Bermúdez2005; Clark Reference Clark2006; Dennett Reference Dennett1996). Gentner and colleagues, for example, have shown that relational labels play an instrumental role in facilitating young human learners' sensitivity to relational similarities and potential analogies (Gentner & Rattermann Reference Gentner, Rattermann, Gelman and Byrnes1991; Loewenstein & Gentner Reference Loewenstein and Gentner2005). Our ability to reason about large quantities of countable objects in a generative and systematic fashion seems to require the acquisition of numeric symbols and a linguistic counting system (Bloom & Wynn Reference Bloom and Wynn1997). Numerous studies have shown that subjects with language impairment exhibit a variety of cognitive deficits (e.g., Baldo et al. Reference Baldo, Dronkers, Wilkins, Ludy, Raskin and Kim2005) and that deaf children from hearing families (i.e., “late signers”) show persistent deficits in ToM tasks (see Siegal et al. Reference Siegal, Varley and Want2001 for a review). Furthermore, there is good evidence that a child's ability to pass certain kinds of ToM tests is intricately tied to the acquisition of specific sentential structures (de Villiers Reference deVilliers, Baron-Cohen, Tager-Flusberg and Cohen2000). Normal human cognition clearly depends on normal linguistic capabilities.
But although natural language clearly subserves and catalyzes normal human cognition, there is compelling evidence that the human mind is distinctively human even in the absence of normal natural language sentences (see Bloom Reference Bloom2000; Garfield et al. Reference Garfield, Peterson and Perry2001; Siegal et al. Reference Siegal, Varley and Want2001). Varley and Siegal Reference Varley and Siegal2000), for example, studied the higher-order reasoning abilities of an agrammatic aphasic man who was incapable of producing or comprehending sentences and whose vocabulary was essentially limited to perceptual nouns. In particular, he had lost all his vocabulary for mentalistic entities such as “beliefs” and “wants.” Yet this patient continued to take care of the family finances and passed a battery of causal reasoning and ToM tests (see also Varley et al. Reference Varley, Siegal and Want2001; Reference Varley, Klessinger, Romanowski and Siegal2005). Although late-signing deaf children's cognitive abilities may not be “normal,” they nevertheless manifest grammatical, logical, and causal reasoning abilities far beyond those of any nonhuman subject (Peterson & Siegal Reference Peterson and Siegal2000). And the many remarkable cases of congenitally deaf children spontaneously “inventing” gestural languages with hierarchical and compositional structure provide further confirmation that the human mind is indomitably human even in the absence of normal linguistic enculturation (see, e.g., Goldin-Meadow Reference Goldin-Meadow2003; Sandler et al. Reference Sandler, Meir, Padden and Aronoff2005; Senghas et al. Reference Senghas, Kita and Ozyurek2004).
Of course, the process of learning a language may “rewire” the human brain in ways that make certain kinds of cognition possible that would not be possible otherwise, even if the subject subsequently loses the ability to use language later in life. But this ontogenetic version of the “rewiring hypothesis” (Bermudez Reference Bermúdez2005) begs the question of what allows language to so profoundly rewire the human mind, but no other.
Over the last 35 years, comparative researchers have invested considerable effort in teaching nonhuman animals of a variety of taxa to use and/or comprehend language-like symbol systems. Many of these animals have experienced protracted periods of enculturation that rival those of modern (coddled) human children. The stars of these animal language projects have indeed been able to approximate certain superficial aspects of human language, including the ability to associate arbitrary sounds, tokens, and gestures with external objects, properties, and actions and a rudimentary sensitivity to the order in which these “symbols” appear when interpreting novel “sentences” (Herman et al. Reference Herman, Richards and Wolz1984; Pepperberg Reference Pepperberg2002; Savage-Rumbaugh & Lewin Reference Savage-Rumbaugh and Lewin1994; Schusterman & Krieger Reference Schusterman and Krieger1986). But even after decades of exhaustive training, no nonhuman animal has demonstrated a clear mastery of abstract grammatical categories, closed-class items, hierarchical syntactic structures, or any of the other defining features of a human language (cf. Kako Reference Kako1999). Furthermore, there is still no evidence that symbol-trained animals are any more adept than symbol-naive ones at reasoning about unobservable causal forces, mental states, analogical inferences, or any of the other tasks that require the ability to cognize higher-order relations in a systematic, structural fashion (cf. Thompson & Oden Reference Thompson and Oden2000).
If the history of animal language research demonstrates nothing else, it demonstrates that you cannot create a human mind simply by taking a nonhuman one and teaching it to use language-like symbols. There must be substantive differences between human and nonhuman minds that allow the former, but not any of the latter, to master grammatically structured languages to begin with (cf. Clark Reference Clark2001).
9.3.2. Is some aspect of the human language faculty the key?
A more plausible variation on the language-only hypothesis is that some aspect of our internal faculty for language is responsible for our unique cognitive abilities. In a recent and influential version of this proposal, Hauser et al. (Reference Hauser, Chomsky and Fitch2002a) distinguish between the faculty of language in the narrow sense (FLN) and the faculty of language in the broad sense (FLB). They define FLN as including only the computational mechanisms specific to “narrow syntax” and to mapping syntactic representations into the systems of phonology and semantics. FLB, on the other hand, encompasses all the aspects of our sensory and cognitive systems that go into the production and comprehension of language, including the sensory-motor systems responsible for perceiving and producing the perceptual patterns of language, and the conceptual-intentional systems responsible both for representing the semantic/conceptual meaning of linguistic expressions and for reasoning about their implications. According to Hauser et al. (Reference Hauser, Chomsky and Fitch2002a), “most, if not all, of FLB is based on mechanisms shared with nonhuman animals” (p. 1573). On the narrowest and most ambitious version of their hypothesis (i.e., “Hypothesis 3,” p. 1573), the only aspect of human cognition that is qualitatively unique to our species is specific to FLN, and in particular, to the computational mechanisms responsible for recursion.
We believe the available comparative evidence firmly rules out the narrowest and most ambitious version of Hauser et al.'s (2002a) hypothesis. While the computational mechanisms responsible for recursion – at least the kind of recursion characteristic of human languages – certainly appear to be unique to the human mind, there are many other aspects of human languages that are also uniquely human but not included in Hauser et al.'s (2002a) construal of FLN (see Pinker & Jackendoff Reference Pinker and Jackendoff2005). More generally, over the course of this article, we have argued that there are many aspects of the human conceptual-intentional system that are unique to human subjects but are not specifically linguistic, ranging from our ability to reason about hierarchical social relations to our ability to theorize about unobservable causal mechanisms and mental states. Some of these cognitive capabilities also seem to require recursive operations over hierarchically structured representations (see our discussion regarding “the proper treatment of symbols in a nonhuman cognitive architecture” in section 10 of this article), suggesting that recursion is not specific to FLN. Indeed, Hauser et al. (Reference Hauser, Chomsky and Fitch2002a) themselves suggest that recursion evolved first in some noncommunicative domain. So, even according to their own hypothesis, the discontinuity between human and nonhuman minds presumably began before the evolution of the language faculty narrowly construed – although their hypothesis leaves unanswered what exactly changed in the human conceptual-intentional system to allow for the advent of recursive operations over hierarchically structured representations.
Carruthers (Reference Carruthers2002; Reference Carruthers, Carruthers, Laurence and Stich2005a) has proposed a much broader – and, we believe, more plausible – role for the language faculty in subserving human cognition. Carruthers argues that the distinctively human capacity for non–domain-specific, cross-modular thinking implicates representations in what Chomsky (Reference Chomsky1995) calls “logical form” (LF).Footnote 4 The LF hypothesis has much to recommend it. We do not doubt, for example, that there are many human cognitive abilities that rely on linguaform representations, including, but certainly not limited to, our ability to reinterpret our own thoughts in a propositional and domain-general fashion. What we dispute, however, is the implication that, aside from our language faculty, human and nonhuman minds are fundamentally the same.
Our review of the comparative evidence has highlighted a number of domains in which human subjects are able to reason in a fashion that seems beyond the grasp of any nonhuman animal. In order to support the claim that LF representations alone are responsible for the discontinuity between human and nonhuman cognition, one would have to argue that all of these uniquely human abilities – from our ability to reason about higher-order causal relations to our ability to impute unobservable mental states – are causally dependent on LF representations. But this seems inconsistent with the available evidence. Carruthers (Reference Carruthers2002) himself argues that the full-fledged, uniquely human ToM system that comes online at about four years of age is essentially “language-independent” in its mature form (p. 672). Hence, ToM provides at least one example of a cognitive module that is distinctively human but that is not entirely dependent on occurrent LF representations. Causal and logical reasoning provide two further examples. The disparities we highlighted between human and nonhuman causal cognition often occur in highly domain-specific, embodied tasks – for example, pushing a food reward out of a tube – which would seem to definitively rule out Carruthers's hypothesis that the discontinuity between human and nonhuman cognition is limited to non–domain-specific, cross-modular kinds of thinking. Furthermore, many prominent theories of logical and relational reasoning postulate that human subjects employ quasi-imagistic “mental models” (e.g., Goodwin & Johnson-Laird Reference Goodwin and Johnson-Laird2005). Carruthers (Reference Carruthers2002, p. 658) acknowledges the indispensable role these models play in human cognition; but there is good evidence that the mental models employed by human beings are non-sentential in structure and yet qualitatively different from those employed by nonhuman animals.
Finally, while LF representations may very well be necessary to reason about certain kinds of higher-order relations, particularly those involving linguistically mediated representations (Bermudez Reference Bermúdez2003; Reference Bermúdez2005), there is little reason to believe that LF representations are necessary in order to reason about any sort of higher-order relations at all. Indeed, the available evidence suggests otherwise. We noted above that humans without any appreciable grammatical or linguistic ability are nevertheless often able to reason quasi-normally about higher-order causal relations and mental states (e.g., Siegal et al. Reference Siegal, Varley and Want2001). Conversely, subjects who suffer from frontal forms of frontotemporal dementia show selective impairment in the ability to integrate higher-order visuospatial relations (Waltz et al. Reference Waltz, Knowlton, Holyoak, Boone, Mishkin, Santos, Thomas and Miller1999) and to pass ToM tests (Gregory et al. Reference Gregory, Lough, Stone, Erzinclioglu, Martin, Baron-Cohen and Hodges2002) even when their linguistic abilities are still largely normal (see also Blair et al. Reference Blair, Marczinski, Davis-Faroque and Kertesz2007). This double dissociation suggests that the ability to reason about higher-order relations is not entirely encapsulated within our language faculty narrowly construed (i.e., FLN).
9.3.3. Did the adaptive functions of language drive the evolution of the human mind?
The third – and, we believe, most plausible – version of the language-only hypothesis is that the communicative and/or cognitive functions of language played an instrumental role in the evolution of the human brain. Learning a language seems to require the ability to cognize higher-order abstract relations in a systematic, generative, and structural fashion. And it seems indisputable – at least to us – that the language faculty, broadly construed, is the product of extensive evolutionary tinkering (Pinker & Bloom Reference Pinker and Bloom1990; Pinker & Jackendoff Reference Pinker and Jackendoff2005). So it is possible that our ability to reason about higher-order relations evolved first in order to accommodate the requirements of language, and then was co-opted, exported, and/or duplicated for other purposes in nonlinguistic domains.
But there are good reasons, we would argue, to favor a more complex, coevolutionary relationship between human thought and human language (see also Bloom Reference Bloom1994; Reference Bloom2000; Bloom & Keil Reference Bloom and Keil2001). While the advantages of symbolic communication are enormous, the adaptive advantages of being able to reason in a relational fashion have a certain primacy over the communicative function of language. It is quite difficult to imagine how communicating in hierarchically structured sentences would be of any use without the ability to entertain hierarchically structured thoughts. But it is quite easy to imagine how the ability to reason about higher-order relations – particularly causal and mentalistic relations – might be highly adaptive without the ability to communicate those thoughts to anyone else. If one is a tool-using bipedal ape in a rapidly-changing environment surrounded by ambitious and conniving conspecifics, the evolutionary advantages of reasoning about higher-order relations go far beyond the ability to communicate hierarchical thoughts to those conspecifics.
Over the course of this target article, we have argued that our ability to reason about higher-order relations subserves a wide variety of distinctively human capabilities. It seems possible that the adaptive advantages of one or more of these capabilities might have played a critical role in pushing the human brain in a relational direction either in conjunction with, or even prior to, the evolution of the language faculty narrowly construed. Our coevolutionary story does not make language an exaptation (cf. Hauser et al. (Reference Hauser, Chomsky and Fitch2002a); nor does it make our prelinguistic relational capabilities a “pre-adaptation” for language (cf. Christiansen & Kirby Reference Christiansen and Kirby2003); nor does it deny the enormous evolutionary importance that language has had in “rewiring” the human mind (cf. Bermudez Reference Bermúdez2005). We are simply hypothesizing that the communicative function of language may have been just one among a number of factors that pushed the cognitive architecture of our species in a relational direction.
In any case, regardless of which factors most strongly contributed to the unique evolution of the human brain, language alone is no longer directly and entirely responsible for the functional discontinuity between extant human and nonhuman minds.
10. On the proper treatment of symbols in a nonhuman cognitive architecture
The crux of the matter, then, is to identify the specific changes to the hominoid cognitive architecture that enabled Homo sapiens sapiens to reason about higher-order relations in a structurally systematic and inferentially productive fashion, and ultimately resulted in the evolution of our unique linguistic, mentalistic, logical, and causal reasoning abilities. Behavioral evidence from extant animal species alone cannot tell us what changed in the neural architecture of the human brain since the split from our nonhuman ancestors. But when that evidence is combined with recent advances in computational models of biological cognition, it becomes possible to sketch a fairly detailed representational-level specification of the kind of changes we should be looking for.
10.1. The PSS hypothesis
The classical school of thought in cognitive psychology has insisted for more than three decades that both human and nonhuman minds are the product of a physical symbol system (Fodor Reference Fodor1975; Reference Fodor1997; Fodor & McLaughlin Reference Fodor and McLaughlin1990; Fodor & Pylyshyn Reference Fodor and Pylyshyn1988; Newell Reference Newell1980; Reference Newell1990; Newell & Simon Reference Newell and Simon1976; Pinker & Prince Reference Pinker and Prince1988). According to the now familiar tenets of the physical symbol system (PSS) hypothesis, mental representations are composed of discrete, symbolic tokens, which can be combined into complex representations by forming syntactically structured relations of various types. Cognitive processes, according to the classical view, are rule-governed algorithms that operate over the formal structure of these mental representations in a truth-preserving fashion. The classic defense of the PSS hypothesis is that it provides a computational account for several of the most spectacular aspects of human thought, including our abilities to generalize rule-like relations over abstract categorical variables, to reason in an inferentially coherent fashion, and to use the artificial symbols of a natural language in a systematic, recursive, and generative manner (Fodor & Pylyshyn (Reference Fodor and Pylyshyn1988; Marcus Reference Marcus2001; Newell Reference Newell1980; Pinker & Prince Reference Pinker and Prince1988).
The PSS hypothesis is certainly not the “only game in town” (Fodor Reference Fodor1975). Indeed, the PSS hypothesis has been roundly criticized for a variety of reasons, ranging from its biological implausibility to its inability to deal with the graded semantic flexibility of many cognitive processes (e.g., Barsalou (Reference Barsalou1999; Clark Reference Clark1997; Rumelhart & McClelland Reference Rumelhart and McClelland1986; van Gelder Reference van Gelder1998). However, even among those who see the PSS hypothesis as fundamentally misguided, most would agree that human subjects are often able to approximate the systematic, higher-order, relational capabilities putatively associated with a PSS, at least in their linguistically mediated behavior. Smolensky (Reference Smolensky1999) calls this the “Symbolic Approximation” hypothesis: that is, the hypothesis that some – though certainly not all – aspects of human mental representations admit of abstract, higher-level descriptions that are closely approximated by the kinds of discrete, abstract structures posited by symbolic, linguistic theory.
In our view, the Symbolic Approximation hypothesis defines an essential and irreducible benchmark for any viable model of human cognition, far beyond the confines of linguistically mediated processes and symbolic, linguistic theory (Holyoak & Hummel Reference Holyoak, Hummel, Dietrich and Markman2000; Reference Holyoak, Hummel, Gentner, Holyoak and Kokinov2001; Hummel & Holyoak Reference Hummel and Holyoak2003; Reference Hummel and Holyoak2005). Although the classical version of the PSS hypothesis appears dead as a model of biological cognition in general, there are compelling reasons to believe that something closely approximating the functionality of a PSS is necessary in order to subserve the systematic, higher-order relational inferences of which human subjects are manifestly capable (Gentner Reference Gentner, Gentner and Goldin-Meadow2003; Goodwin & Johnson-Laird Reference Goodwin and Johnson-Laird2005; Halford et al. Reference Halford, Wilson and Phillips1998a; Holyoak & Thagard Reference Holyoak and Thagard1995). Indeed, all of the most successful neurally inspired computational models of relational reasoning employ – or at least try to approximate the capabilities of – a PSS (e.g., Eliasmith & Thagard Reference Eliasmith and Thagard2001; Hummel & Holyoak Reference Hummel and Holyoak2003; Plate Reference Plate2000; Wilson et al. Reference Wilson, Halford, Gray, Phillips, Gentner, Holyoak and Kokinov2001). In our view, the operational question for researchers interested in modeling the human mind should no longer be whether the human mind implements a PSS, but rather how the higher-order relational capabilities of a PSS can be combined with the associative and generalization capabilities of a nonclassical system in a neurally plausible cognitive architecture (for similar views but alternative answers to this question, see Eliasmith & Thagard Reference Eliasmith and Thagard2001; Marcus Reference Marcus2001; Plate Reference Plate2000; Pollack Reference Pollack1990; Shastri & Ajjanagadde Reference Shastri and Ajjanagadde1993; Smolensky Reference Smolensky1990; Reference Smolensky1999; Wilson et al. Reference Wilson, Halford, Gray, Phillips, Gentner, Holyoak and Kokinov2001).
The situation with respect to nonhuman minds, however, is quite different. In the following subsections, we evaluate the degree to which nonhuman minds approximate the defining features of a PSS. As will become quickly apparent, the comparative evidence does not support the all-or-nothing position taken by the orthodox version of the PSS hypothesis (e.g., Fodor & Pylyshyn Reference Fodor and Pylyshyn1988). Unlike the PSS hypothesis, however, the Symbolic Approximation hypothesis invites the possibility that different cognitive organisms may approximate different features of a PSS to varying degrees or even, pace Fodor and Pylyshyn (Reference Fodor and Pylyshyn1988), in a punctate and content-specific manner. And, in fact, this is exactly what the comparative evidence suggests is the case.
10.2. Symbols
The PPS hypothesis is often construed as the claim that mental representations are symbolic. The problem with this construal is that there is little consensus among cognitive researchers on what it means for a representation to be “symbolic” (see discussion by Marcus Reference Marcus2001). Therefore, to sidestep the nettlesome issue of what counts as a “symbol” sensu stricto, we will start with a generic definition of a “mental representation” sensu largo and then ask which, if any, of the additional symbolic abilities postulated by the PSS hypothesis are found in the cognitive behavior of nonhuman animals.
Markman and Dietrich (Reference Markman and Dietrich2000) propose a sensible, minimalist definition of a mental representation as any internal information-carrying state that mediates a cognitive system's furtherance of its goals. We will not pretend that this definition puts to rest the entire – or even a small part – of the controversy surrounding what counts as a mental representation. But we nevertheless propose to stipulate without further argument that nonhuman animals employ “representations” in this minimalist sense, as even the most bare-boned associationist theory of animal learning agrees on the causal relevance of information-carrying mediating states, as well as the explanatory need for these states within comparative research.
The additional claim that nonhuman mental representations carry information about particular states of affairs and that these same representations can subsequently be used off-line in a productive fashion is slightly more controversial, but it should not be. It seems unarguable that nonhuman animals are capable of forming internal representations about discrete states of affairs that endure beyond the sensory-motor inputs giving rise to them. Honey-bees may not be capable of constructing full-fledged cognitive maps, but they are manifestly capable of keeping track of information associated with multiple landmarks they have encountered in the past and then using these representations of absent states of affairs in order to find their way home (e.g., Menzel et al. Reference Menzel, Greggers, Smith, Berger, Brandt, Brunke, Bundrock, Hülse, Plümpe, Schaupp, Schüttler, Stach, Stindt, Stollhoff and Watzl2005). Scrub-jays may not have a theory of mind; but they are manifestly capable of remembering the “what,” “when,” and “where” information associated with tens of thousands of independent cache sites, and they can keep track of “who saw what when” for the purposes of protecting those sites from potential pilferers (Emery Reference Emery and Watanabe2004; Emery & Clayton Reference Emery and Clayton2004b). Bermudez (Reference Bermúdez2003), following Strawson (Reference Strawson1959), calls these “particular-involving” representations; and we feel that it is indisputable that nonhuman animals represent the world in particular-involving ways.
Moreover, nonhuman animals apparently have the ability to update representations associated with a particular state of affairs – for example, where and when one particular piece of food was cached – without catastrophically affecting representations associated with other similar states of affairs – for example, where all the other pieces of food were cached. And they are able to update these representations in response to a single exposure (see, e.g., Clayton et al. Reference Clayton, Bussey and Dickinson2003). For reasons cogently set forth by Blackmon et al. (Reference Blackmon, Byrd, Cummins, Poirier and Roth2004), this means that the nonhuman animals are atomistic learners and that at least some of their internal representations are functionally discrete.
10.3. Compositionality
Perhaps the single most contentious claim of the PSS hypothesis is that mental representations are compositional – that is, complex mental representations are formed by combining discrete representational states into more complex structures such that different combinations of simpler representations can be used to represent different states of affairs in a combinatorial fashion. Few dispute the fact that human thought can approximate the functional effects of compositionality (cf. Prinz & Clark Reference Prinz and Clark2004). The purported compositionality of nonhuman animals' mental representations, on the other hand, has been the object of innumerable, hard-fought battles, particularly between the “associationist” and “symbolic” theoretical camps that have dominated comparative debate for many decades. In our view, the comparative evidence accumulated over the past quarter-century comes down firmly in favor of neither of these venerable theoretical alternatives. Instead, the available comparative evidence suggests that compositionality is a ubiquitous feature of animal cognition, albeit not necessarily the kind of compositionality posited by the PSS hypothesis or “symbolic” accounts of nonhuman cognition.
The PSS hypothesis argues not only that mental representations are compositional but also that they are compositional in a specific fashion: Complex compositional representations in a PSS are formed by concatenation, thereby retaining the identity of the original constituents, rather than by some other conjunctive mechanism that sacrifices the integrity of the original constituents (see discussions by Aydede Reference Aydede1997; Horgan & Tienson Reference Horgan and Tienson1996; van Gelder Reference van Gelder1990). Van Gelder (Reference van Gelder1990) has suggested that any cognitive system should be considered “functionally compositional” if it possesses generally reliable and effective mechanisms for (1) producing a complex representation given its constituents, and (2) decomposing a complex representation back into those constituents – regardless of whether these complex representations are formed by concatenation or by some other means.
As we see it, the comparative evidence leaves no doubt that the nonhuman mind employs enduring, functionally discrete, particular-involving mental representations that are at least functionally compositional in van Gelder's agnostic sense. As Horgan and Tienson (Reference Horgan and Tienson1996) point out, the complexity of social relationships among nonhuman animals would be literally unthinkable without the ability to represent novel dyadic relations by combining discrete representations associated with each individual in a combinatorial fashion. More generally, the well-documented ability of nonhuman animals to keep track of means-ends contingencies and predicate-argument relationships in a combinatorial fashion implies that they possess some generally reliable and productive mechanism for encoding the relation between particular constituents. Such a mechanism is necessary in order to ensure that when multiple relations predicate the same property, the fact that it is the same property in each case is somehow manifest in the structural similarity between the representations. Horgan and Tienson (Reference Horgan and Tienson1996) argue that this is all it should take in order for a representational system to qualify as “syntactically structured”; and therefore one must conclude, they argue, that nonhuman animals employ syntactically structured mental representations, albeit not necessarily in the concatenative sense posited by the PSS hypothesis.
We agree. And this conclusion rules out most traditional associative and distributed connectionist models as plausible accounts of the nonhuman mind (see again Marcus Reference Marcus2001). However, any number of researchers have proposed nonclassical connectionist architectures that are functionally discrete, particular-involving, and syntactically structured without being concatenatively compositional in the sense postulated by the PSS hypothesis (e.g., Plate Reference Plate, Mylopoulos and Reiter1991; Pollack Reference Pollack1990; Smolensky Reference Smolensky1990; van Gelder Reference van Gelder1990; Wilson et al. Reference Wilson, Halford, Gray, Phillips, Gentner, Holyoak and Kokinov2001). Many of these proposals can account for the kind of compositionality manifested by nonhuman animals. Therefore, none of the comparative evidence available to date warrants the widespread assumption among comparative cognitive researchers (e.g., Gallistel Reference Gallistel, Jing, Rosenzweig, d'Ydewalle, Zhang, Chen and Zhang2006) that nonhuman animals necessarily form compositional representations in the concatenative fashion proposed by the PSS hypothesis.
Indeed, as we will see below, nonhuman animals do not even come close to approximating any of the other, more distinctive, higher-order features of a PSS. Therefore, at least in biological organisms, the various representational capabilities putatively associated with a PSS are not a package deal as a matter of nomological necessity (see Hadley Reference Hadley1997).
10.4. Types and tokens
The distinction between types (e.g., kinds, classes, roles, variables) and tokens (e.g., individuals, instances, fillers, values) is one of the essential characteristics of a genuine PSS. A PSS maintains explicit information about the syntactic type identity of each structural relation it employs and the type identity of its allowable constituents as distinct from the constituents involved in any particular relational instance. For example, in a PSS, the abstract characteristics of the loves relation is explicitly represented and invariant to whether John loves Mary or Mary loves John.
The ability to reason about the relation between types and tokens pervades many aspects of human thought. Role-governed rules appear to be a formative feature of all human languages; and the universal ability of humans to learn novel role-governed rules is evident not only in their mastery of natural human languages but also in their ability to extract the abstract rules of artificial grammars in AGL experiments. “Role-governed categories” (Markman & Stilwell Reference Markman and Stilwell2001) also play a central role in human concept formation far beyond the abstract grammatical structures of language. A human subject is perfectly capable of reasoning about a role-based category such as “lovers” or “mothers” or “tools” without there being any set of perceptual features that all lovers, mothers, or tools have in common. Moreover, the ubiquitous human capacity to find analogical correspondences between perceptually disparate relations appears to require an ability to find systematic correspondences between the roles defined for those relations as distinct from the perceptual similarity between the fillers of these roles (Gentner Reference Gentner1983; Gentner & Markman Reference Gentner and Markman1997; Markman & Gentner Reference Markman and Gentner1993; Reference Markman and Gentner2000). Therefore, analogical inferences – one of the hallmarks of the human mind and a prominent feature of abstract causal reasoning and ToM – seem to require the ability to distinguish between roles and their fillers and to dynamically “bind” one with the other without corrupting the independence of either. Notably, the ability to maintain role-filler independence while dynamically binding roles and fillers to particular relations seems to require the kind of concatenative compositionality posited by the PSS hypothesis (for more extensive discussions of this point, see Doumas & Hummel Reference Doumas, Hummel, Holyoak and Morrison2005; Holyoak & Hummel Reference Holyoak, Hummel, Dietrich and Markman2000; Hummel & Holyoak Reference Hummel and Holyoak2003; Hummel et al. Reference Hummel, Holyoak, Green, Doumas, Devnich, Kittur, Kalar, Levy and Gayler2004).
Whereas an explicit and concatenative relation between types and tokens appears to be necessary to explain human subjects' higher-order relational capabilities, there is no need or evidence for this distinction in nonhuman cognitive behaviors. Nonhuman animals appear to reason on the basis of “feature-based categories” alone (Markman & Stilwell Reference Markman and Stilwell2001) – that is, they appear to represent categories such as “mothers,” “tools,” or “kin” based on particular sets of features shared by members of the category (e.g., gender, perceptual affordances, affiliative behavior) rather than on the abstract role that members play in a given relational schema. Moreover, there is no evidence, as we argued above, that nonhuman animals are able to process analogical relations or role-governed rules in a human-like fashion. Thus, one of the fundamental features of a PSS, the explicit distinction between types and tokens, appears to be absent from the cognitive behavior of nonhuman animals.
10.5. Structural relations
Classical theories posit that there are a wide range of distinct structural relations between content-bearing representations, of which “constituency” is the most prominent example (Fodor & Pylyshyn Reference Fodor and Pylyshyn1988). One of the defining features of a PSS is that it allows cognitive processes to operate over the formal structure of a relation in a truth-preserving fashion independently of the relation's particular constituents. Among other things, this permits the PSS hypothesis to explain how recursive operations over hierarchically structured representations can be both inferentially coherent and computationally feasible.
We have argued earlier that nonhuman mental representations are functionally compositional and syntactically structured. Therefore, the nonhuman cognitive architecture must be capable of operating over a range of structural relations between content-bearing representations as well. The comparative evidence suggests, however, that nonhuman animals are unable to reason about the higher-order structural relation between these relations in a human-like fashion and are unable to perform those kinds of operations – such as recursion and deductive inference – which apply to the formal structure of a relation independently from the semantic or perceptual features of its constituents. Many theorists, for example, have suggested that humans form a “causal model” of the network of causal relations within a given domain and then use this causal network to make novel inferences and plan their interventions in a quasi-experimental fashion (Gopnik et al. Reference Gopnik, Glymour, Sobel, Schulz, Kushnir and Danks2004; Hagmayer et al. Reference Hagmayer, Sloman, Lagnado, Waldmann, Gopnik and Schulz2007; Waldmann & Holyoak Reference Waldmann and Holyoak1992). Nonhuman animals also appear to be implicitly sensitive to the differences between certain basic causal structures (see Blaisdell et al. Reference Blaisdell, Sawa, Leising and Waldmann2006). Unlike humans, however, nonhuman animals appear to be incapable of explicitly reasoning about these causal networks in a diagnostic manner, of recognizing the structural similarities between perceptually disparate causal relations, of generalizing their prior knowledge about causal structures to novel contexts, or of reasoning about the structure of causal relations independently of their particular perceptual features (see our discussion in Penn & Povinelli Reference Penn and Povinelli2007a).
In our review of the comparative evidence for hierarchical representations and transitive inferences, we found that nonhuman animals reason solely in terms of first-order perceptual relations (e.g., rates of affiliation, reinforcement history, and outcomes of dyadic agonistic encounters), rather than in terms of the logical, role-governed, and/or structural aspects of the relations themselves. Although metrics such as “early association,” “familiarity,” and “age similarity” may provide heuristic proxies for the kinship relation between two conspecifics, these metrics reduce a role-based, structured relation to an analog chunk and forgo the ability to reason about the higher-order relation between these relations independently of their particular perceptual characteristics. There is no evidence, for example, that nonhuman animals understand the higher-order relation between the grandmother-of and mother-of relation, or the analogical similarity between the father-of and mother-of relation. Similarly, the performance of nonhuman animals on RMTS tasks suggests that nonhuman animals are chunking these relations into analog measures of variability, rather than reasoning about the structural relation between relations per se. And we came to similar conclusions with respect to nonhuman animals' performance on tests of transitive reasoning.
In short, the comparative evidence suggests that nonhuman mental representations are implicitly structured, but that nonhuman animals are incapable of representing these structural relations explicitly (i.e., of explicitly tokening the relation qua relation) and therefore are incapable of reasoning about the higher-order structural relation between relations in a recursive, systematic, or productive fashion.
10.6. Systematicity
Fodor and Pylyshyn (Reference Fodor and Pylyshyn1988) famously argued that all cognitive organisms capable of understanding aRb must also understand bRa, where a and b are referential entities and R is some relation. An oft-cited example is that any organism that can understand John loves Mary will necessarily understand Mary loves John or any other systematic variant thereof. Fodor and Pylyshyn (Reference Fodor and Pylyshyn1988) called this feature of thought systematicity. They argued that (1) systematicity is a unique effect of a PSS; (2) it is universally observed in all cognitive organisms; and therefore (3) all cognitive organisms must employ a PSS. As Fodor and Pylyshyn (Reference Fodor and Pylyshyn1988, p. 28) put it, “that infraverbal cognition is pretty generally systematic seems, in short, to be about as secure as any empirical premise in this area can be.”
Many critics have pointed out that Fodor and Pylyshyn's formulation of systematicity was not particularly well-defined or operationally tractable (e.g., Doumas & Hummel Reference Doumas, Hummel, Holyoak and Morrison2005; Hadley Reference Hadley1994; Niklasson & van Gelder Reference Niklasson and van Gelder1994). Moreover, it turns out to be relatively easy for clever connectionist models to replicate many of the examples of systematicity cited by Fodor and Pylyshyn without resorting to classically structured representations (e.g., Chalmers Reference Chalmers1990; Niklasson & van Gelder Reference Niklasson and van Gelder1994; Plate Reference Plate, Mylopoulos and Reiter1991; Smolensky Reference Smolensky1990).
The argument from systematicity fares even worse from a comparative point of view. Nonhuman cognition is certainly systematic and productive to some degree, but it does not appear to be systematic in the way or for the reasons postulated by the PSS hypothesis. Certainly, any animal capable of thinking the thought dominates(A, B) is likely to be able to think the thought dominates(B, A) for any arbitrary pair of conspecifics of the appropriate age and gender. And this kind of systematicity is often cited by advocates of the classical school of thought to support extending the PSS hypothesis to nonhuman animals (e.g., Carruthers Reference Carruthers2004; Fodor & Pylyshyn Reference Fodor and Pylyshyn1988). But there is a fundamental difference between the kind of systematicity manifested by nonhuman animals and the kind of systematicity posited by the PSS hypothesis.
The kind of systematicity manifested by nonhuman animals is limited to perceptually based relations in which the values that each argument can take on in the relation are constrained only by observable features of the constituents in question (e.g., the gender and age of the conspecific). Feature-based systematicity such as this does not require the cognizer to posit relational roles distinct from the relations' constituents, nor to cognize the fact that certain relations logically imply certain other relations. Not coincidentally, this is also the kind of systematicity that happens to be easily implemented by many nonclassical connectionist models (e.g., Niklasson & van Gelder Reference Niklasson and van Gelder1994). But as Fodor and colleagues have repeatedly argued (e.g., Fodor Reference Fodor1997; Fodor & McLaughlin Reference Fodor and McLaughlin1990), the kind of systematicity posited by the PSS hypothesis is not statistical or accidental or a by-product of domain-specific adaptations; rather, it arises as a matter of nomological necessity from the fact that a PSS defines relations structurally. Classical systematicity entails cognizing the fact that certain relations necessarily imply other relations independently of any particular domain or learning context: for example, for all R, the relations R(a,b) and R(b,c) necessarily imply the relation R(a,c) provided that R is a transitive relation.
There is no evidence for this kind of classical, inferential, role-governed, domain-independent systematicity among nonhuman animals.
11. The relational reinterpretation hypothesis
Here's the pickle. On the one hand, despite its many flaws, the PSS hypothesis lays out a package of representational capabilities that appear to be well – though imperfectly – approximated by normal human minds. On the other hand, whereas nonhuman minds approximate some of these same capabilities to some degree, they do so to a significantly lesser degree than human minds do, and in some cases, not at all.
The comparative evidence therefore poses a serious challenge to the classical version of the PSS hypothesis. All of the strongest empirical arguments for the PSS hypothesis rest on representational capabilities that appear to be largely absent from nonhuman species – for example, inferential systematicity, types and tokens, concatenative compositionality, and explicitly hierarchical relations. In short, the evidence for a classical PSS among infraverbal organisms is a lot less “secure” than Fodor and Pylyshyn (Reference Fodor and Pylyshyn1988) assumed.
The comparative evidence poses an equally serious challenge for many prominent nonclassical theories of cognition. The most extreme critics of the classical school have argued that one can do without the notion of “representation” and “computation” altogether (e.g., Brooks Reference Brooks1991; van Gelder Reference van Gelder1995; Reference van Gelder1998). But the comparative evidence definitively rules out any nonrepresentational, purely embodied, or traditional associative account of animal cognition; and it strongly suggests that nonhuman minds, like human ones, are highly structured, information-processing devices in a way that stomachs and Watt governors are not (cf. Clark Reference Clark2001). Indeed, nonhuman minds approximate certain features of a PSS that are extremely problematic for the kind of traditional distributed connectionist systems that have been the principal antagonist to the PSS hypothesis for more than a quarter century (Elman Reference Elman1996; Hinton et al. Reference Hinton, McClelland, Rumelhart, Rumelhart and McClelland1986; Rumelhart & McClelland Reference Rumelhart and McClelland1986). Whatever kind of architecture the nonhuman mind employs, it is certainly not based solely on traditional distributed connectionist networks or associative learning.
The comparative evidence therefore leads us to propose a hybrid alternative to the orthodox debate between classical and nonclassical theories of cognition: what we call the relational reinterpretation (RR) hypothesis. Povinelli's “reinterpretation hypothesis” previously suggested that humans alone are able to “reinterpret” the world in terms of unobservable causal forces and mental states (e.g., Povinelli Reference Povinelli2000; Povinelli et al. Reference Povinelli, Bering and Giambrone2000). According to our relational reinterpretation hypothesis, the discontinuity between human and nonhuman minds extends much farther: to any cognitive capability that requires reinterpreting perceptual relations in terms of higher-order, structural, role-governed relations.
According to the RR hypothesis, animals of many taxa employ functionally compositional, particular-involving, syntactically structured mental representations about observable features, entities, and relations in the world around them. Furthermore, they form abstract representations about statistical regularities they perceive in the behavior of certain classes of physical objects (e.g., observable causal relations) and other animate agents (e.g., affiliative interactions) and are capable of using these representations off-line to make decisions in a flexible, reliable, and ecologically rational (i.e., adaptive) fashion. Human animals alone, however, possess the additional capability of reinterpreting these perceptually grounded representations in terms of higher-order, role-governed, inferentially systematic, explicitly structural relations – or, to be more precise, of approximating these higher-order features of a PSS, subject to the evolved, content-specific biases and processing capacity limitations of the human brain. Ex hypothesi, the discontinuity between the cognitive abilities of human and nonhuman animals – including our unique linguistic, logical, mentalistic, cultural and causal reasoning abilities – largely results from the substantial difference in degree to which human and nonhuman minds are able to approximate the relational capabilities of a PSS.
Our RR hypothesis bears more than a nominal resemblance to Karmiloff-Smith's (Reference Karmiloff-Smith1992) “representational redescription” hypothesis and to the growing family of “dual-process” accounts of reasoning (Evans Reference Evans2003). The case for two broad “systems” of reasoning within the human mind is already well-founded on the basis of the evidence from human cognitive behavior alone. Our review of the comparative evidence suggests that a dual-process account is well-founded from a comparative perspective as well. And our version of the RR hypothesis is indebted to Karmiloff-Smith's (1992) earlier and perspicacious argument that what makes human cognition unique lies in the manner that we “reinterpret” the lower-order representations we share with other animals (Povinelli et al. Reference Povinelli, Bering and Giambrone2000). Unlike most dual-process account theorists, however, we do not believe that the nonhuman mind (a.k.a. “System 1”) is limited to automatic, associative processes. Indeed, we believe that nonhuman animals are capable of many kinds of “representational redescriptions” – just not the structurally systematic, role-governed relational redescriptions that are the hallmark of the human mind.
Importantly, we are not claiming that our higher-order relational capabilities are the sole and sufficient condition for explaining all of our species' unique cognitive abilities. The uniquely human biological specializations associated with language (see Pinker & Jackendoff Reference Pinker and Jackendoff2005), ToM (see Saxe Reference Saxe2006), and complex causal reasoning (see Johnson-Frey Reference Johnson-Frey2004) – to take only the most obvious candidates – are clearly much more multifarious than a domain-general capacity for higher-order relational reasoning alone. Our claim, rather, is that the ability to reason about higher-order structural relations in a systematic and productive fashion is a necessary – but not sufficient – condition for the normal development and full realization of these other capabilities in human subjects. Our further claim is that it is highly unlikely that the human ability to reason about higher-order relations evolved de novo and independently with each distinctively human cognitive capability. Rather, it seems much more likely that higher-order relational reasoning belongs to a single “supermodule” which is duplicated, reused, shared, or called upon by the functional “modules” associated with each of these other distinctively human cognitive capabilities (see Barrett Reference Barrett and Kurzban2006 for an important discussion of the many possible relationships between architectural, developmental, and functional modularity).
Nor should our RR hypothesis be reduced to the claim that human minds employ a classical “language of thought” (LoT) and nonhuman minds do not. We have argued, both here and elsewhere, that it is highly unlikely that human subjects are pure LoT processors in the sense imagined by the classical PSS hypothesis (see Doumas & Hummel Reference Doumas, Hummel, Holyoak and Morrison2005; Holyoak & Hummel Reference Holyoak, Hummel, Dietrich and Markman2000; Hummel & Holyoak Reference Hummel and Holyoak2003; Reference Hummel and Holyoak2005). Furthermore, given our analysis of the comparative evidence, the representational systems employed by nonhuman animals arguably merit being construed as a kind of nonclassical proto-LoT, analogous to the protolanguages that some researchers suggest preceded the evolution of human language (e.g., Bickerton Reference Bickerton1995). Indeed, it seems likely to us that different species, as well as different “modules” within the cognitive architecture of a given species, approximate different features of a PSS to varying degrees. The evolutionary result, in our opinion, is that “every species gets the syntax it deserves” (Bloom Reference Bloom2000, p. 517), rather than a dichotomous distinction between those with a LoT and those without.
To be sure, we believe there is at least as great a discontinuity between our human LoT and the proto-LoTs of nonhuman animals as there is between the protolanguages of early humans and the languages employed by modern Homo sapiens. Accordingly, our RR hypothesis is more accurately portrayed as the claim that a distinctively human, modular system for approximating a LoT – that is, one that subserves higher-order, role-governed relational representations in a systematic and domain-general fashion – has evolved on top of and reinterprets the output of the proto-symbolic systems we still share with other animals.
Some readers may take the last few paragraphs as a retreat from our initial claim that Darwin was mistaken. Our disagreement with Darwin is, indeed, hedged. Contrary to Darwin's “mental continuity” hypothesis, we have argued that there is a functional discontinuity between human and nonhuman minds – specifically, that only human animals are able to reason about higher-order relations in a structurally systematic and inferentially productive fashion. But, at the same time, we have acknowledged from the outset that this cognitive gap must have evolved largely through incremental, Darwinian processes. The question that naturally arises, then, is this: What representational-level and physical-level innovations explain how this functional discontinuity between human and nonhuman minds arose in an evolutionarily plausible manner?
11.1. LISA: Relational reasoning in a biological symbol system
We do not by any means have a complete answer to this formidable question; but we can at least point to work that suggests one possible step towards an answer. Hummel and Holyoak (Reference Hummel and Holyoak1997; Reference Hummel and Holyoak2003; Reference Hummel and Holyoak2005) have proposed a hybrid symbolic-connectionist model of relational reasoning – LISA (Learning and Inference with Schemas and Analogies) – which we view as one promising (though partial) approach to implementing our RR hypothesis in a computationally feasible and neurally plausible fashion (see also Holyoak & Hummel Reference Holyoak, Hummel, Dietrich and Markman2000; Reference Holyoak, Hummel, Gentner, Holyoak and Kokinov2001; Morrison et al. Reference Morrison, Krawczyk, Holyoak, Hummel, Chow, Miller and Knowlton2004). LISA combines the syntactic strengths of a PSS with the semantic flexibility and generalization capabilities of a distributed connectionist system by using temporal synchrony to approximate the dynamic role-filler binding capabilities of a PSS within a connectionist architecture. Notably, LISA implements the distinctive higher-order relational capabilities of a PSS via an additional representational system that has been grafted onto a simpler system of conjunctive representations used for long-term storage. This simpler system provides conjunctive representations that are functionally, but not concatenatively, compositional and therefore is arguably sufficient to approximate the representational capabilities of nonhuman animals but insufficient to approximate the higher-order relational capabilities of humans.
LISA provides an existence proof that the higher-order relational capabilities of a PSS can, in fact, be grafted onto a neurally plausible, distributed connectionist architecture. At the same time, LISA shows that it is quite hard to approximate the higher-order relational capabilities of a PSS within a neural network – particularly to achieve both role-filler independence and dynamic role-filler binding. In other words, LISA suggests that approximating the higher-order, role-governed features of a PSS is not likely to be an ability that evolved by accident or as a by-product of increased brain size, greater neural plasticity, or larger processing capacity alone. There must be other substantive differences between human and nonhuman primate brains waiting to be discovered (Preuss Reference Preuss and Gazzaniga2004).
If LISA is broadly correct, the substantive difference between human and nonhuman brains will be found in the prefrontal cortices, and specifically in synchronized activity among prefrontal neural populations that support working memory, as well as among neural populations in the frontal and posterior cortical areas (see Lu et al. Reference Lu, Morrison, Hummel and Holyoak2006; Morrison et al. Reference Morrison, Krawczyk, Holyoak, Hummel, Chow, Miller and Knowlton2004; Robin & Holyoak Reference Robin, Holyoak and Gazzaniga1995; Waltz et al. Reference Waltz, Knowlton, Holyoak, Boone, Mishkin, Santos, Thomas and Miller1999; Reference Waltz, Knowlton, Holyoak, Boone, Back-Madruga, McPherson, Masterman, Chow, Cummings and Miller2004). Of course, we are not suggesting that temporal synchrony among prefrontal neural populations is the only possible neural-level explanation for the functional differences between human and nonhuman relational cognition, nor that it provides a full explanation (see, e.g., Jung & Haier Reference Jung and Haier2007). We are simply suggesting that computational models of biological cognition such as LISA provide an important tool for comparative researchers wishing to formulate biologically plausible, representational-level hypotheses concerning the similarities and differences between human and nonhuman minds.
11.2. Moving forward
Admittedly, our RR hypothesis has a number of substantial holes. With respect to the empirical evidence, we have not directly addressed a number of important cognitive domains. In some cases – for example, numeracy, cooperation, and mental time travel – others have already proposed analyses of the functional discontinuity between human and nonhuman animals' capabilities that are largely consistent with the hypothesis defended in the present article (see, e.g., Dehaene Reference Dehaene1997; McElreath et al. Reference McElreath, Clutton-Brock, Fehr, Fessler, Hagen, Hammerstein, Kosfeld, Milinski, Silk, Tooby, Wilson and Hammerstein2003; Suddendorf & Corballis Reference Suddendorf and Corballis2007a). In other cases – for example, empathy and metacognition – the discontinuity between human and nonhuman minds continues to be challenged (cf. Preston & de Waal Reference Preston and de Waal2002; Smith et al. Reference Smith, Shields and Washburn2003). We believe our analysis and hypothesis can (and should) be extended to these latter domains as well. Indeed, we believe that our RR hypothesis offers a powerful framework for explaining what all these disparate cases – from cooperation and mental time travel to numeracy and metacognition – have in common. But we acknowledge that we have not had the space to extend our analysis to these other domains herein.
With respect to our representational-level claims, we have not specified how our proposed symbolic-relational supermodule combines inputs from such a motley collection of perceptual and conceptual modules in a computationally feasible fashion. Fodor (Reference Fodor2000) has argued that this problem is unsolvable, and therefore that the human mind cannot, in the end, be entirely computational. We do not have a complete solution to Fodor's challenge; but, like many others, we do not believe it is in principle unsolvable (Barrett Reference Barrett2005; Carruthers Reference Carruthers and Stainton2005b; Pinker Reference Pinker2005). Hybrid symbolic-connectionist architectures such as LISA provide one possible solution that Fodor has not considered.
The most glaring weakness in our hypothesis is that we have no complete, biologically plausible model of nonhuman cognition to propose. Many who have adopted some form of the Symbolic Approximation hypothesis have taken on the ambitious goal of trying to determine how the unique symbolic capabilities of the human mind might be implemented in a neurally plausible architecture (e.g., Holyoak Reference Holyoak, Ericsson and Smith1991; Hummel & Holyoak Reference Hummel and Holyoak2003; Marcus Reference Marcus2001; Plate Reference Plate, Mylopoulos and Reiter1991; Pollack Reference Pollack1990; Shastri & Ajjanagadde Reference Shastri and Ajjanagadde1993; Smolensky Reference Smolensky1990; Reference Smolensky1999; Wilson et al. Reference Wilson, Halford, Gray, Phillips, Gentner, Holyoak and Kokinov2001). Although we applaud the efforts of these researchers and believe that they are already shedding new light on our species' unique relational capacities, relatively little effort has been invested in modeling the relational abilities of other cognitively sophisticated animals. In our view, the entire field of cognitive science – not just our particular hypothesis – would benefit if more effort were focused on constructing biologically plausible, behaviorally accurate, computationally feasible models of the cognitive abilities of honeybees, corvids, and chimpanzees, in addition to the cognitive abilities of enculturated, language-wielding humans.Footnote 5
Fortunately, the fate of our RR hypothesis does not ride on the success or failure of any particular computational proposal. Our most important claim in this target article is simply that whatever “good trick” (Dennett Reference Dennett1996) was responsible for the advent of human beings' ability to reinterpret the world in a symbolic-relational fashion, it evolved in only one lineage – ours. Nonhuman animals didn't (and still don't) get it.
ACKNOWLEDGMENTS
The authors would like to thank José Luis Bermúdez, Paul Bloom, Dedre Gentner, Arthur Markman, and seven other anonymous reviewers for their invaluable comments on earlier drafts. Thanks also to Aaron Blaisdell, Patricia Cheng, Nicola Clayton, Leonidas A. A. Doumas, Graeme Halford, John Hummel, Laurie Santos, and Michael Waldmann for helpful suggestions and discussions along the way. This work was supported in part by NIH Grant MH072613 to KJH and a Centennial Fellowship to DJP by the James S. McDonnell Foundation.