1. Preliminaries
1.1. Epistemic and metaphysical modalities
We employ the modal words ‘might’ or ‘may’ and ‘must’, expressing possibility and necessity – as opposed, for example, to permission and obligation – in significantly different ways. One broad contrast is between epistemic and alethic uses. When, knowing that Aunt Mabel fully intended to catch the 10.22am from Sheffield to St. Pancras, and that the train has arrived without her on it, we say: ‘She must have missed the train’, we are claiming that, given what we know, the only reasonable conclusion to draw is that she missed the train. When you enquire after the whereabouts of Uncle Bill, and I reply: ‘He may be in the garden’, what I mean is that nothing we, or at least I, know rules out his being in the garden. These examples illustrate the epistemic use. But there are other cases in which we use ‘must’ and ‘may’ or ‘might’ to express beliefs about what is necessary or possible in a non-epistemic, alethic sense. Thus we often think that things might have gone differently from how, as we know, they went – e.g. that Mozart might have outlived Haydn, instead of pre-deceasing him by nearly two decades (as he actually did), or that Aristotle might never have gone in for philosophy. And when we assert, say, that the product of two odd numbers must be odd, we probably mean not merely that given what we know about numbers, it follows that m × n is odd, if m and n are both odd, but that it is, in some sense, outright and objectively necessary that the product of odd numbers is odd.
1.2. Relative and absolute modalities
Epistemic possibility and necessity fairly clearly involve a kind of relativity. To say that something might be so, meaning that it is epistemically possible, is to say that for all we know, it is so – that nothing we know rules it out, or that it is consistent with, and so possible relative to, what we know. Similarly, mutatis mutandis, with epistemic necessity – what must be so, given, or relative to, what we know.
It is plausible, or at least widely accepted, that we use modal words to express other kinds of relative necessity and possibility. Thus although the example is not uncontroversial, it is often supposed that when we speak of physical necessity and possibility, we are concerned with what must or may be so, relative to the laws of physics. Similarly, we may take biological, psychological, etc., necessity and possibility to be forms of relative modality.
On what may be termed the standard approach, different kinds of relative necessity are all relativizations of a single underlying kind of necessity, which is naturally taken to be logical necessity. Relative necessity is logical necessity relative to a certain body of propositions. Thus it is physically necessary that p just if the proposition that p logically follows from the laws of physics, and physically possible that p just if its negation (the proposition that ¬p) is not a logical consequence of those laws. How precisely this general idea should be implemented is a further question. In its classic formulation, the proposition that p is physically necessary iff the conditional Γ→p , where Γ is a conjunction of physical laws, is logically necessary (i.e. iff □(Γ→p)). There are reasons to think this agreeably simple and straightforward analysis won't do as it stands, but we need not pursue them here. The general idea is surely right.Footnote 1
It is natural to assume, as the standard approach does, that the underlying kind of necessity in terms of which the various sorts of relative necessity are explained is not itself relative, but absolute. Indeed, at least on the face of it, the supposition that logical necessity is itself merely relative would be viciously regressive. In any case, it is unclear what other kind of necessity logical necessity might be taken to be a relativization of. And the idea sits uncomfortably with the very plausible principle that each kind of necessity is closed under logical consequence.
If one thought, as David Lewis did, that illuminating analyses of philosophically problematic concepts could be given in terms of possible worlds, the obvious way to explain the notion of absolute necessity would be through absolutely unrestricted quantification over worlds: it is absolutely necessary that p just in case it is true at every world without restriction that p. For those who have less faith in the explanatory value of the hypothesis of a plurality of worlds, a better way to characterize the notion might replace unrestricted quantification over worlds by unrestricted quantification over propositions. What is absolutely necessary is what holds unconditionally – equivalently, what holds, or rather would hold, under any condition whatever. A little more formally, □p to abbreviate ‘it is absolutely necessary that p’ and □→ for the counterfactual conditional, □p=∀q(q □→p). If this is to capture the notion of absolute necessity, q must be understood as ranging unrestrictedly over all propositions whatever, and not just those expressible in the language one happens to be using. And of course, since the explanans employs the subjunctive or counterfactual conditional, which is itself a modal notion, it can make no claim to provide a reductive explanation of the notion of necessity, such as Lewis though he could give in terms of worlds.Footnote 2
It is easily seen that logical necessity is absolute in this sense; when □ is interpreted as expressing logical necessity, □p entails the strict conditional □(q→p) for any q, which in turn entails the generalized counterfactual ∀q(q□→p). But whilst it is clear that all logical necessities are absolute, it is a further, and rather less straightforward, question whether the converse is true, i.e. whether all absolute necessities are logical. Among those who have thought so we may count Ludwig Wittgenstein – or at least, the Wittgenstein of the Tractatus, who unequivocally declares that the only necessity is logical – and, provided that logical necessity is construed broadly enough, the logical empiricists, who held that all necessary truths are true in virtue of meaning, or analytic. Pre-eminent among those who have rejected the converse is, of course, Saul Kripke. Although, as far as I know, the now widely employed term ‘metaphysical necessity’ is nowhere used in Naming and Necessity,Footnote 3 Kripke firmly declares necessity to be a metaphysical concept, in contrast with analyticity and apriority, which belong, he tells us, to semantics and epistemology respectively. Logical necessity receives barely any discussion in Kripke's lectures, but he makes it unmistakeably clear that he regards as necessary many things – for example, that water is H2O, that gold is an element, that Hesperus is Phosphorus – which he would count as neither logical truths nor analytic, nor as knowable a priori. Kripke does not use my term ‘absolute necessity’, but he leaves us in no doubt that he takes these things to be necessary in the strongest sense.Footnote 4
1.3. The logic of absolute necessity
Which logical principles hold for □ when it is understood as expressing absolute necessity? I contend that they are the principles of the strongest normal modal logic, S5, and thus include not only the fairly obvious K and T principles □(A→B)→(□A→□B) and □A→A, but the more controversial S4 principle □A→□□A and the stronger, characteristic S5 law ◊□A→□A. The simplest and most direct argument to support this claim I know is semantic, and is most easily sketched using the usual possible worlds semantics for modal logics. In briefest terms, a model in this semantics comprises a domain W of worlds, together with a relation R of relative possibility or accessibility defined on W and an exhaustive assignment of truth-values to the simplest propositions relative to members of W. Negations, conjunctions, etc., evaluate as you would expect. □A evaluates as true at a member w∈W iff A evaluates as true at each w′∈W such that wRw′. Which modal principles are validated by the semantics depends upon what constraints, if any, are taken to govern the accessibility relation R. The K principle holds without further constraints. Requiring R to be reflexive – so that each world is possible relative to itself – validates the T principle: requiring symmetry – that is, if wRw′, then w′Rw – secures the B principle (A→□◊A); and requiring transitivity – that is, if wR w′ and w′Rw′′, then wRw′′ – secures the S4 principle. In essence, the semantic argument goes as follows. It is absolutely necessity that p just in case, no matter what proposition q may be, if it were the case that q, it would be that p. Here, q is to be understood as ranging over absolutely all propositions, without any restriction. In terms of worlds, this is tantamount to the requirement that p should be true at absolutely all possible worlds, without restriction. In the context of world-semantics, this translates into the requirement, for □p to be true at any given world in the model, that p should be true at every world in the model accessible from that world, when absolutely every world is accessible from every world. This is equivalent to requiring that the accessibility relation be an equivalence relation – i.e. reflexive, symmetric, and transitive. But these constraints ensure that the characteristic S5 principle holds.Footnote 5
2. Problems
I know of no more succinct formulation of the basic philosophical questions about necessity than Michael Dummett's:
The philosophical problem of necessity is twofold: what is its source, and how do we recognise it?Footnote 6
The first question clearly belongs to metaphysics. It asks after the basis of necessity; that is, when something is necessary, what makes it so? The second is equally clearly epistemological. Assuming we know some things to be necessary, how do we know? The two halves of Dummett's problem interlock in a way that puts a constraint on acceptable answers to both questions. At a minimum, no satisfactory answer to either can preclude a credible answer to the other.Footnote 7
In this paper, I shall be focusing on Dummett's first, metaphysical question, but we should keep in mind that no answer to it can be satisfactory which does not at least leave room for a workable epistemology of modality.
When directed at a kind of relative necessity – physical necessity, say – the metaphysical question is plausibly taken to concern the ground or basis of the truth of the body of propositions to which that kind of relative modality is relative, e.g. the laws of physics; and the epistemological question is essentially a question about how we know what follows from those propositions. These questions certainly need answering, but their answers will have no direct bearing on the metaphysics and epistemology of modality as such. It is when Dummett's questions are directed at absolute necessity (and by implication, absolute possibility) that they concern modality most directly, and it is then that they are apt to seem especially hard to answer. Indeed, some have thought the metaphysical question unanswerable. Simon Blackburn once argued that any attempt to say what makes for necessity must get skewered on one or the other horn of a lethal dilemma. For suppose we say that what makes it necessary that p is the fact that q. Then either it is necessary, in turn, that q, or it is a merely contingent fact that q. If the first, we may have succeeded in explaining why p is necessary, but since we have explained it by appealing to another necessity, we have merely postponed the hard question: why is there any necessity at all? If the second, then since it might just as well not have been the case that q , it seems that we have not so much explained why it must be that p, as shown that it isn't really necessary after all.Footnote 8
3. Some unsatisfactory answers
The difficulty of answering Dummett's first question head on has led some – Blackburn and others who, like him, have agreed that we cannot reject the notion of necessity altogether – to deny that there really are any objective modal facts, as distinct from facts about what what we find ourselves able or unable to imagine or conceive. It has led others to look for a reductive explanation – an explanation of modal facts in wholly non-modal and putatively less problematic terms. I shall comment briefly on two theories of this kind before turning to the approach I think best.
3.1. Conventionalism and the linguistic theory of necessity
One answer which Blackburn explicitly takes to fall victim to the first horn of his dilemma is conventionalism – roughly, the idea that whenever it is necessary that p, what makes it so is our having adopted conventions for the use of words which somehow ensure that the sentence(s) we use to state that p are true. Since the fact, if it is one, that we have adopted the appropriate conventions is a contingent fact, this answer clearly fits the rubric of the first horn, and Blackburn swiftly concludes that the conventionalist answer must fail – if we need not have adopted those conventions, then it can't be really necessary that p after all. Since I have explained in some detail elsewhere why this argument failsFootnote 9, here I shall simply observe that it begs the question by relying on a principle which the conventionalist is bound to reject in any case. For what the contingency of the factFootnote 10 of our having adopted such-and-such conventions directly shows is not that it isn't necessary that p, but only that it isn't necessarily necessary. To get to the conclusion that it is not necessary simpliciter, we must appeal to the characteristic S4 principle: □p→□□p. But this principle is one which any serious conventionalist must in any case reject.
The conventionalist answer is perhaps the clearest form of what is sometimes called the linguistic theory of necessity, embraced by the logical empiricists, which seeks to explain necessity away as truth in virtue of meaning. It is indeed necessary that vixens are female, for example, but all that really amounts to is that we have opted to use our word ‘vixen’ to mean ‘female fox’, so that ‘Vixens are female’ is guaranteed, by the meanings of its constituent words, to express a truth.Footnote 11
The attractions of this kind of theory are obvious enough: avoidance of mystery coupled with explanatory economy – we need meanings anyway, to explain how we say true and false things about the world, and the theory then gives us necessity as a by-product. But it runs into some serious problems.
If we agree with Kripke that there are necessities like
Water is H 2 O
Gold is an element
which are not analytic or knowable a priori, it may seem that, even if some form of the linguistic theory can explain some necessities, it can't give an adequate account of all of them. Some more recent defenders of the theory have, however, sought to meet this objection by arguing that even in these cases, whilst the truth of the specific proposition that water is H2O, say, is an a posteriori, empirical discovery, its necessity derives from our having adopted a covering general convention to the effect that chemical substances are defined by their chemical composition, and similarly in other cases.Footnote 12 I shall not discuss this extension of the theory here, because I think it succumbs to older objections which apply even before any attempt to extend it to encompass a posteriori necessities is made.
Briefly, the first point, made long ago by QuineFootnote 13 is that definitions don't create truths, but merely allow us to re-write them more succinctly – that ‘vixen’ means ‘female fox’ merely allows us to abbreviate the longer truth ‘Female foxes are female’ to ‘Vixens are female’, so that the latter inherits the necessity of the former; but it does nothing to account for necessity of the longer truth. To ensure that, conventions of a different kind would be needed – sentence-sized conventions, each directly stipulating the truth of a complete sentence, or perhaps that of every sentence of a certain form. But of these, there can be only finitely many, and so, since there are infinitely many different logically valid sentence forms, not enough to account of all necessary truths one-by-one. Any attempt to meet this shortfall by holding that there is a base class of truths guaranteed directly by convention, all others being logical consequences thereof, runs into some well-known problems highlighted by Quine (op.cit) and Dummett.Footnote 14 Consequence relations bring with them further necessities: Let B comprise the basic necessities, directly secured by conventions, and let q be any non-basic necessity, whose necessity is to be explained by appeal to its being a consequence of necessities p 1,…, p n in B. Then the conditional p 1,…, p n→q will be a further necessity, so far unaccounted for. This looks, and Quine argued is, viciously regressive.Footnote 15
3.2. Worlds and combinatorial theories of possibility
What is necessary is what holds true in all possible worlds, what is possible is what is true in at least one of them. Whatever the merits of a model-theoretic semantics based on a domain of worlds – and they are surely many and considerable – taken as explanations of the concepts of necessity and possibility, these appear hopelessly circular. Some – most notably David Lewis – have thought otherwise: that we can (i) say what worlds are in non-modal terms and (ii) have reason to believe that the worlds are many and varied enough to account for all the possibilities. Worlds, Lewis contends, can be characterized in non-modal terms as spatio-temporally (and hence causally) closed systems of things. One such system of things comprises Lewis himself together, as he charmingly puts it, with all his surroundings – that is, everything to which he is spatio-temporally related. His theory says that there is a vast plurality of such systems. Necessity is simply truth at each and every one of them, and possibility truth at at least one. If the theory is to have any plausibility as an explanation of necessity and possibility, the worlds must somehow cover all the possibilities – there needs to be (at least) one for each way things might have been. Obviously mere proliferation of worlds is not enough – there must be sufficient variety in the plurality. The variety is to be guaranteed by Lewis's combinatorialism. In essence, this claims that each world is made up of smallest bits – mereological individuals – arranged in a certain way. Since distinct worlds are spatio-temporally disjoint, the individuals of our world cannot be in any other world. Instead, each other world has its own individuals, but arranged differently. The admissible arrangements are given by his principle of recombination: roughly ‘anything can co-exist with anything else, at least provided that they occupy distinct spatio-temporal positions. Likewise, anything can fail to co-exist with anything else’.Footnote 16, Footnote 17
Many philosophers have rejected Lewis's theory because they find its central claim – that there exists, in addition to the space-time we inhabit, a vast number (at least 22 ℵ0) of others, each spatio-temporally disjoint both from ours and from each other – simply incredible. Lewis dubbed this objection ‘the incredulous stare’, and complained in earlier workFootnote 18 that an incredulous stare is not an argument. In later writings, he grants that his ‘modal realism does disagree, to an extreme extent, with common sense opinion about what there is',Footnote 19 but argues that common sense opinion is not sacrosant, and that we should grit our teeth and accept his seemingly extravagant ontology because it enables us, better than any alternative, to explain or analyse a lot of things – necessity and possibility themselves, counterfactual conditionals, propositions, and properties, to mention some – which we, as philosophers, find problematic and would like to explain or analyse in less problematic terms.
The efficacy of this defence is open to question. Even if the theory succeeds – it is far from clear that it does – in explaining the modal in non-modal terms, the reduction may be reckoned unsatisfactory for two reasons. First, on the epistemological side, it may seem that giving the truth-conditions of modal propositions in terms of how things are throughout a vast plurality of worlds each of which bar one is inaccessible to us precludes any credible account of how we may know or reasonably believe anything about necessity and unrealized possibility. Second, on the metaphysical/ontological side, it may be thought that the putative reduction comes at an exorbitant cost: how is having to swallow a vast array of inaccessible space-times any sort of philosophical advance on accepting unreduced modal facts?
In fairness to Lewis, I should confess that I am less sure of the epistemological objection – at least in this form – than I used to be. For whilst it is very hard to see how we could know what goes on and how things stand in any world other than ours, it can be argued that we need no such knowledge of particular other worlds. To know that it might have been the case that p, even though it is actually not so, we do not need to know, concerning some other particular world, w 17 say, that it is true at that world that p; we need only to know the purely general existential proposition that there is a world at which p. And to know that necessarily p, we don't need somehow to verify, concerning each of the other worlds besides our own, that p is true there: we need rather to know the general negative existential proposition that there is no world at which ¬p. And, or so Lewis may claim, these are the kinds of thing we can know by considering whether things can be combined or arranged so that p, or so that ¬p. Lewis can best reply to the epistemological challenge, in other words, by appealing to his principle of recombination.
This may be a good reply, as far as it goes. But it serves only to re-focus critical attention where it should have been all along, on the recombination principle itself. First, whilst Lewis is surely right when he says that ‘our everyday modal opinions are, in large measure, consequences of a principle of recombination’,Footnote 20 this serves to relocate epistemological doubts about his theory, rather than to dispel them: if it really is the case that anything can co-exist with anything else, that tells us a lot about what counterfactual states of affairs are possible – but why should we believe it?
Further, it seems clear that the kind of recombination principle Lewis would have us accept amounts to a very substantial modal thesis. Given its pivotal role in his theory, it is is good question whether that theory can after all claim to provide a reductive explanation of the modal in genuinely non-modal terms. Perhaps Lewis could argue that while his recombination principle does indeed have very substantial implications for what possibilities there are, it can be stated in entirely non-modal terms – roughly, for anything x and anything else y, there is a world in which x exists but y does not, and a world in which y exists but x does not; the principle's having modal consequences does not preclude its playing a part in providing an explanation of modal concepts in non-modal terms. I shall not try to assess the effectiveness of this defence here. Even if it succeeds, the fact that Lewis's explanation effectively conflates the task of explaining the concept of possibility with the separable question of what possibilities there are gives those of us who do not share his view on that question ample reason to reject it. Lewis's explanation may appeal to those who share his commitment to a form of Humean atomism (no necessary connections between distinct existencesFootnote 21). But the assumption that any recombination – or separation – of basic entities (i.e. Lewis's mereological individuals) is possible threatens to foreclose from the outset against any view on which there are absolute necessities which are not, even in a quite broad sense, logical necessities. In particular, it rules against a host of familar kinds of essentialist claims, such as the necessity of a thing's membership in its fundamental kind, or its composition or origin. Such claims are, of course, controversial. But that is precisely the point: just for that reason, they ought not to be settled by what purports to be a good explanation of the concepts of possibility and necessity – they are controversial, but they are not self-contradictory.
4. Essence and modality
4.1. Basics
Everyone, save those who think we should dispense with the notion of necessity altogether, agrees that if a conjunctive proposition (A and B) is to be true, each of its conjuncts must be true. That is, it is not just true, but necessarily true, that if A and B is true, each of A and B is true. Why is this? What makes it so? A simple, and I believe correct, answer is that it is necessary because that is what it is to be a conjunction. Or more accurately, it is part of what it is to be a conjunction. For it is equally part of what it is to be a conjunction that if A is true and B is true, then their conjunction (A and B) must be true. Conjunction is a function from pairs of propositions to propositions. It is that function which takes a pair of propositions (A, B) to a true proposition (A and B) if and only if each of A and B is true. That is the nature, or essence, of conjunction. It is necessarily true that a conjunction is true only if each of its conjuncts is because it is true by, or in virtue of, the nature of conjunction that a conjunction is true only if each of its conjuncts is.
The general pattern of explanation illustrated by our example could be represented:
Necessarily … because it is true by the nature of ______ that —
where the first and final gaps are filled by declarative sentences, and the second by a singular or plural noun-phrase.
Further plausible examples conforming to this pattern readily suggest themselves:
Necessarily Aristotle is a man because it is true by Aristotle’s nature that he is a man
Necessarily whales are mammals because it is true by the nature of whales that they are mammals
Necessarily gold is an element because it is true by the nature of gold that it is an element
Necessarily everything is self-identical because it is true by the nature of identity that it is a reflexive relation
Necessarily a + b = b + a for all whole numbers a,b because it is true in virtue of the nature of addition that it is commutative
What I shall call the essentialist theory of necessity claims that all necessities – or at least all metaphysical necessitiesFootnote 22 – can be explained in this way, and that metaphysical possibilites are simply those propositions whose truth is not ruled out by the natures of things. In what follows, I shall try first to give a more precise and explicit statement of the theory, and then – in part by discussing some of the more obvious questions to which it gives rise and some of the difficulties it faces – to explain the theory more fully.
4.2. Historical remark
The idea that we should distinguish the essential properties of things from those ‘accidental’ properties which they merely happen to possess, but might perfectly well have lacked, and that its essential properties collectively constitute a thing's nature or essence, goes back to Aristotle. The suggestion that (all) necessities have their source in the nature of things is less clearly attributable to Aristotle, although there are certainly passages in which he comes close to making itFootnote 23 . In modern times, it was, as far as I know, first put forward by Kit Fine in his ground-breaking paper ‘Essence and Modality’.Footnote 24, Footnote 25 Prior to the publication of Fine's paper, the prevailing view among those philosophers and logicians who accepted Aristotle's distinction was that the notion of an essential property should be explained in terms of the notion of necessity, as expressed by the usual modal operator, □; . On this approach, essential predications such as:
Cows are essentially quadrupeds
Aristotle is essentially human
are to be analysed as:
∀x□(x is a cow → x is a quadruped)
□Aristotle is human [or, to allow for Aristotle's possible non-existence: □(Aristotle exists → Aristotle is human)]
After arguing, in my view pretty convincingly, that this approach can't adequately capture the notions of essential property and essence, Fine proposes that we should reverse the order of explanation. Thus instead of explaining ‘Cows are essentially quadrupeds' as meaning ‘∀x□(x is a cow → x is a quadruped)’, we should hold that the latter necessitated predication is true precisely because it is true in virtue of the nature of cows that they are quadrupeds.
4.3. An essentialist theory of necessity and possibility
Although the essentialist theory is basically quite simple, giving a general explicit statement of it is not entirely straightforward. As a first stab, we might try formulating the theory's two central theses schematically as:
Necessarily p if, and because, there is something the nature of which ensures that p
Possibly p if, and because, there is nothing whose nature rules out its being true that p
Taken at face value, the first of these involves an existential quantification, and the second the negation of one. So we might rewrite them, in what looks like familiar notation, as:
□p if an only if, and because, ∃x it is true by the nature of x that p
◊p if and only if, and because ¬∃x it is true by the nature of x that ¬p
But a moment's reflection should be enough to see that these can be no ordinary quantifications – ordinary quantification being either first-order, in which the bound variables range over objects or individuals (and nothing else), or second- or higher-order, in which the variables range over properties or relations of first- or higher-level. For, as our examples illustrate, the ‘things' whose natures are to be invoked to explain why such-and-such is necessary may be entities of very different types. They may be individual objects, such as Aristotle. But they may be general kinds or sorts of objects, e.g. whales, or general kinds of stuff or substance, e.g. gold, or relations, e.g. identity, or functions, e.g. conjunction, or perhaps entities of some other kind. Syntactically, the quantification is first-order – at least in the sense that its bound variable x occupies a position which needs filling with a noun or noun-phrase (singular or plural) or some other kind of substantival expression. But semantically, it must be understood as ranging over entities belonging to different ontological types or categories. Indeed, if the essentialist theory is to explain all (metaphysical) necessities, we must suppose that it ranges over entities of all types. We need, in other words, to employ a universal variable, whose admissible values include not only objects, but properties, relations, and functions, of each type and level, and entities of any other sorts there may be.Footnote 26
To get an adequate formulation, we need to introduce some further flexibility along a different dimension. So far, what we have allows only for explanations of necessity which appeal to the nature of a single entity. But it is easy to see that we need to go beyond this. Consider this slightly more complicated example. It is necessary that if A and B is true or A and C is true, then A and (B or C) is true. This is not true in virtue of the nature of conjunction alone, nor in virtue of the nature of disjunction alone; it is true in virtue of the natures of conjunction and disjunction together. It is easy to find examples in which an essentialist explanation would need to appeal to the natures of three or more entities. Since there is no plausible finite bound of the number of entities to which we may need to appeal, it would seem that we need something like:
□p iff, and because, ∃x 1 … x n it is true by the nature of x 1 … x n that p
◊p iff, and because, ¬∃x 1 … x n it is true by the nature of x 1 … x n that ¬p
so that the natures of any finite number of things may be involved in accounting for a given necessity. Employing the very useful and suggestive notation □xp introduced by Fine,Footnote 27 our two principles may be more concisely stated:
(Necessity) □p ↔ ∃x 1 … x n□x1 … xnp
(Possibility) ◊p ↔ ¬∃x 1 … x n□x1 … xn¬p
Our formulation is schematic in two respects, as indicated by the free propositional variable p and the free numerical variable n. Since the theory claims to account for every (metaphysical) necessity and possibility, the propositional variable is to be understood as implicitly universally quantified. And since its explanations may appeal to any finite number of things, the numerical variable is to be understood as implicitly bound by an appropriately placed existential quantifier ranging unrestrictedly over the positive integers.Footnote 28
4.4. The essentialist theory further explained – questions and difficulties
4.4.1. What are essences/natures?
This is the first, and most obvious question about the content, as distinct from the form, of the theory. The essence (or nature) of something is what it is to be that thing. A thing's essence is given by its definition, in the primary non-obsolete sense listed in the SOED, i.e. ‘A precise statement of the nature, properties, scope, or essential qualities of a thing’. There is, I think inevitably, a strong whiff of circularity in this explanation. You will make nothing of it unless you have already some sort of grasp of a contrast between saying what something is and saying any of many other things true of it – between saying, for example, that Aristotle is a man, which is to state part of what it is to be Aristotle, and saying that he is a philosopher, which is to state something which is, but mightn't have been, true of him. A definition in the relevant sense is of the thing, rather than of a word standing for the thing, although in an important class of cases, what serves as a definition of the thing might just as well be presented as a definition of a word for it, as with the definition of a square as a closed rectilinear plane figure composed of four sides equal in length and meeting at right-angles. This is what it is to be a square, but it would serve equally well to define the word ‘square’ (in the geometrical sense).
The properties figuring in a thing's definition are those properties which make it what it is, and so those properties without which it quite literally could not be (i.e. exist). They are its essential properties. Thus if x is essentially φ , it is so necessarily – that is, it is necessary that x is φ . Furthermore, nothing could fail to have the essence it has; that is, if x is essentially φ , it is necessary that it is essentially φ. (□xφ(x)→□□xφ(x)). Thus when the essentialist theory claims that it is necessary that p because it is true in virtue of the nature of something x that p, the explanans is itself necessary, and the theory is explaining one necessity by appeal to another.
4.4.2. Blackburn's dilemma revisited, and a related difficulty
According to Blackburn's dilemma, if we seek to explain one necessity by appealing to another, we merely go in a circle or set off on a vicious regress, while if we appeal instead to some merely contingent fact, we succeed only in undermining the putative necessity. In view of the point just made, it may seem that the essentialist theory must get impaled on the first, necessity, horn. And so it would, if the dilemma were sound. But it isn't. Blackburn is, of course, quite right that if we explain a proposition p's necessity by claiming that necessarily p because q, the proposition that q will itself be either contingent or necessary. But, supposing it to be necessary, this does not mean – as the necessity horn appears to simply assume – that we will be explaining p’s necessity by appeal to q’s necessity. There is a clear distinction between an explanation in which the explanans is the fact that q and one in which the explanans is the fact that it is necessary that q. Certainly, explanations of the latter kind are common enough: we explain why necessarily p by deducing p from q , and observe that since q is necessary, p must be so too (i.e. we rely on the obviously valid principle that what follows from a necessary truth must be itself necessary: □A, □(A→B) |– □B). Necessity is transmitted from the premise to the conclusion. But an explanation of the former sort is not transmissive. The explanans is plain q, not its being necessary that q . It may be – and with an essentialist explanation, it is – the case that the explanans is itself necessarily true if true at all, but that does not mean that what ‘really’ does the explanatory work is the necessity of the explanans rather than its plain truth. Blackburn simply misses this distinction.
It may be felt that even so, the essentialist theory runs into a related problem. Schematically, suppose that □xφx. Then as we've just seen, □□xφx. So applying (Necessity), it must be that for some y, □y□xφx. But now, haven't we just taken the first step in a troublesome-looking regress? – for it will, presumably, be necessary that □y□xφx , i.e. □□y□xφx, so that for some z, □z□y□xφx, and so on. One might, to be sure, block the threatened regress by exempting necessities of the form □□xp from the scope of the essentialist theory (i.e. by restricting the range of p in (Necessity) so as to exclude propositions explicitly stating the natures of things). But even if such a restriction could be argued to be other than simply ad hoc, it is not clear that it is needed, since it is not clear that the regress is vicious. Arguably, it is not. Whatever x may be, if it is true in virtue of x's nature that p, it is so necessarily precisely because nothing could have a different nature from the nature it actually has – whatever x may be, it is part of what it is for something to be x's nature that x could not have had any other nature. In that sense, if it is (part of) x’s nature to be φ, then that itself holds true in virtue of x’s nature. That is, whilst, assuming □xφx , there has to be a y such that □y□xφx, there need not be anything other than x itself whose nature ensures that □xφx.
4.4.3. What kind of explanation does this essentialist theory provide?
Blackburn's objection to any attempt to explain necessities by appeal to further necessities – that we are always left with a ‘bad residual “must”’Footnote 29 – assumes, in effect, that any worthwhile explanation of a necessity must explain it reductively, in wholly non-modal terms. The related difficulty just discussed is likewise liable to appear more serious if it is assumed that reduction is required, and could not be answered as I've suggested if that assumption is granted. But we should, I am suggesting, reject the assumption. Essence, as I've described it, is an essentially modal notion. I don't myself think it can be adequately explained in non-modal terms. However, although it can't be fully grasped by someone who does not already have some understanding of necessity, it cannot be adequately explained in terms of necessity, at least as it is expressed by a unary sentential operator such as the usual □.
If the essentialist theory does not provide a reductive explanation of necessity, what kind of explanation, if any, does it provide? Philosophers sometimes say – and in my view are at least sometimes right to say – that when we are dealing with fundamental concepts (concepts which cannot be analysed in terms of other, more fundamental ones), we must settle for an explanation which charts the connections between them – there is no digging deeper, so we must dig sideways. I am suggesting that this is so here, with the concepts of essence and other modal concepts (centrally, necessity and possibility and the strong or counterfactual conditional). But in one way, at least, the essentialist explanation does a little more. For what explaining necessity in terms of essence (□ in terms of □x) does is to bring out a kind of structure in the class of necessities as a whole, by identifying a fundamental subclass – those encapsulating the natures of things – in which all others have their source or ground.
4.4.4. Contigency
The essentialist theory explains □p and ◊p by means of the unmodalized quantifications ∃x 1 … x n□x1 … xnp and ¬∃x 1 … x n□x1 … xn¬p respectively. As I have said, the bound variables x i are to be understood to range over all things whatever. But if this includes things which exist only as a matter of contingency, it might be thought that the theory must run into some serious, and perhaps insuperable, objections. Of course, no such problems arise, if the essentialist embraces necessitism – the view, forcefully defended by Timothy Williamson, that everything that exists does so necessarily, and that nothing which doesn't exist could do so. And at least some of them might be avoided by adopting a view, such as Alvin Plantinga's, according to which the essences of things exist of necessity, even when those things themselves exist only contingently. I do not think these views can be dismissed out of hand, as obviously incorrect. But I think it is of interest to see whether the essentialist can withstand the objections turning on contingent existence without resorting to them.
5. Iterated necessities
According to the essentialist, a proposition is necessary iff there are some things in virtue of whose natures it is true. This is the claim we formalized as:
(Necessity) □p↔∃x 1 … x n□x1 … xnp
It is, we may suppose necessary that if Aristotle exists, he is a man, i.e.
(a) □∀z(z = a→Ha)
Applying (Necessity) to (a) gives
(b) ∃x□x∀z(z = a→Ha)
We are assuming that □ in (a) expresses absolute necessity, which we have argued to be governed by S5 principles, so that (a) entails:
(c) □□∀z(z = a→Ha)
On the face of it, pending some restriction on the application of (Necessity), (c) in turn entails:
(d) □∃x□x∀z(z = a→Ha)
But this, it seems, cannot be right. For it says that as a matter of necessity, there exists something the nature of which makes it true that if anything is Aristotle, it is a man. And it is quite unclear what this thing could be, other than Aristotle. But Aristotle's existence – at least according to widely held views – is a contingent matter, and if we are right in supposing that nothing else is such that its nature could make this true, it seems that it cannot be necessary that there is something which makes it true that if anything is Aristotle, it is a man.
One might block this argument by denying that (a) entails (c). This would involve either denying that absolute necessity conforms to the S4 principle that □A entails □□A , or denying that metaphysical necessity is, after all, absolute. Neither course seems to me attractive.Footnote 30 Assuming that it is not an option to deny that (a) entails (c), can we deny that (c) entails (d)? How exactly is (d) to be inferred from (c)? The only obvious routeFootnote 31 is to assume that we may necessitate (instances of) (Necessity) to obtain
(**) □(□p↔∃x 1…x n□x1…xnp)
and apply a principle of substitution of necessary equivalents:
(□↔ sub) □(A↔B), □A |– □B
to infer (d) from (c) together with
(e) □(□∀z(z = a→Ha)↔∃x□x∀z(z = a→Ha)) instance of (**)
We can, and I think should, block this, by restricting (□↔sub) to formulae not containing occurrences of □v, i.e. A, B must be □v-free. This restriction is not merely ad hoc – propositions about essence have additional content beyond what can be captured just using ordinary modal operators. Of course, there is nothing to prevent us from applying (□↔ sub) to formulae whose subformulae are □v-free equivalents of formulae containing □v. Thus given that □(A↔∃x□xB), we may replace ∃x□xB by its equivalent □B to get □(A↔□B) and then apply (□↔sub), together with □A as a minor premise to get □□B.
6. Contingent existents and non-existents
If some of the things which in fact exist might not have existed, or if there might have existed things other than those which actually exist, it may seem that the essentialist theory is bound to give incorrect results – to count as necessary some propositions which, absolutely speaking, might have been false, and to count as possible some propositions which could not have been true. For suppose that □xp (so that according to the theory, □p), but that (i) there is no other entity y such that □yp and (ii) x exists only contingently. Then if x hadn't existed, there would have been nothing to guarantee p’s truth, and, since x mightn't have existed, it can't be (absolutely) necessary – although necessary, it is not necessarily necessary. Or suppose that there is nothing whose nature rules out its being the case that p, so that according to the theory, ◊p; nonetheless, it may seem, there might have existed something whose nature would rule out its being the case that p, so that it is possible that p only in a relative sense, not absolutely – it is possible that p, but not necessarily possible that p (contrary to the S5-character of absolute modalities).
As with our previous objection, one might respond by retreating from the claim that the logic of absolute modality is S5, or by abandoning the view that metaphysical necessities are (always) absolute. But can the objection be countered without giving ground on these matters? I do not have space for a full discussion of the issue. Instead, I want to sketch my reasons for thinking that a positive answer may be defensible. In broadest outline, I shall argue, first, that whilst there may indeed be a good deal of contingency about what does and doesn't exist, the crucial question, for the essentialist theory, is how variations in what exists might affect what natures there are; and second, that, to the extent that variations in what things exist does affect what natures there are or would be, the differences make no difference to what is necessary or possible.
Suppose, schematically, that it is true in virtue of x’s nature that p, and so, according to the essentialist theory, necessary that p. And suppose that x is something which exists only as a matter of contingency, and so might not have existed. If, but only if, x’s non-existence would entail that there is no such thing as x’s nature, there will – assuming there is nothing apart from x whose nature ensures that p – be a problem for the essentialist theory, because although it is, as things are, necessary that p, had x not existed, there would have been nothing whose nature ensured that p, so that it might not have been necessary that p, whence it is not absolutely necessary that p, but only necessary relative to how things are. So the crucial question, as far as (Necessity) is concerned, is whether x ‘s non-existence would mean that there is no such thing as x’s nature – no such thing as what it is to be x. In a similar way, when we turn to the possibility of there existing things besides those which actually exist, the crucial question, as far as (Possibility) is concerned, is whether the existence of such ‘new’ things would bring with it the existence of ‘new’ natures which might place additional constraints of what is possible – so that something which is possible as things are would not be possible, were there to exist those ‘new’ things.
A thing's essence or nature consists of those of its properties which are essential to it. We may accordingly take essences in general to be more or less complex conjunctive properties. For example, the essence or nature of elephants – what it is to be an elephant – consists in being a large mammal equipped with a trunk and tusks. Being a natural, or finite cardinal, number consists in being either 0 or one of its successors (i.e. standing in the ancestral of the relation of immediate succession to 0). And similarly in other cases. The existence of elephants appears to be a contingent matter. Sadly, it may be that one day there will no elephants, and it seems at least possible that there should never have been any elephants. Does that mean that there will be, or might have been, no such thing as being an elephant? It seems to me that it the answer is plainly that it would not. There being elephants is one thing, and there being such a thing as the property of being an elephant is another. Properties may be uninstantiated. It may be a contingent matter whether a certain property has instances. That does not mean that the existence of the property itself is a contingent matter. And in many cases, including many of the complex properties which constitute the essences of things, it will not be a contingent matter, on the theory of properties I favour.Footnote 32
According to that theory – the abundant theory – the conditions for the existence of properties and relations are in an important respect very weak and undemanding. To a first approximation, the existence of a meaningful predicate – that is, roughly, a predicate with well-understood application or satisfaction conditions – is sufficient for the existence of a corresponding property or relation. A property (or relation) is simply a way for a thing or things to be – the way something is (or the way some things are), if it (or they) satisfies (or satisfy) the predicate. Of course, it would be implausible to take the actual existence of a suitable predicate to be a necessary condition for that of a corresponding property. Which properties actually have predicates standing for them is an accident of history. To state a condition which is not only sufficient but also necessary, we must have recourse to modality – what is required (and also sufficient) is not that there actually be a suitable predicate, only that there could be one. At least, this is both necessary and sufficient for the existence of what we may call purely general properties. Purely general properties are those which correspond to purely general predicates – that is, predicates which essentially involve no names or other devices of definite singular reference. They are thus to be contrasted with object-dependent properties, properties specified by predicates essentially involving singular reference to an object or object which must exist, if there is to be a corresponding property.
The condition for the existence of purely general properties is essentially modal: for any purely general property φ, it is necessary and sufficient for φ's existence that there could be a predicate standing for it. If, as I believe we should, we take this modality to be absolute, and governed by S5 principles, then we can give a simple argument for the conclusion that all purely general properties exist necessarily – so that all purely general properties actually exist, and none of them could have failed to exist. Let φ be any purely general property, let p be the proposition that φ exists (i.e. that there is such a property as φ), and let q be the proposition that there exists a predicate standing for φ. Then according to the abundant theory, it is necessarily the case that p iff ◊q – that is: □(p↔◊q). It follows from this both that □p↔□◊q (by the K-principle) and that p↔◊q (by the Law of Necessity). But by S5, ◊q↔□◊q. Whence by the transitivity of the biconditional, p↔□p.
If that is right, then, in so far as the essences of things are purely general properties, contingencies of existence and non-existence can pose no threat to the essentialist theory's central claims (i.e. to (Necessity) and (Possibility)). In other words, there will be a threat only if some essences are not purely general, but object-dependent properties, and the existence of the objects in question is itself a contingent matter.
That at least some essences are, or involve, object-dependent properties seems to me virtually certain. A plausible example is afforded by the natural numbers. If these are to be defined as Frege proposed, to be a natural number is to be either identical with 0 or one of its successors.Footnote 33 However, whilst this makes the essence of the natural numbers an object-dependent property, the existence of 0 is arguably no contingent matter. Are there object-dependent essences involving objects which exist only contingently? And if there are, do they pose an insuperable problem for the essentialist theory? These questions call for fuller answers than I can give give here, where I have space for only a few brief and sketchy remarks.
The most plausible case I know for an affirmative answer to the first question turns on the essences of individual objects – individual people, plants, numbers, and so on. The essentialist should agree that individual objects have essences, for we can surely properly ask: what is it to be Aristotle, or to be this plant, or the number 17, say. Part of the answer, at least, will be that it is to belong to a certain kind. Thus to be Aristotle is, at least in part, to be a human being. But is there more to being Aristotle than this purely general property? Being a human being is something Aristotle has in common with many other individuals. It may be held that there must be something which makes Aristotle the particular human being he is. And now, since any purely general property will be one that he could share with other individuals, it may seem that what makes him the particular human being he is must be some object-dependent property, in virtue of which he and he alone is related to some other object(s) – an object or objects whose existence is as contingent as his own. The obvious candidates for such an object-dependent essential property are, of course, provided by the much discussed thesis that material objects, or at least individual living things, have their origins essentially. Thus what distinguishes Aristotle, say, from every other human being is his having originated in a particular egg and sperm produced by Phaestis and Nicomachus respectively.
This line of argument is clearly not irresistible. Apart from any doubts which may be felt about the specific thesis of essentiality of origin, it is anything but obvious that an individual must have an essence in the strong sense invoked in the argument – that is, an essential property which distinguishes it from every other individual. Here it is important cleanly to separate an epistemological sense of individuation from a metaphysical sense. Perhaps it is true – although it is scarcely uncontroversial – that if an individual is to be individuated in thought or speech, there must be some means of distinguishing that individual from all others. But that is clearly an epistemological matter, and the property or properties by which something is individuated in this sense – singled out, in thought or speech – need not, and typically will not, be essential ones. In any case, what is needed is an argument for the metaphysical principle that for each particular object a, there must be some essential property of a which by virtue of which it is distinct from every other object. The prospects for such an argument do not seem bright. We may well be reluctant to accept the distinctness of distinct individuals as a brute, inexplicable fact. But even if something akin to the Identity of Indiscernibles – say that ∀x∀y(x≠y→∃φ(φx↔¬φy)) – were accepted as necessarily true, it would fall well short of what is required. First, nothing in this ensures that the property which separates any given pair of individuals is an essential property of either; second, there is plainly no valid move from Leibniz's principle to the much stronger one that is required, which asserts that there is, for any given individual, some single property which separates it from every other individual, i.e. ∀x∃φ∀y(x≠y→(φx↔¬φy)).
In the absence of any clear and compelling grounds for thinking that at least some individual essences must be object-dependent properties involving contingently existing objects, the essentialist may agree that there are contingencies of existence and non-existence, but argue that they make no difference to what essences there are, and so after all pose no threat to his central principles. But even if he does accept that at least some individuals have object-dependent essences requiring objects whose existence is a contingent matter, it is still far from clear that trouble must ensue. There are two cases to consider, according as what is envisaged is the existence of ‘new’ objects, distinct from all those which actually exist, or the non-existence of some of those objects which actually exist. In the first case, what is envisaged involves there being some ‘new’ individual essences; in the second, we envisage the absence of some ‘old’ (i.e. actually existing) individual essences.
The essentialist may argue, in the first case, that the possibility of there being new objects is purely general – it is the possibility that there should exist objects of some general kind, distinct from any (actually) existing objects of that kind. Since that general kind – as distinct from its ‘new’ instances – already exists, any limitations it may impose on what is possible are already in place. Further, while its envisaged new instances – or so we are conceding – will possess object-dependent essences, either those essences will depend on objects which actually exist, or they will be envisaged as depending upon further ‘new’ objects of some relevant general kind(s). To the extent that they are envisaged as dependent on actually existing objects, any constraints they impose on what is possible must be already in place; but to the extent that they may depend upon ‘new’ objects, any resulting constraints are purely general, resulting from having an essence of a certain kind, and so once again are already in place. Thus there are no essentially new constraints on what is possible.Footnote 34, Footnote 35
Turning to the second case, the crucial question is whether the absence of some ‘old’ essence – i.e. the essence of some actually existing individual – could open up possibilities which are, as things stand, closed off by its presence. The essentialist can argue that it cannot do so, as follows. Suppose, for example, there were no such thing as Aristotle's essence. If there is any proposition which could not be true as things stand, but which could be true, were there no such thing as Aristotle's essence, it must be one expressing a putative possibility concerning Aristotle himself, since Aristotle's essence is irrelevant to, and imposes no constraints upon, putative possibilities in which he is not involved. It would have to be a proposition such as the proposition that Aristotle might have been a frog – a proposition which is, and must be, false as things are, because its truth is ruled out by Aristotle's essence (supposing it to be true in virtue of Aristotle's nature that if he exists at all, he is a man). The question is whether such a proposition would or could be true, were there to be no such thing as Aristotle's essence. And the answer is that it would not, and could not, be true, and it is sheer confusion to think otherwise. It is, of course, possible that there should have been no such thing as Aristotle's essence. Had there been no such thing as his essence, there would have been no such thing as Aristotle either. Hence there would have been no singular propositions concerning Aristotle. Thus, had there been no such thing as Aristotle's essence, there would have been no such proposition as the proposition that Aristotle might have been a frog – so that proposition could not be true in those circumstances, because it would not have existed in those circumstances. The only question that remains is whether that proposition – which does, of course, actually exist (since Aristotle and his essence exist) – is or could be true of those possible circumstances. And the answer is that it could not be true, because its truth is ruled out by Aristotle's essence.Footnote 36
7. Concluding remark
Many of the issues discussed here require much fuller treatment than I have been able to manage here, and there are doubtless further questions and problems which will need to be addressed, before we may have any confidence that the essentialist theory is viable. But I hope I have done enough to persuade you that the theory has at least some merit and is worthy of further investigation.