EXPLORING THE LANDSCAPE OF RELATIONAL SYLLOGISTIC LOGICS

ALEX KRUCKMAN; LAWRENCE S. MOSS

doi:10.1017/S1755020320000386

EXPLORING THE LANDSCAPE OF RELATIONAL SYLLOGISTIC LOGICS

Part of: General logic

Published online by Cambridge University Press: 21 October 2020

ALEX KRUCKMAN and

LAWRENCE S. MOSS

Show author details

ALEX KRUCKMAN: Affiliation:
DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE WESLEYAN UNIVERSITYMIDDLETOWN, CT, USA E-mail: akruckman@wesleyan.edu
LAWRENCE S. MOSS: Affiliation:
DEPARTMENT OF MATHEMATICS INDIANA UNIVERSITY BLOOMINGTONBLOOMINGTON, IN, USA E-mail: lmoss@indiana.edu

Article contents

Abstract
Introduction
${\cal L}_{2}$: Adding the sentence former $({\mathsf{some} \ x \ y})$ to ${{\cal L}_{1}}$
${\cal L}_{3}$ and ${\cal L}_{3.5}$: Adding the term former $({\textit{r} \ \mathsf{some} \ x})$ to ${{\cal L}_{1}}$ and ${\cal L}_{2}$
${\cal L}_{4}$ and ${\cal L}_{4.5}$: Adding term complementation to ${{\cal L}_{1}}$ and ${\cal L}_{2}$
${\cal L}_{5}$ and ${{\cal L}_{5.5}}$: Putting it all together
Footnotes
References

Rights & Permissions

Abstract

This paper explores relational syllogistic logics, a family of logical systems related to reasoning about relations in extensions of the classical syllogistic. These are all decidable logical systems. We prove completeness theorems and complexity results for a natural subfamily of relational syllogistic logics, parametrized by constructors for terms and for sentences.

Keywords

natural logic syllogistic logic proof systems completeness computational complexity

MSC classification

Primary: 03B65: Logic of natural languages

Type: Research Article
Information: The Review of Symbolic Logic , Volume 14 , Issue 3 , September 2021 , pp. 728 - 765

DOI: https://doi.org/10.1017/S1755020320000386 [Opens in a new window]
Copyright: © The Review of Symbolic Logic, 2020

1 Introduction

This paper explores several fragments of relational syllogistic logic and aims to provide completeness and complexity results for them. These are among the simplest of all logical systems. To set notation and terminology by example, let us consider the absolutely simplest logical system ${\mathcal {A}}$ , the one for sentences “all p are q” introduced in [3]. The syntax begins with a set ${\mathbf {N}}$ , called the nouns. Then the sentences of ${\mathcal {A}}$ are simply the expressions $(\mathsf{all}\ p \ q)$ , where p and q are nouns. The semantics is equally straightforward. A model ${\cal M}$ is a set M together with an interpretation function giving a subset $[\![ p ]\!]\subseteq M$ for each noun p. We then define:

(1)

$$ \begin{align} {\cal M}\models\ \mathsf{all}\ p \ q {\quad \text{iff}\quad} [\![p]\!] \subseteq [\![ q ]\!]. \end{align} $$

We employ standard model-theoretic notation and terminology. We say that ${\cal M}$ satisfies a sentence ${\varphi} $ , or that ${\cal M}$ is a model of ${\varphi} $ , when ${\cal M}\models {\varphi} $ . A theory is a set of sentences. Given a theory $\Gamma $ , we write ${\cal M}\models \Gamma $ to mean that ${\cal M}$ satisfies every sentence in $\Gamma $ ; naturally we say that ${\cal M}$ satisfies $\Gamma $ , or that ${\cal M}$ is a model of $\Gamma $ . We then have the usual notion of logical consequence: given a theory $\Gamma $ and a sentence ${\varphi} $ , we write $\Gamma \models {\varphi} $ if every model of $\Gamma $ satisfies ${\varphi} $ .

We match the semantics with a proof system. Our system has two rules of inference, shown below:

In these rules, the material above the line is the set of premises, and the sentence below is the conclusion. So $({\scriptstyle \mathrm{AX}})$ has no premises, and $({\scriptstyle \mathrm{BARBARA}})$ has two. A substitution instance of a rule is obtained by substituting nouns in ${\mathbf {N}}$ for the variables x, y, and z. We can then define the provability relation $\Gamma \vdash {\varphi} $ : this means that there is a tree whose root is ${\varphi} $ , and every node in the tree is either a leaf and belongs to $\Gamma $ , or else it is the conclusion, and its children are the premises, of a substitution instance of one of our rules. The soundness/completeness theorem for this system states that $\Gamma \models {\varphi} $ iff $\Gamma \vdash {\varphi} $ . For the proof, see [Reference Moss, Hamm and Kepser3].

1.1 Syntax and semantics of the logics in this paper

We are concerned not with ${\mathcal {A}}$ but rather with a family of extensions of it. We start with a set ${\mathbf {N}}$ of nouns (just as above), and also a set ${\textbf {V}}$ , the verbs. Then we define terms and sentences via the syntax below, where p is any noun in ${\mathbf {N}}$ , r is any verb in ${\textbf {V}}$ , and x and y are any terms:

(2)

$$ \begin{align} \begin{array}{ll} \text{terms} &\qquad p \mid {r \ \mathsf{all} \ x} \mid {r\ \mathsf{some}\ x} \mid {\overline{{x}}} \\ \text{sentences} &\qquad {\mathsf{all}\ x\ y} \mid {\mathsf{some}\ x\ y}. \end{array} \end{align} $$

Note that we have recursion here, so terms can be nested, e.g. $(r \ \mathsf{all} \ (\overline{s \textsf{ some } p}))$ .

For the semantics, we start with a model in our previous sense (to interpret the nouns), and add interpretations of the verbs as binary relations: $[\![ r ]\!] \subseteq M\times M$ for all $r\in {\textbf {V}}$ . Then we interpret our terms in a given model by recursion as follows:

$$ \begin{align*} {[\![ {{r \ \mathsf{all} \ x}} ]\!]} &= \{m \in M : \text{for all } n\in [\![ x ]\!], m [\![ r ]\!] n\} \\ [\![ {{\textit{r} \ \mathsf{some}\ x}} ]\!] & = \{m\in M : \text{there is } n\in [\![ x ]\!] \text{ such that } m [\![ r ]\!] n \} \\ [\![ {{\overline{{x}}}} ]\!] & = M\setminus[\![ x ]\!]. \end{align*} $$

Thus, the interpretation of every term is a subset of M. We define the truth-relation for sentences and models by generalizing (1) above:

$$ \begin{align*} {\cal M}\models {\mathsf{all } \ x \ y} & {\quad \text{iff}\quad} [\![ x ]\!] \subseteq [\![ y ]\!] \\ {\cal M}\models {\mathsf{some} \ x \ y} & {\quad \text{iff}\quad} [\![ x ]\!] \cap [\![ y ]\!]\neq\emptyset. \end{align*} $$

The basic languages in this paper are all sublanguages of the language just presented, which we call ${{\cal L}_{5.5}}$ . And they are all extensions of the language we call ${{\cal L}_{1}}$ , which only has the sentence former (all x y) and the term former (r all x). There are three features in ${{\cal L}_{5.5}}$ which are absent from ${{\cal L}_{1}}$ : the sentence former $({\mathsf{some} \ x \ y})$ , the term former (r some x), and term complementation. We thus explore $2^3 = 8$ logical systems, obtained by all possible combinations of these features. Those languages are listed in the chart in Figure 1. We organize matters by studying ${\cal L}_n$ and ${\cal L}_{n.5}$ in Section n. Note that ${{\cal L}_{1}}$ and ${\cal L}_{2}$ are related in the same was as ${\cal L}_n$ and ${\cal L}_{n.5}$ for $n>2$ , namely by adding the sentence former $({\mathsf{some} \ x \ y})$ .

Fig. 1 Languages in this paper, given by section.

We are interested in complete proof systems and the complexity of the consequence relation for each of these languages. By this we mean the computational complexity of the following decision problem: given a finite theory $\Gamma $ and a sentence ${\varphi} $ , output “yes” if $\Gamma \models {\varphi} $ and “no” otherwise. Our results, which are summarized in the last two columns of Figure 1, will be explained in more detail in §1.5 below.

1.2 Related work

This paper is similar in spirit to [Reference Pratt-Hartmann and Moss10], which also considered completeness and complexity results for decidable languages extending the basic syllogistic logic ${\mathcal {A}}$ . In fact, the largest language of this paper, ${{\cal L}_{5.5}}$ , is essentially equivalent to the largest language considered in [Reference Pratt-Hartmann and Moss10], called $\mathcal {R}^{*\dagger }$ there (see Propositions 5.1 and 5.2 below). But there are several differences between the other languages in [Reference Pratt-Hartmann and Moss10] and in this paper. First, [Reference Pratt-Hartmann and Moss10] allowed for complemented verbs, contrary to what we do here. Second, [Reference Pratt-Hartmann and Moss10] also explored logical systems where the two terms in sentences like $({\mathsf{all} \ x \ y})$ were not treated the same. For example, in two systems there, the subject noun phrase x was required to be (negated) atomic. Third, we allow nested terms, which are not part of the syntax in [Reference Pratt-Hartmann and Moss10]. The upshot is that there is no overlap in the technical results from [Reference Pratt-Hartmann and Moss10] and this paper, except for the result on ${{\cal L}_{5.5}}$ which we quote in §5.

In [Reference McAllester and Givan2], McAllester and Givan considered a language which is a slight extension of our language ${\cal L}_{3.5}$ . In §3, we essentially provide a new proof of their result that the consequence relation for this language is ${{\rm C}{\scriptstyle \mathrm{O-}}{\rm NPT}{\scriptstyle \mathrm{IME}}}$ complete. Our version of this result is slightly sharper, since we prove that the weaker language ${\cal L}_{3}$ is already ${{\rm C}{\scriptstyle \mathrm{O-}}{\rm NPT}{\scriptstyle \mathrm{IME}}}$ hard, and we also provide complete (though nonsyllogistic) proof systems for ${\cal L}_{3}$ and ${\cal L}_{3.5}$ .

We began this paper with ${\mathcal {A}}$ , which was introduced in [Reference Moss, Hamm and Kepser3]. But the smallest language in Figure 1 is ${{\cal L}_{1}}$ , which is the extension of ${\mathcal {A}}$ by the term former $({r \ \mathsf{all} \ x})$ . As a proof system for ${{\cal L}_{1}}$ , we take the rules $({\scriptstyle \mathrm{AX}})$ and $({\scriptstyle \mathrm{BARBARA}})$ above (but we allow terms, not just nouns, to be substituted for the variables x, y, and z), together with a new rule:

It is easy to see that this proof system is sound (if $\Gamma \vdash{\varphi} $ , then $\Gamma \models {\varphi} $ ). Completeness was shown in [Reference Moss and Kruckman8], which also contains completeness and complexity results for a number of related languages. The completeness proof originated in [Reference Moss7], where it is also shown that the consequence relation for ${{\cal L}_{1}}$ is in ${{\rm PT}{\scriptstyle \mathrm{IME}}}$ . We reprove these results in §1.4 below, but using a more general framework described in §1.3. This framework unifies the ${{\rm PT}{\scriptstyle \mathrm{IME}}}$ results for ${{\cal L}_{1}}$ and ${\cal L}_{2}$ in §2.5, and allows us to obtain more precise negative results for the other languages in the paper, as we shall see.

1.3 Syllogistic proof systems and bounded completeness

At this point, we wish to formally state what we mean by a syllogistic proof system. To state rules, we employ a language with noun variables $p,q,\dots $ , verb variables $r, s,\dots $ , term variables $x,y,\dots $ , and sentence variables ${\varphi} ,\psi ,\dots $ . (In practice, none of our rules will make use of noun variables, and very few will make use of sentence variables.)

A term template is defined as in (2), but using noun and verb variables in place of nouns and verbs, and with an additional base case: a term variable is a term template. A sentence template is defined as in (2), but using term templates in place of terms, and with an addition option: a sentence variable is a sentence template.

A syllogistic rule $\rho $ consists of finitely many (possibly none) sentence templates as premises, and a single sentence template as a conclusion. A syllogistic proof system is a finite set of syllogistic rules.

We use ${\vdash }$ to denote a syllogistic proof system. Given a syllogistic proof system, we also use the symbol ${\vdash }$ for the (standard) provability relation, defined shortly. A substitution instance of a rule $\rho $ is obtained by substituting nouns, verbs, terms, and sentences for all noun variables, verb variables, term variables, and sentence variables, respectively. A proof tree over a theory $\Gamma $ is a tree labeled by sentences, such that each node is either a leaf and belongs to $\Gamma $ , or else it is the conclusion, and its children are the premises, of a substitution instance of one of the rules. We write $\Gamma \vdash {\varphi} $ if there is a proof tree over $\Gamma $ whose root is ${\varphi} $ .

We should mention that a syllogistic proof system is subject to some important limitations. First, the system cannot include rules which allow for the withdrawal of assumptions, as in reductio ad absurdum, or proof by cases. (On the other hand, ex falso quodlibet is a syllogistic rule; see [Reference Moss, Hamm and Kepser3].) Second, the premises of each rule must be a fixed finite set of sentence templates; the set of premises cannot be listed as a schema.

Later in this paper, we shall see proof systems that are not syllogistic in our sense: To obtain completeness theorems for various logics, we need to add the rule (cases) and its variants $({\scriptstyle \mathrm{CASES}}_2)$ , $({\scriptstyle \mathrm{CASES}}')$ , and $({\scriptstyle \mathrm{CASES}}_{2}^{\prime })$ in §§2.3 and 3, the rule (raa) in §4, and the schema (chains) in §2.4.

Next, we introduce a strengthening of the notion of syllogistic proof system, which ensures that the consequence relation $\Gamma \models \varphi $ is efficiently decidable, for any finite theory $\Gamma $ and any sentence $\varphi $ .

Definition 1.1. Let ${\cal L}$ be a language, equipped with a syllogistic proof system ${\vdash }$ , and let A be any set of sentences in ${\cal L}$ . We write $\Gamma\ {{\vdash }_{ A}}\ {\varphi} $ if $\Gamma \vdash {\varphi} $ via a proof tree ${\cal T}$ with the property that all sentences in ${\cal T}$ belong to A.

Example 1.2. This example is based on the observation that

(3)

$$ \begin{align} \{\mathsf{all } \ x \ y, \mathsf{all } \ y \ z\}\ {\vdash}\ {\mathsf{all}\ (r \ \mathsf{all} \ z) \ (r \ \mathsf{all} \ x)}. \end{align} $$

For example, here is a proof tree:

Let A be the set of all sentences $\psi $ such that every subterm of $\psi $ is in the set $\{x,y,z, r \ \mathsf{all} \ x, r \ \mathsf{all} \ z\}$ (this is the set of subterms of the sentences appearing in (3)).

Our tree above does not show that

(4)

$$ \begin{align} \{\mathsf{all} \ x \ y , \mathsf{all} \ y \ z\} \ {{\vdash}_{ A}}\ \mathsf{all} \ (r \ \mathsf{all} \ z)\ (r \ \mathsf{all} \ x). \end{align} $$

The problem is that the term $({r \ \mathsf{all} \ y})$ is not in A. But the tree below does show (4):

Definition 1.3. A boundedly complete syllogistic proof system for ${\cal L}$ is a syllogistic proof system ${\vdash }$ for ${\cal L}$ which is sound, and such that that there exists a ${{\rm PT}{\scriptstyle \mathrm{IME}}}$ -computable $f:{\cal P}_{\text { fin}}({\cal L})\to {\cal P}_{\text { fin}}({\cal L})$ such that whenever $\Gamma \models {\varphi} $ , then $\Gamma\ {{\vdash }_{ \text {f}(\Gamma \cup \{ {\varphi} \})}}\ {\varphi} $ .

Example 1.4. Suppose we are working with the language ${{\cal L}_{1}}$ . For any finite set of sentences $\Delta $ , let $f(\Delta )$ be the set of all sentences $\varphi $ such that every subterm of $\varphi $ is a subterm of a sentence in $\Delta $ . Note that f is computable in polynomial time by enumerating the subterms of sentences in $\Delta $ , and then forming all sentences $(\mathsf{all} \ u \ v)$ such that u and v are on the list. In connection with Example 1.2, the set A there is exactly $f(\{\mathsf{all } \ x \ y, \mathsf{all} \ y \ z,\ {\mathsf{all}\ (r \ \mathsf{all} \ z) \ (r \ \mathsf{all} \ x)}\})$ . In Proposition 1.6 below, we are going to show that the proof system with rules ( ${\scriptstyle \mathrm{AX}}$ ), ( ${\scriptstyle \mathrm{BARBARA}}$ ), and ( ${\scriptstyle \mathrm{ANTI}}$ ) is boundedly complete for ${{\cal L}_{1}}$ . The function f could serve as a witness. But in order to simplify the proof, we use a slightly different function g.

Theorem 1.5. Fix a language ${\cal L}$ . Let ${\vdash }$ be a boundedly complete syllogistic proof system for a language ${\cal L}$ . Then ${\vdash }$ is complete, and for any finite theory $\Gamma $ and any sentence $\varphi $ , the problem of deciding whether $\Gamma \models {\varphi} $ is in ${{\rm PT}{\scriptstyle \mathrm{IME}}}$ .

Proof. This is essentially Appendix A of McAllester [Reference McAllester1]. Here is a sketch, based on this, and also on a parallel result in [Reference Pratt-Hartmann and Moss10] which was proved in the easier setting of syllogistic logics without complex terms.

Let $f:{\cal P}_{\text { fin}}({\cal L})\to {\cal P}_{\text { fin}}({\cal L})$ be a ${{\rm PT}{\scriptstyle \mathrm{IME}}}$ -computable function such that $\Gamma\ {{\vdash }_{ \text {f}(\Gamma \cup \{ {\varphi} \})}}\ {\varphi} $ whenever $\Gamma \models {\varphi} $ . In particular, $\Gamma \models {\varphi} $ implies that $\Gamma \vdash {\varphi} $ . So the proof system is complete.

For the ${{\rm PT}{\scriptstyle \mathrm{IME}}}$ decidability, first compute $f(\Gamma \cup \{ {\varphi} \})$ . Call this set A. Let $X_0 = \Gamma \cap A$ . We compute an increasing sequence of subsets of A by induction. Given $X_n$ , take each of the finitely many rules $\rho $ of the logic, and do the following: compute the set of all substitution instances of $\rho $ whose premises are all in $X_n$ ; for each such substitution instance, if the conclusion $\psi $ belongs to A, then add $\psi $ to $X_{n+1}$ . Continue until the first $n^*$ such that $X_{n^*+1} =X_{n^*}$ . Since all the $X_n$ are subsets of A, we have $n^* \leq 1 + |A|$ . And $\Gamma\ {{\vdash }_{ A}}\ {\varphi} $ iff $ {\varphi} \in X_{n^*}$ . We take it as standard that all of this can be done in ${{\rm PT}{\scriptstyle \mathrm{IME}}}$ . □

1.4 Example: Completeness and ${\scriptstyle \mathrm{PTIME}}$ decidability for ${{\cal L}_{1}}$

We illustrate the application of Theorem 1.5 to ${{\cal L}_{1}}$ with the syllogistic proof system consisting of the rules $({\scriptstyle \mathrm{AX}})$ , $({\scriptstyle \mathrm{BARBARA}})$ , and $({\scriptstyle \mathrm{ANTI}})$ .

For any set of sentences $\Delta $ , let $T(\Delta )$ be the set of subterms of sentences in $\Delta $ . Let $T^+(\Delta )$ be $T(\Delta )$ together with the terms $(r \ \mathsf{all} \ w)$ where $w\in T(\Delta )$ and where r is a verb which appears in $\Delta $ .

Let $g(\Delta )$ be the set of all sentences $(\mathsf{all} \ u \ v)$ , where $u\in T(\Delta )$ and $v\in T^+(\Delta )$ . Note that when $\Delta $ is finite, g is computable in ${{\rm PT}{\scriptstyle \mathrm{IME}}}$ .

Proposition 1.6 [Reference Moss7].

If $\Gamma \models {\varphi} $ , then $\Gamma\ {\vdash}_{g(\Gamma \cup \{\varphi \})}\ {\varphi} $ . Hence the consequence relation for ${{\cal L}_{1}}$ is in ${{\rm PT}{\scriptstyle \mathrm{IME}}}$ .

Proof. Fix a finite theory $\Gamma $ and a sentence $\varphi $ . We are going to save on some notation in this proof by writing T for $T(\Gamma \cup \{ {\varphi} \})$ , $T^+$ for $T^+(\Gamma \cup \{ {\varphi} \})$ , and A for $g(\Gamma \cup \{ {\varphi} \})$ .

We make a model ${\cal M}$ as follows. The domain M of the model is T. The structure of the model is given by

$$ \begin{align*} t\in [\![ p ]\!] &{\quad \text{iff}\quad} \Gamma\ {{\vdash}_{ A}}\ {\mathsf{all} \ t \ p}\\ t[\![ r ]\!] u &{\quad \text{iff}\quad} \Gamma\ {{\vdash}_{ A}}\ {\mathsf{all} \ t\ (r \ \mathsf{all} \ u)} \end{align*} $$

(Recall that the set A is fixed as $g(\Gamma \cup \{\varphi \})$ . When we write $\Gamma\ {{\vdash }_{ A}}\ \psi $ in this proof, we are not changing the meaning of A to $g(\Gamma \cup \{ \psi \})$ .)

Claim 1.7 (Truth Lemma). For all $a\in T$ ,

(5)

$$ \begin{align} t\in [\![ a ]\!] {\quad \text{iff}\quad} \Gamma\ {{\vdash}_{ A}}\ {\mathsf{all} \ t \ a}. \end{align} $$

Proof. The proof is by induction on a. When a is a noun, the assertion in (5) is part of the definition of the model. Assume (5) for a, and consider $({r \ \mathsf{all} \ a})$ . Since $({r \ \mathsf{all} \ a})\in T$ , then $a\in T = M$ as well. Note also that the sentence $({\mathsf{all} \ a \ a})$ belongs to A, and so $\Gamma\ {{\vdash }_{ A}}\ {\mathsf{all} \ a \ a}$ by $({\scriptstyle \mathrm{AX}})$ . By induction, $a\in [\![ a ]\!]$ .

Let $t\in [\![ {{r \ \mathsf{all} \ a}} ]\!]$ . Since $a\in [\![ a ]\!]$ , we have $t[\![ r ]\!] a$ , and hence $\Gamma\ {{\vdash }_{ A}}\ {\mathsf{all} \ {t} \ {(r \ \mathsf{all} \ a)}}$ by definition of $[\![ r ]\!]$ .

Conversely, assume that $\Gamma\ {{\vdash }_{ A}}\ {\mathsf{all} \ {t} \ {(r \ \mathsf{all} \ a)}}$ . Then $t\in T = M$ . We show that $t\in [\![ {{r \ \mathsf{all} \ a}} ]\!]$ . For this, let $b\in [\![ a ]\!]$ , so by induction, $\Gamma\ {{\vdash }_{ A}}\ {\mathsf{all} \ b \ a}$ . We must show that $t[\![ r ]\!] b$ , equivalently $\Gamma\ {{\vdash }_{ A}}\ {\mathsf{all} \ t \ {(r \ \mathsf{all} \ b)}}$ . Note that $b\in M=T$ , so $({r \ \mathsf{all} \ b})\in T^+$ . The key point is that the sentence ${\mathsf{all}\ (r \ \mathsf{all} \ a) \ (r \ \mathsf{all} \ b)}$ belongs to A, since $({r \ \mathsf{all} \ a})\in T$ and $({r \ \mathsf{all} \ b})\in T^+$ . Using $({\scriptstyle \mathrm{ANTI}})$ and $\Gamma\ {{\vdash }_{ A}}\ {\mathsf{all} \ b \ a}$ , we see that $\Gamma\ {{\vdash }_{ A}}\ \mathsf{all} \ (r \ \mathsf{all} \ a) \ (r \ \mathsf{all} \ b)$ . Then by $({\scriptstyle \mathrm{BARBARA}})$ , $\Gamma\ {{\vdash }_{ A}}\ {\mathsf{all} \ t \ (r \ \mathsf{all} \ b)} $ . □

We continue by showing that ${\cal M}\models \Gamma $ . For this, take any sentence $(\mathsf{all} \ u \ v)$ in $\Gamma $ . Let $t\in [\![ u ]\!]$ . By (5), $\Gamma\ {{\vdash }_{ A}}\ {\mathsf{all} \ t \ u}$ . Now t, u, and v are in T, so the sentences $(\mathsf{all} \ u \ v)$ and $(\mathsf{all} \ t \ v)$ are in A. By $({\scriptstyle \mathrm{BARBARA}})$ , $\Gamma\ {{\vdash }_{ A}}\ {\mathsf{all} \ t \ v}$ . By (5) again, $t\in [\![ v ]\!]$ . Since t was arbitrary, we have shown that $[\![ u ]\!]\subseteq [\![ v ]\!]$ , and ${\cal M}\models {\mathsf{all} \ u \ v}$ .

Since ${\cal M}\models \Gamma $ and $\Gamma \models {\varphi} $ , the sentence $ {\varphi} $ holds in ${\cal M}$ . Let us write $ {\varphi} $ as $({\mathsf{all} \ a \ b})$ . Then $a\in T = M$ and the sentence $({\mathsf{all} \ a \ a})$ belongs to A, and so $\Gamma\ {{\vdash }_{ A}}\ {\mathsf{all} \ a \ a}$ by $({\scriptstyle \mathrm{AX}})$ . Thus, $a\in [\![ a ]\!]$ by (5). So $a\in [\![ b ]\!]$ . By (5) again, $\Gamma\ {{\vdash }_{ A}}\ {\mathsf{all} \ a \ b}$ . This concludes the proof.□

1.5 Results

Our two main themes are trade-offs between expressive power and computational complexity, and also the variety of devices that one can add on top of pure syllogistic logic in order to obtain sound and complete proof systems. Our results are summarized in Figure 1.

We begin with a negative result: ${\cal L}_{2}$ has no sound and complete syllogistic proof system. The argument for this is combinatorial, and in outline it is based on a similar result in [Reference Pratt-Hartmann and Moss10] for ${\mathcal {R}}$ . Nevertheless, there are proof systems which capture the consequence relation of ${\cal L}_{2}$ . Most of §2 is devoted to several such logics, each of which extends syllogistic logic in a different way. One way is to add a rule (cases) which enables proof by cases, another uses a schema of rules called (chains) (thus there are infinitely many rules, but the set of rules is an easily-defined set), and the last is to extend the syntax in such a way that the extension, called ${\cal L}_{2}^+$ , does have a sound and complete syllogistic proof system. In fact, this system is boundedly complete, and by Theorem 1.5, the consequence relation of ${\cal L}_{2}^+$ , and hence of its sublanguage ${\cal L}_{2}$ , is in ${{\rm PT}{\scriptstyle \mathrm{IME}}}$ .

Notice that what we will show is that ${\cal L}_{2}$ has no sound and complete syllogistic proof system, but the larger language ${\cal L}_{2}^+$ does have such a proof system. The first example of this phenomenon is in [Reference Pratt-Hartmann9].

The lower bounds on complexity established in the rest of the paper show that (assuming P $\neq $ NP) none of the other languages admit boundedly complete syllogistic proof systems.

The languages ${\cal L}_{3}$ and ${\cal L}_{3.5}$ extend ${{\cal L}_{1}}$ and ${\cal L}_{2}$ with the term former $({\textit{r} \ \mathsf{some} \ x})$ . We show that the smaller language ${\cal L}_{3}$ has a consequence relation which is ${{\rm C}{\scriptstyle \mathrm{O-}}}{\rm NPT}{\scriptstyle \mathrm{IME}}$ hard, via a reduction from the one-in-three positive $3$ -SAT problem. On the other hand, a finite countermodel construction shows that the consequence relation of the larger language ${\cal L}_{3.5}$ is in ${{\rm C}{\scriptstyle \mathrm{O-}}}{\rm NPT}{\scriptstyle \mathrm{IME}}$ . It follows that the consequence relation for both languages are ${{\rm C}{\scriptstyle \mathrm{O-}}}{\rm NPT}{\scriptstyle \mathrm{IME}}$ complete. We give sound and complete proof systems for these logics which are not syllogistic: they use variants of the rule (cases).

The languages ${\cal L}_{4}$ and ${\cal L}_{4.5}$ extend ${{\cal L}_{1}}$ and ${\cal L}_{2}$ with term complementation. As with ${\cal L}_{3}$ , we show that the consequence relation for ${\cal L}_{4}$ is ${{\rm C}{\scriptstyle \mathrm{O-}}}{\rm NPT}{\scriptstyle \mathrm{IME}}$ hard, via a reduction from $3$ -SAT. But this time we leave open the question of ${{\rm C}{\scriptstyle \mathrm{O-}}}{\rm NPT}{\scriptstyle \mathrm{IME}}$ completeness; the upper bound of ${{\scriptstyle \mathrm{EXPTIME}}}$ comes from the known complexity of the larger language ${{\cal L}_{5.5}}$ . We also leave open the problem of formulating proof systems for ${\cal L}_{4}$ and ${\cal L}_{4.5}$ in their original syntax. Instead, we present a completeness result for an extension ${\cal L}_{4.5}^+$ of ${\cal L}_{4.5}$ . The proof system and the completeness result are comparatively simple, though the proof system is decidedly nonsyllogistic: it includes a form of reductio ad absurdum (raa), as well as several rule schemas, and the syntax is not even finitary, since there is an infinite family of sentence formers. This proof system restricts, by dropping $({\scriptstyle \mathrm{RAA}})$ , to a sound and complete proof system for the corresponding extension ${\cal L}_{4}^+$ of ${\cal L}_{4}$ .

The largest languages in this paper are ${\cal L}_{5}$ and ${{\cal L}_{5.5}}$ . We have the least to say about them, mostly because ${{\cal L}_{5.5}}$ has been studied (in a different but equivalent formulation, called $\mathcal {R}^{*\dagger }$ ) in [Reference Pratt-Hartmann and Moss10], and the complexity result for it from [Reference Pratt-Hartmann and Moss10] also holds for ${\cal L}_{5}$ , as we shall see. The sound and complete proof system for $\mathcal {R}^{*\dagger }$ from [Reference Pratt-Hartmann and Moss10] (which is nonsyllogistic due to its use of individual variables) can be adapted to a similar proof system for ${{\cal L}_{5.5}}$ , but we do not make that explicit here. We leave open the problem of formulating a proof system for ${\cal L}_{5}$ .

One might guess that when we explore a partial order of logics, stronger logics are harder to work with and to prove completeness for. But this is not always the case. One reason: as the logics get stronger, they include more and more features of first-order logic and are thus easier to analyze, due to our experience with first-order logic. A second reason: sometimes adding to the syntax of a logic restores harmony in some way, thereby making it easier for us to work with. The examples of ${\cal L}_{2}^+$ , ${\cal L}_{4}^+$ , and ${\cal L}_{4.5}^+$ emphasize the fact that there is not a monotone relationship between the strength of logical systems and their elegance, or between their strength and the difficulty of proving completeness. It is an open problem to develop a general theory which could explain this phenomenon. For example, why it is that some logical systems have (boundedly complete) syllogistic proof systems ( ${\cal L}_{2}^+$ ), while others do not ( ${\cal L}_{2}$ )? What we have at present are some ad hoc results, and much remains to be done.

2 ${\cal L}_{2}$ : Adding the sentence former $({\mathsf{some} \ x \ y})$ to ${{\cal L}_{1}}$

The main results on ${\cal L}_{2}$ are (1) it has no finite sound and complete syllogistic proof system; (2) nevertheless there are several nonsyllogistic devices which allow us to obtain sound and complete proof systems; (3) alternatively, we can extend the syntax of ${\cal L}_{2}$ to a larger language ${\cal L}_{2}^+$ in such a way that ${\cal L}_{2}^+$ has a boundedly complete syllogistic proof system; and (4) as a result of this last point, the consequence relation $\Gamma \models {\varphi} $ for ${\cal L}_{2}^+$ (and hence ${\cal L}_{2}$ ) is in ${{\rm PT}{\scriptstyle \mathrm{IME}}}$ .

2.1 The base system ${\vdash }_0$

We begin with a proof system in Figure 2 that will be used in this section and beyond.

Fig. 2 The proof rules of ${\vdash }_0$ .

It is easy to check that the rules in Figure 2 are sound. Theorem 2.3 below shows that they are also complete for conclusions of the form $({\mathsf{all } \ x \ y})$ . Since the rules $({\scriptstyle \mathrm{AX}})$ , $({\scriptstyle \mathrm{BARBARA}})$ , and $ ({\scriptstyle \mathrm{ANTI}})$ are complete for ${{\cal L}_{1}}$ , this result can be interpreted as saying that ${\cal L}_{2}$ is a conservative extension of ${{\cal L}_{1}}$ . It is also possible to give a direct model-theoretic proof of this conservativity result.

In many places in this paper, we will work with a set ${\mathbb {T}}$ of terms. We will always assume that such a set ${\mathbb {T}}$ is closed under subterms. We could usually take ${\mathbb {T}}$ to be the set of all terms in the language under study. But when we use ${\mathbb {T}}$ to build a model (as we do just below), that model will be infinite when ${\mathbb {T}}$ is infinite. Working with a finite set ${\mathbb {T}}$ allows us to build finite models in many situations.

We write $\Gamma \ {\vdash }_0\ \varphi $ if there is a proof of the sentence $\varphi $ from the theory $\Gamma $ using the rules in Figure 2. We also write

$$ \begin{align*} x \leq y \end{align*} $$

to mean that $\Gamma \ {\vdash }_0\ {\mathsf{all } \ x \ y}$ . We use this notation because $\leq $ is a preorder, due to $({\scriptstyle \mathrm{AX}})$ and $({\scriptstyle \mathrm{BARBARA}})$ . Please note that $\Gamma $ is left off of this notation.

The first canonical model of a theory $\Gamma $

Let $\Gamma $ be a theory, let ${\mathbb {T}}$ be a set of terms (closed under subterms as usual), and let M be the set of unordered pairs $\{t,u\}$ of terms from ${\mathbb {T}}$ . This includes singletons $\{t\} = \{t,t\}$ . We define a model ${\cal M}(\Gamma ,{\mathbb {T}})$ with domain M by setting

(6)

$$ \begin{align} \begin{aligned} \{t,u\}\in [\![ p ]\!] &{\quad \text{iff}\quad} t \leq p \text{ or } u \leq p \\ \{t,u\}[\![ r ]\!]\{v,w \} & {\quad \text{iff}\quad} \text{for some } a\in \{t,u\} \text{ and } b\in\{v,w \},\ a\leq {r \ \mathsf{all} \ b}. \end{aligned} \end{align} $$

Lemma 2.1 (Truth Lemma). In ${\cal M}(\Gamma ,{\mathbb {T}})$ , for all terms $x\in {\mathbb {T}}$ ,

$$ \begin{align*} [\![ x ]\!] = \{ \{t,u\}\in M : t \leq x \text{ or } u \leq x \}. \end{align*} $$

Proof. By induction on x. For a noun $p\in {\mathbf {N}}$ , this is by definition of the model. For a term of the form $({r \ \mathsf{all} \ x})\in {\mathbb {T}}$ , note that $x\in {\mathbb {T}}$ , since ${\mathbb {T}}$ is closed under subterms.

Suppose that $\{t,u\}\in {[\![ {{r \ \mathsf{all} \ x}} ]\!]}$ . Since $x\leq x$ by $({\scriptstyle \mathrm{AX}})$ , the induction hypothesis implies $\{x\}\in [\![ x ]\!]$ . So $\{t,u\}[\![ r ]\!]\{x\}$ . By the definition of $[\![ r ]\!]$ , either $t\leq {r \ \mathsf{all} \ x}$ , or else $u\leq {r \ \mathsf{all} \ x}$ .

Conversely, fix $\{t,u\}\in M$ , and suppose that (without loss of generality) $t\leq {r \ \mathsf{all} \ x}$ . Let $\{v,w \}\in M$ be an element of $[\![ x ]\!]$ . By the induction hypothesis, we have (without loss of generality) $v\leq x$ . By $({\scriptstyle \mathrm{ANTI}})$ , ${r \ \mathsf{all} \ x} \leq {r \ \mathsf{all} \ v}$ . By $({\scriptstyle \mathrm{BARBARA}})$ , $t \leq {r \ \mathsf{all} \ v}$ . So $\{t,u\}[\![ r ]\!]\{v,w \}$ , and hence $\{t,u\}\in {[\![ {{r \ \mathsf{all} \ x}} ]\!]}$ . □

Lemma 2.2. If $({\mathsf{all } \ x \ y})\in \Gamma $ , then ${\cal M}(\Gamma ,{\mathbb {T}})\models {\mathsf{all } \ x \ y}$ . If x and y are any terms in ${\mathbb {T}}$ , we have ${\cal M}(\Gamma ,{\mathbb {T}})\models {\mathsf{some} \ x \ y}$ . As a consequence, if ${\mathbb {T}}$ contains all subterms of sentences in $\Gamma $ , then ${\cal M}(\Gamma ,{\mathbb {T}})\models \Gamma $ .

Proof. Suppose $({\mathsf{all } \ x \ y})\in \Gamma $ . If $\{t,u\}\in [\![ x ]\!]$ in ${\cal M}(\Gamma ,{\mathbb {T}})$ , then by Lemma 2.1 (without loss of generality), $t\leq x$ . But then $t\leq y$ by $({\scriptstyle \mathrm{BARBARA}})$ , so $\{t,u\}\in [\![ y ]\!]$ by Lemma 2.1, and ${\cal M}(\Gamma ,{\mathbb {T}})\models {\mathsf{all } \ x \ y}$ .

Now suppose $x,y\in {\mathbb {T}}$ . Since $x\leq x$ and $y\leq y$ , we have $\{x,y\}\in [\![ x ]\!]\cap [\![ y ]\!]$ by Lemma 2.1, so ${\cal M}(\Gamma ,{\mathbb {T}})\models {\mathsf{some} \ x \ y}$ .

If ${\mathbb {T}}$ contains all subterms of sentences in $\Gamma $ , then ${\cal M}(\Gamma ,{\mathbb {T}})\models {\mathsf{all } \ x \ y}$ whenever $({\mathsf{all } \ x \ y})\in \Gamma $ by the first assertion, and ${\cal M}(\Gamma ,{\mathbb {T}})\models {\mathsf{some} \ x \ y}$ whenever $({\mathsf{some} \ x \ y})\in \Gamma $ by the second assertion. □

Theorem 2.3. If $\Gamma \models {\mathsf{all } \ x \ y}$ , then $\Gamma \ {\vdash }_0\ {\mathsf{all } \ x \ y}$ . Moreover, the proof only uses the rules $({\scriptstyle \mathrm{AX}})$ , $({\scriptstyle \mathrm{BARBARA}})$ , and $({\scriptstyle \mathrm{ANTI}})$ .

Proof. Choose ${\mathbb {T}}$ so that it contains x, y, and all terms in $\Gamma $ . If $\Gamma \models {\mathsf{all } \ x \ y}$ , then ${\cal M}(\Gamma ,{\mathbb {T}}) \models {\mathsf{all } \ x \ y}$ by Lemma 2.2. Then $\{x\}\in [\![ x ]\!]\subseteq [\![ y ]\!]$ by $({\scriptstyle \mathrm{AX}})$ and Lemma 2.1. By Lemma 2.1 again, $\Gamma \ {\vdash }_0\ {\mathsf{all } \ x \ y}$ .

To see that the proof uses only the rules $({\scriptstyle \mathrm{AX}})$ , $({\scriptstyle \mathrm{BARBARA}})$ , and $({\scriptstyle \mathrm{ANTI}})$ , we just need to examine the rules in the proof system and note that no rule which produces a conclusion of the form $({\mathsf{all } \ x \ y})$ has a premise of the form $({\mathsf{some} \ a \ b})$ . □

The proof of Theorem 2.3 shows that if $\Gamma \not\vdash_0\ {\mathsf{all } \ x \ y}$ , then there is a countermodel of size $O(n^2)$ , where n is the complexity of $\Gamma \cup \{ \mathsf{all } \ x \ y \}$ . However, as noted in Lemma 2.2, ${\cal M}(\Gamma ,{\mathbb {T}})$ satisfies every sentence $({\mathsf{some} \ x \ y})$ with $x,y\in {\mathbb {T}}$ . To obtain a countermodel for sentences of this form, we will look at a submodel of ${\cal M}(\Gamma ,{\mathbb {T}})$ .

The second canonical model of a theory $\Gamma$

Let $M'$ be the set of unordered pairs $\{t,u\}$ of terms in $\mathbb {T}$ such that $\Gamma \ {\vdash }_0\ {\mathsf{some} \ t \ u}$ . Note that we allow $t = u$ . Define a model ${\cal M}'(\Gamma ,{\mathbb {T}})$ with domain $M'$ just as in (6).

The proof system ${\vdash }_0$ is not complete for sentences of the form $({\mathsf{some} \ x \ y})$ , but we can prove a partial completeness result under the following additional hypothesis on $\Gamma $ , a form of which was first introduced by McAllester and Givan [Reference McAllester and Givan2].

Definition 2.4. We say that $\Gamma $ determines existentials for ${\mathbb {T}}$ if, for all verbs $r\in {\textbf {V}}$ and all terms $x,y\in {\mathbb {T}}$ , either $\Gamma \ {\vdash }_0\ {\mathsf{some} \ x \ x}$ or $\Gamma \ {\vdash }_0\ {\mathsf{all} \ y \ {({r \ \mathsf{all} \ x})}}$ .

The intuition behind this definition is that in any model ${\cal M}$ , for any term x, either ${\cal M}\models {\mathsf{some} \ x \ x}$ , or $[\![ x ]\!] = \emptyset $ , in which case ${[\![ {{r \ \mathsf{all} \ x}} ]\!]} = M$ for any verb r, and ${\cal M} \models {\mathsf{all} \ y \ (r \ \mathsf{all} \ x)}$ for any term y.

Lemma 2.5. Suppose $\Gamma $ determines existentials for ${\mathbb {T}}$ . Then:

(1) In ${\cal M}'(\Gamma ,{\mathbb {T}})$ , for all terms $x\in {\mathbb {T}}$ , $[\![ x ]\!] = \{\{t,u\}\in M' : t\leq x \text { or } u\leq x\}$ .
(2) If $({\mathsf{all } \ x \ y})\in \Gamma $ , then ${\cal M}'(\Gamma ,{\mathbb {T}})\models {\mathsf{all } \ x \ y}$ .
(3) If $x,y\in {\mathbb {T}}$ and $({\mathsf{some} \ x \ y})\in \Gamma $ , then ${\cal M}'(\Gamma ,{\mathbb {T}})\models {\mathsf{some} \ x \ y}$ .

As a consequence, if $\Gamma $ is any theory, ${\mathbb {T}}$ contains all subterms of sentences in $\Gamma $ , and $\Gamma ^*\supseteq \Gamma $ is a theory which determines existentials for ${\mathbb {T}}$ , then ${\cal M}'(\Gamma ^*,{\mathbb {T}})\models \Gamma $ .

Proof. The proof of (1) is exactly like the proof of Lemma 2.1, with the following adjustment: If $\{t,u\}\in {[\![ {{r \ \mathsf{all} \ x}} ]\!]}$ and $\{x\}\in M'$ , the proof in Lemma 2.1 goes through as written. But if $\{x\}\notin M'$ , then $\Gamma \not\vdash_0\ {\mathsf{some} \ x \ x}$ , so $t\leq {r \ \mathsf{all} \ x}$ (and also $u\leq {r \ \mathsf{all} \ x}$ ), since $\Gamma $ determines existentials for ${\mathbb {T}}$ .

The proof of (2) is exactly as in the proof of Lemma 2.2.

The proof of (3) is also exactly as in the proof of Lemma 2.2, with the following adjustment: We need to use the fact that $({\mathsf{some} \ x \ y})\in \Gamma $ to see that $\{x,y\}\in M'$ .

Putting this together, suppose $\Gamma $ is any theory, ${\mathbb {T}}$ contains all subterms of sentences in $\Gamma $ , and $\Gamma ^*\supseteq \Gamma $ determines existentials for ${\mathbb {T}}$ . If $({\mathsf{all } \ x \ y})\in \Gamma \subseteq \Gamma ^*$ , then ${\cal M}'(\Gamma ^*,{\mathbb {T}})\models {\mathsf{all } \ x \ y}$ . And if $({\mathsf{some} \ x \ y})\in \Gamma \subseteq \Gamma ^*$ , then since $x,y\in {\mathbb {T}}$ , ${\cal M}'(\Gamma ^*,{\mathbb {T}})\models {\mathsf{some} \ x \ y}$ . □

Theorem 2.6. Suppose that ${\mathbb {T}}$ contains x, y, and all subterms of sentences in $\Gamma $ , and $\Gamma ^*\supseteq \Gamma $ is a theory which determines existentials for ${\mathbb {T}}$ . If $\Gamma \models {\mathsf{some} \ x \ y}$ , then $\Gamma^*\ {\vdash }_0\ {\mathsf{some} \ x \ y}$ .

Proof. Since $\Gamma \subseteq \Gamma ^*$ , ${\cal M}'(\Gamma ^*,{\mathbb {T}})\models \Gamma $ by Lemma 2.5, so ${\cal M}'(\Gamma ^*,{\mathbb {T}})\models {\mathsf{some} \ x \ y}$ . Suppose $\{t,u\}\in [\![ x ]\!]\cap [\![ y ]\!]$ . Since $\{t,u\}\in M'$ , $\Gamma ^*\ {\vdash }_0\ {\mathsf{some} \ t \ u}$ . And by Lemma 2.5, $\Gamma ^*\ {\vdash }_0\ {\mathsf{all} \ v\ x}$ and $\Gamma ^*\ {\vdash }_0\ {\mathsf{all} \ w \ y}$ for some $v,w\in \{t,u\}$ . By analyzing the four cases and using $({\scriptstyle \mathrm{SOME}}_{1})$ , $({\scriptstyle \mathrm{SOME}}_{2})$ , and $({\scriptstyle \mathrm{DARII}})$ , we see that $\Gamma ^*\ {\vdash }_0\ {\mathsf{some} \ x \ y}$ . □

Corollary 2.7. If ${\mathbb {T}}$ contains all subterms of sentences in $\Gamma \cup \{ {\varphi} \}$ and $\Gamma $ determines existentials for ${\mathbb {T}}$ , then $\Gamma \models \varphi $ if and only if $\Gamma \ {\vdash }_0\ \varphi $ .

Proof. The implication $\Gamma \ {\vdash }_0\ \varphi $ implies $\Gamma \models \varphi $ is just soundness of the rules in ${\vdash }_0$ . The converse follows immediately from Theorems 2.3 and 2.6, taking $\Gamma ^* = \Gamma $ in Theorem 2.6. □

We conclude this section with two proof-theoretic observations about the system ${\vdash }_0$ . If ${\vec {r}} = r_1,\dots ,r_n$ is a sequence of verbs and x is a term, we use the notation $(\vec {r}\ \mathsf{all}\ x)$ for the term $({r_1}\ \mathsf{all}\ ({r_2}\ \mathsf{all}\ (\dots (r_n\ \mathsf{all}\ x ))))$ . When ${\vec {r}}$ is the empty sequence, $(\vec {r}\ \mathsf{all}\ x) = x$ .

If $\psi = {\mathsf{all} \ u \ v}$ , we define

$$ \begin{align*} {\text{Anti}}({\vec{r}},\psi) = \begin{cases} \mathsf{all}\ (\vec{r} \ \mathsf{all} \ u)\ (\vec{r} \ \mathsf{all} \ v) & \text{if the length of } {\vec{r}} \text{ is even}\\ \mathsf{all}\ (\vec{r} \ \mathsf{all} \ v)\ (\vec{r} \ \mathsf{all} \ u) & \text{if the length of } {\vec{r}} \text{ is odd}. \end{cases} \end{align*} $$

Note that $\{ \psi \}\ {\vdash }_0\ {\text {Anti}}({\vec {r}},\psi )$ by repeated applications of $({\scriptstyle \mathrm{ANTI}})$ .

Definition 2.8. Let $\Gamma $ be a theory. A $\Gamma $ -sequence is a finite sequence of terms $t_1,\dots ,t_n$ , such that for all $1\leq i \lt n$ there is a sentence $\psi = ({\mathsf{all} \ a \ b})\in \Gamma $ and a sequence of verbs ${\vec {r}}$ such that $(\mathsf{all} \ {t_i}\ {t_{i+1}}) = {\text {Anti}}({\vec {r}},\psi _i)$ .

Lemma 2.9. $\Gamma \ {\vdash }_0\ {\mathsf{all } \ x \ y}$ if and only if there is a $\Gamma $ -sequence of terms $t_1,\dots ,t_n $ such that $x = t_1$ and $y = t_n$ .

Proof. Suppose $x = t_1,\dots ,t_n = y$ is a $\Gamma $ -sequence. If $n = 1$ , then $x = y$ , and $\Gamma \ {\vdash }_0\ {\mathsf{all } \ x \ y}$ by $({\scriptstyle \mathrm{AX}})$ . If $n>1$ , then for all $1\leq i < n$ , $\Gamma\ {\vdash }_0\ {\mathsf{all} \ {t_i} \ {t_{i+1}}}$ by repeated applications of $({\scriptstyle \mathrm{ANTI}})$ . And by repeated applications of $({\scriptstyle \mathrm{BARBARA}})$ , $\Gamma \ {\vdash }_0\ {\mathsf{all } \ x \ y}$ .

We prove the converse by induction on the height of the proof tree. In the base case, $({\mathsf{all } \ x \ y})\in \Gamma $ , and $x, y$ is a $\Gamma $ -sequence as desired.

Case 1: If the root of the proof tree is

then x is a $\Gamma $ -sequence from x to x.

Case 2: If the root of the proof tree is

then by induction we have $\Gamma $ -sequences $t_1,\dots , t_n$ and $t_1^{\prime },\dots ,t_m^{\prime }$ with $x = t_1$ , $y = t_n = t_1^{\prime }$ , and $z = t_m^{\prime }$ . Then $t_1,\dots ,t_{n-1}, t_1^{\prime },\dots ,t_m^{\prime }$ is a $\Gamma $ -sequence from x to z.

Case 3: If the root of the proof tree is

then by induction we have a $\Gamma $ -sequence $t_1,\dots , t_n$ with $x = t_1$ and $y = t_n$ . Then $(r \ \mathsf{all} \ t_n),\dots ,(r \ \mathsf{all} \ {t_1})$ is a $\Gamma $ -sequence from $({r \ \mathsf{all} \ y})$ to $({r \ \mathsf{all} \ x})$ .□

Lemma 2.10. $\Gamma \ {\vdash }_0\ {\mathsf{some} \ x \ y}$ if and only if there is a sentence $({\mathsf{some} \ {t_1} \ {t_2}})\in \Gamma $ such that $\Gamma\ {\vdash }_0\ {\mathsf{all} \ {t_i} \ x}$ and $\Gamma\ {\vdash }_0\ {\mathsf{all}\ {t_j} \ y}$ , for some $i,j\in \{1,2\}$ .

Proof. Suppose there is a sentence $({\mathsf{some} \ {t_1} \ {t_2}})\in \Gamma $ such that $\Gamma\ {\vdash }_0\ {\mathsf{all} \ t_i \ x}$ and $\Gamma\ {\vdash }_0\ {\mathsf{all} \ {t_j} \ y}$ , for some $i,j\in \{1,2\}$ . Then applying $({\scriptstyle \mathrm{SOME}}_1)$ or $({{\scriptstyle \mathrm{SOME}}_2})$ if necessary, $\Gamma \ {\vdash }_0\ {\mathsf{some} \ {t_i} \ {t_j}}\ \mathsf{some}\ t_i\ t_j.$ By two applications of $({\scriptstyle \mathrm{DARII}})$ and $({\scriptstyle \mathrm{SOME}}_2)$ , $\Gamma \ {\vdash }_0\ {\mathsf{some} \ x \ y}$ .

We prove the converse by induction on the height of the proof tree. In the base case, when $({\mathsf{some} \ x \ y})\in \Gamma $ , we have $\Gamma\ {\vdash }_0\ {\mathsf{all} \ x \ x}$ and $\Gamma\ {\vdash }_0\ {\mathsf{all} \ y \ y}$ by $({\scriptstyle \mathrm{AX}})$ .

Case 1: If the root of the proof tree is

then by induction there is a sentence $({\mathsf{some} \ {t_1} \, {t_2}})\in \Gamma $ such that $\Gamma\ {\vdash }_0\ {\mathsf{all}\ t_i \ x}$ for some $i\in \{1,2\}$ .

Case 2: If the root of the proof tree is

then by induction there is a sentence $({\mathsf{some} \ {t_1}\ {t_2}})\in \Gamma $ such that $\Gamma\ {\vdash }_{0}\ {\mathsf{all}\ {t_i}\;y}$ and $\Gamma\ {\vdash }_0\ {\mathsf{all}\ {t_j}\ x}$ , for some $i,j\in \{1,2\}$ .

Case 3: If the root of the proof tree is

then by induction there is a sentence $(\mathsf{some} \ {t_1} \ {t_2})\in \Gamma $ such that $\Gamma\ {\vdash }_0 \ {\mathsf{all} \ t_i \ x}$ and $\Gamma\ {\vdash }_0\ {\mathsf{all} \ {t_j} \ y}$ , for some $i,j\in \{1,2\}$ . But also $\Gamma \ {\vdash }_0\ {\mathsf{all } \ y \ z}$ , so $\Gamma \ {\vdash }_0\ {\mathsf{all} \ {t_j} \ z}$ by $({\scriptstyle \mathrm{BARBARA}})$ .□

2.2 No sound and complete syllogistic proof system for ${\cal L}_{2}$

In this section, we prove that there is no sound and complete syllogistic proof system for ${\cal L}_{2}$ . This suggests that we need nonsyllogistic devices like those which we shall see in coming sections of this paper. But this talk of rules being “needed” is not precise, and at the end of the day, it is not quite what we shall prove. At the same time, what we do prove is in a real way stronger than the statement above. So we need to make all of this precise.

We return to our discussion of syllogistic rules in §1.3. Every syllogistic proof system defines a provability relation between theories and sentences. In this section, we write this relation as $\Gamma\ {\vdash}^* {\varphi} $ . To be sound, we require that if $\Gamma\ {\vdash}^* {\varphi} $ , then $\Gamma \models {\varphi} $ . To be complete, we require that if $\Gamma \models {\varphi} $ , then also $\Gamma\ {\vdash}^* {\varphi} $ .

The degree k consequence relation $\models _k$ is the relation between sets $\Gamma $ and sentences $ {\varphi} $ defined as follows: $\Gamma \models _k {\varphi} $ if there is a finite tree with nodes labeled by sentences, such that each node is either a leaf and in $\Gamma $ , or else is a sentence $ {\varphi} $ with children $\psi _1$ , $\ldots $ , $\psi _j$ for some $j\leq k$ , and such that $\{\psi _1, \ldots , \psi _j \}\models {\varphi} $ . If we have a sound syllogistic proof system ${\vdash }^*$ , then since it has only finitely many rules, each with finitely many premises, there is a number k (the maximum number of premises in any rule in ${\vdash }^*$ ) such that whenever $\Gamma\ {\vdash}^* {\varphi} $ , we also have $\Gamma \models _k {\varphi} $ .

Theorem 2.11. For all n, there is a theory $\Gamma _{n+1}$ and a sentence $ {\varphi} $ such that $\Gamma _{n+1} \models {\varphi} $ , and $\Gamma _{n+1}\not \models _n {\varphi} $ . As a consequence, there is no sound and complete syllogistic proof system for ${\cal L}_{2}$ .

Note that the first statement does not refer to proof systems in any way. It is completely semantic. But it immediately implies the negative result about proof systems.

The sets $\Gamma _n$

For all n, let ${\mathbf {N}} = \{a,b\}$ , let ${\textbf {V}} = \{r_1,\dots ,r_n\}$ , and let $\Gamma _n$ be the following theory:

$$ \begin{align*} \alpha &= \mathsf{some} \ ({r_1} \ \mathsf{all} \ ({r_1}\ \mathsf{all} \ a)) \ ({r_1} \ \mathsf{all} \ (r_1 \ \mathsf{all} \ a))\\ {\varphi}_1 &= \mathsf{all} \ (r_1 \ \mathsf{all} \ b) \ (r_2 \ \mathsf{all} \ ({r_2} \ \mathsf{all} \ a)) \\ {\varphi}_2 &= \mathsf{all} \ (r_2 \ \mathsf{all} \ b) \ (r_3 \ \mathsf{all} \ ({r_3}\ \mathsf{all} \ a))\\ &\quad \vdots \\ {\varphi}_i &= \mathsf{all} \ (r_i \ \mathsf{all} \ b) \ (r_{i+1} \ \mathsf{all } \ (r_{i+1} \ \mathsf{all} \ a) )\\ &\quad \vdots \\ {\varphi}_{n-1} &= \mathsf{all} \ (r_{n-1} \ \mathsf{all} \ b) \ ({r_n} \ \mathsf{all} \ (r_n \ \mathsf{all} \ a) )\\ \omega &= \mathsf{all} \ (r_n \ \mathsf{all} \ b) \ a. \end{align*} $$

We will use $\Gamma _n$ as a recurring example in the forthcoming sections, to demonstrate proof systems.

If r is a verb, then an r-king in a model ${\cal M}$ is an element $x\in M$ such that for all $y\in M$ , $x[\![ r ]\!] y$ .

Lemma 2.12. $\Gamma _n\models {\mathsf{some} \ a \ a}$ .

Proof. Let ${\cal M}\models \Gamma _n$ . If $[\![ a ]\!] \neq \emptyset $ , we are done. So we shall assume that $[\![ a ]\!] = \emptyset $ . Then for any verb r, $[\![ {{r \ \mathsf{all} \ a}} ]\!] = M$ . By $\alpha $ , ${\cal M}$ contains an $r_1$ -king x. Then $ {\varphi} _1$ implies that x is also an $r_2$ -king. Continuing by induction, $ {\varphi} _i$ implies that x is an $r_{i+1}$ -king. Finally, x is an $r_n$ -king, and $\omega $ implies that $x\in [\![ a ]\!]$ , which is a contradiction. □

Lemma 2.13. If $\Gamma _n\models {\mathsf{all} \ u \ v}$ , then either $u = v$ , or $({\mathsf{all} \ u \ v}) = {\text {Anti}}({\vec {r}},\psi )$ , for some sequence ${\vec {r}}$ and some sentence $\psi \in \Gamma _n$ .

Proof. By Theorem 2.3, $\Gamma _n\ {\vdash }_0\ {\mathsf{all} \ u \ v}$ , and by Lemma 2.9, there is a $\Gamma _n$ -sequence $u = t_1,\dots ,t_m = v$ of length m. If $m =1$ , then $u = v$ and we are done. If $m = 2$ , then $({\mathsf{all} \ u \ v}) = {\text {Anti}}({\vec {r}},\psi )$ for some sequence ${\vec {r}}$ and some $\psi \in \Gamma _n$ ; we are again done.

It remains to prove a contradiction from $m \geq 3$ . Suppose that we have $\psi , \theta \in \Gamma _n$ and sequences of verbs ${\vec {r}}$ and ${\vec {s}}$ such that $(\mathsf{all} \ {t_1} \ {t_2}) = {\text {Anti}}({\vec {r}},\psi )$ and $(\mathsf{all} \ {t_2} \ t_3) = {\text {Anti}}({\vec {s}},\theta )$ . Notice that $t_2$ must contain either a or b. Assume that $t_2$ contains a. The argument when $t_2$ contains b is similar. Let m be the number of verbs in $t_2$ , the second term of ${\text {Anti}}({\vec {r}},\psi )$ . Since it is a rather than b which occurs in the second term of ${\text {Anti}}({\vec {r}},\psi )$ , m is even. We see this by examining the sentences in $\Gamma _n$ . And let n be the number of verbs in $t_2$ , the first term of ${\text {Anti}}({\vec {s}},\theta )$ . This time, we see that n is odd. But $m = n$ , since the second term of ${\text {Anti}}({\vec {r}},\psi )$ is the first term of ${\text {Anti}}({\vec {s}},\theta )$ . This is a contradiction. □

The sets $\Delta _{n,i}$

For all n and all $1 \leq i \leq n$ , let the theory $\Delta _{n,i}$ be given by

$$ \begin{align*} \Delta_{n,i} = \left\{\!\! \begin{array}{ll} (\Gamma_n\setminus \{ {\varphi}_{i}\}) & \text{if } 1 \leq i < n \\ (\Gamma_n\setminus \omega) & \text{if } i = n. \end{array}\right. \end{align*} $$

Lemma 2.14. Suppose that for some $1\leq i\leq n$ and some terms u and v,

$$ \begin{align*} \Delta_{n,i}\models \mathsf{some} \ u \ v. \end{align*} $$

Then $u = v = (r_1 \ \mathsf{ all } \ (r_1 \ \mathsf{ all } \ a ))$ , so that $({\mathsf{some} \ u \ v}) = \alpha $ .

Proof. It suffices to show that for every term $t\neq (r_1 \ \mathsf{all} \ (r_1\ \mathsf{all} \ a))$ , there is a model of $\Delta _{n,i}$ in which $[\![ t ]\!] = \emptyset $ . Indeed, then $\Delta _{n,i}\not \models {\mathsf{some} \ u \ v}$ when $u = t$ or $v = t$ .

We proceed by cases. In every case except $t = a$ , we actually obtain a model of $\Gamma _n$ in which $[\![ t ]\!] = \emptyset $ . Since the models ${\cal M}_4$ of $\Gamma _n$ constructed in Case 4 have ${\cal M}_4\not \models \mathsf{some} \ a \ (r_1 \ \mathsf{all} \ (r_1 \ \mathsf{all} \ a ))$ , this implies the additional result that if $\Gamma _n\models {\mathsf{some} \ u \ v}$ , then $u = v = (r_1 \ \mathsf{all} \ (r_1 \ \mathsf{all} \ a))$ or $u = v = a$ .

Case 1: $t = (\vec {s} \ \mathsf{all} \ b)$ or $t = (\vec {s} \ \mathsf{all} \ a)$ , where the length of ${\vec {s}}$ is odd. Let $M_1 = \{*\}$ , and define the model ${\cal M}_1$ with domain $M_1$ by $[\![ a ]\!] = [\![ b ]\!] = \{*\}$ , and ${[\![ r_i ]\!]} = \emptyset $ for all i. In ${\cal M}_1$ , we have

$$ \begin{align*} [\![ \vec{s} \ \mathsf{all} \ a ]\!] = [\![\vec{s} \ \mathsf{all} \ b ]\!] = \begin{cases} \{*\} & \text{if the length of }\ {\vec{s}} \text{ is even}\\ \emptyset & \text{if the length of }\ {\vec{s}} \ \text{ is odd}, \end{cases} \end{align*} $$

so ${\cal M}_1\models \Gamma _n$ .

Case 2: $t = (\vec {s} \ \mathsf{all} \ b)$ , where the length of ${\vec {s}}$ is even. Define ${\cal M}_2$ in the same way as ${\cal M}_1$ , but with $[\![ b ]\!] = \emptyset $ . This time, we have

$$ \begin{align*} [\![\vec{s} \ \mathsf{all} \ a ]\!] &= \begin{cases} \{ * \} & \text{if the length of } \vec{s} \text{ is even}\\ \emptyset & \text{if the length of } \vec{s} \text{ is odd}\end{cases}\\ [\![\vec{s} \ \mathsf{all} \ b]\!] &= \begin{cases} \emptyset & \text{if the length of } \vec{s} \text{ is even}\\ \{* \} & \text{if the length of } {\vec{s}} \text{ is odd},\end{cases} \end{align*} $$

so again ${\cal M}_2\models \Gamma _n$ .

Case 3: $t = a$ . Fix $1\leq i\leq n$ , let $M_3 = \{*\}$ , and define the model ${\cal M}_3(i)$ with domain $M_3$ by $[\![ a ]\!] = \emptyset $ , $[\![ b ]\!] = \{*\}$ , and

$$ \begin{align*} [\![ r_j ]\!] =\begin{cases}\{(*,*)\} & \text{for }j\leq i\\ \emptyset & \text{for }j>i \end{cases} \end{align*} $$

In ${\cal M}_3(i)$ , we have

$$ \begin{align*} [\![r_j \ \mathsf{all} \ a]\!] = [\![ b ]\!] &= \{* \}\\ {[\![r_j \ \mathsf{all} \ (r_j \ \mathsf{all} \ a) ]\!]} = [\![r_j \ \mathsf{all} \ b ]\!] & = \begin{cases} \{* \} & \text{if } j\leq i\\ \emptyset & \text{if } j> i, \end{cases} \end{align*} $$

so ${\cal M}_3(i)\models \Delta _{n,i}$ .

Case 4: $t = (\vec {s} \ \mathsf{all} \ a)$ , where ${\vec {s}} = (s_1,\dots ,s_k)$ , k is even and nonzero, and ${\vec {s}}\neq (r_1,r_1)$ . Let $M_4 = \{w,x,y,z\}$ , and define a model ${\cal M}_4$ with domain $M_4$ by $[\![ a ]\!] = \{w\}$ and $[\![ b ]\!] = \{w,x,y,z\}$ . To define the verb interpretations $[\![ r_i ]\!]$ , we break into subcases.

Subcase 4a: $s_k\neq r_1$ . Set $x[\![ r_1 ]\!]w$ , $y[\![ r_1 ]\!] x$ , $z{[\![ {s_k} ]\!]}w$ , and no other instances of verbs:

Subcase 4b: $s_k = r_1$ and $s_{k-1}\neq r_1$ . Set $x[\![ r_1 ]\!]w$ , $y[\![ r_1 ]\!] x$ , and no other instances of verbs.

Subcase 4c: $s_k = r_1$ , $s_{k-1} = r_1$ , and $k>2$ . Set $x[\![ r_1 ]\!]w$ , $y[\![ r_1 ]\!] x$ , $z[\![ s_{k-2} ]\!] y$ , and no other instances of verbs.

In all three subcases, ${[\![ r_1 \ \mathsf{all} \ a]\!]} = \{x\}$ and ${[\![ r_1 \ \mathsf{all} \ (r_1 \ \mathsf{all} \ a) ]\!]} = \{y\}$ , so ${\cal M}_4\models \alpha $ . And for all $1\leq i\leq n$ , ${[\![ r_i \ \mathsf{all} \ b]\!]} = \emptyset $ , so ${\cal M}_4 \models \Gamma _n$ . It remains to show that ${[\![ \vec {s} \ \mathsf{all} \ a ]\!]} = \emptyset $ . We introduce some notation: write ${\vec {s}}_{\geq j}$ for the sequence $(s_j,\dots ,s_k)$ .

In subcase 4a, ${[\![s_k \ \mathsf{all } \ a]\!]} = \{z\}$ , and ${[\![s_{\geq k-1} \ \mathsf{all} \ a]\!]} = \emptyset $ . Since k is even we have

$$ \begin{align*} {[\![ \vec{s}_{\geq j} \ \mathsf{all} \ a]\!]} = \begin{cases} \emptyset & \text{if } j \ \text{ is odd} \\ M_4 & \text{if } j \text{ is even}, \end{cases} \end{align*} $$

and ${[\![ \vec {s} \ \mathsf{all} \ a]\!]} = {[\![\vec {s}_{\geq 1}\ \mathsf{all } \ a]\!]} = \emptyset $ .

In subcase 4b, $[\![ s_k \ \mathsf{all} \ a]\!] = \{x\}$ , and ${[\![ s_{\geq k-1} \ \mathsf{all} \ a]\!]} = \emptyset $ . As in subcase 4a, ${[\![ \vec {s} \ \mathsf{all} \ a ]\!]} = \emptyset $ .

In subcase 4c, ${[\![ \vec {s}_{\geq k-2} \ \mathsf{all} \ a]\!]} = \{z\}$ . So ${[\![ \vec {s}_{\geq k-3} \ \mathsf{all} \ a]\!]} = \emptyset $ . And then since k is even and $k\geq 4$ , ${[\![\vec {s}_{\geq 1} \ \mathsf{all } \ a]\!]} = \emptyset $ by the same argument which we have seen above.

This completes the proof. □

Lemma 2.15. For any natural number k, and any $n\geq k+1$ , if $\Gamma _n\models _k {\mathsf{some} \ u \ v}$ , then $u = v = (r_1 \ \mathsf{all} \ (r_1 \ \mathsf{all} \ a))$ , so that $({\mathsf{some} \ u \ v}) = \alpha $ .

Proof. By induction on the depth of the tree witnessing $\Gamma _n\models _k {\mathsf{some} \ u \ v}$ . In the base case, $({\mathsf{some} \ u \ v})$ is a leaf in $\Gamma _n$ . Since $\alpha $ is the only sentence of the form $({\mathsf{some} \ u \ v})$ in $\Gamma _n$ , we are done.

Now suppose the root of the tree is $({\mathsf{some} \ u \ v})$ with children $\{\psi _1, \dots , \psi _j\}$ , where $j\leq k$ , $\Gamma _n\models _k \psi _i$ for all $1\leq i\leq j$ , and $\{\psi _1, \dots , \psi _j\}\models {\mathsf{some} \ u \ v}$ .

We claim that for all $\psi _i$ , there is a single sentence $\chi _i\in \Gamma _n$ such that $\{\chi _{i}\}\models \psi _i$ . By induction, if $\psi _i$ has the form $({\mathsf{some} \ x \ y})$ , then $\psi _i = \alpha $ , and we can take $\chi _i = \alpha $ . On the other hand, if $\psi _i$ has the form $({\mathsf{all } \ x \ y})$ , then since $\Gamma _n\models \psi _i$ , by Lemma 2.13 either $\psi _i = ({\mathsf{all} \ x \ x})$ and we can take $\chi _i$ to be any sentence in $\Gamma _n$ , or $\psi _i = {\text {Anti}}({\vec {r}}_i,\chi _i)$ for some $\chi _i\in \Gamma _n$ , and $\{\chi _{i}\}\models \psi _i$ .

By the claim, $\{\chi _k :k \leq j\} \models {\mathsf{some} \ u \ v}$ . And since $j\leq k \leq n-1$ , there is some $1\leq i^*\leq n$ such that $\{\chi _k :k \leq j\} \subseteq \Delta _{n,i^*}$ . For this $i^*$ , we see that $\Delta _{n,i^*}\models \psi _k$ for all $k\leq j$ , so $\Delta _{n,i^*}\models {\mathsf{some} \ u \ v}$ . Our result follows from Lemma 2.14. □

Proof of Theorem 2.11. By Lemma 2.12, $\Gamma _{n+1}\models {\mathsf{some} \ a \ a }$ . And by Lemma 2.15, $\Gamma _{n+1}\not \models _n {\mathsf{some} \ a \ a }$ .

Suppose that ${\vdash }^*$ is a sound and complete syllogistic proof system for ${\cal L}_{2}$ . Let n be the maximum number of premises in any rule in ${\vdash }^*$ . Then since $\Gamma _{n+1}\models {\mathsf{some} \ a \ a }$ , we have $\Gamma _{n+1}\ {\vdash }^*\ {\mathsf{some} \ a \ a }$ by completeness, and $\Gamma _{n+1}\models _n {\mathsf{some} \ a \ a }$ by soundness. This is a contradiction. □

We have shown that the full semantic entailment relation $\models $ for ${\cal L}_{2}$ is not $\models _n$ for any n. This contrasts with logics like the one for ${{\cal L}_{1}}$ ; for that, $\models $ coincides with $\models _2$ , since there is a sound and complete syllogistic proof system in which every rule has at most two premises.

Remark. Our work in this section used the fact that our set ${\textbf {V}}$ of verbs can be arbitrarily large. It is open whether the negative result holds when ${\textbf {V}}$ is a fixed finite set.

2.3 Completeness using the ( cases ) rule

We have just seen that ${\cal L}_{2}$ has no logical system which is syllogistic, sound, and complete. The rest of this section rectifies this, in three different ways.

In this section, we add a single rule. It is called (cases), and while it is not syllogistic as we defined the term in §1.3, it is simple and natural. Here is a statement of it.

(7)

The nonsyllogistic feature is that premises are withdrawn in derivations.Footnote ¹ Let us explain how the (cases) rule is used. To prove $ {\varphi} $ from a set $\Gamma $ , it is sufficient to take a term x, prove $ {\varphi} $ from $\Gamma \cup \{ \mathsf{some} \ x \ x\}$ , and also prove $ {\varphi} $ from $\Gamma \cup \{\mathsf{all} \ y \ (r \ \mathsf{all} \ x)\}$ . Here y can be any term and r can be any verb.

In the logic itself, we take two derivations of $ {\varphi} $ , and then in one we withdraw a sentence $({\mathsf{some} \ x \ x})$ , while in the other we withdraw a sentence of the form $(\mathsf{all} \ y \ (r \ \mathsf{all} \ x))$ (for this same term x). We may withdraw zero occurrences or more than one. The overall conclusion is $ {\varphi} $ .

In this subsection, we write ${\vdash }$ for provability in the ${\vdash }_0$ system from Figure 2.1, together with (cases).

Lemma 2.16 (Soundness). If $\Gamma\ {\vdash}\ {\varphi} $ , then $\Gamma \models {\varphi} $ .

Proof. By induction on the number n of uses of (cases) in derivations. For $n = 0$ , this is just soundness of ${\vdash }_0$ . Assume our result for n, and let $\Gamma\ {\vdash}\ {\varphi} $ via a derivation with $n+1$ uses of (cases). We may assume that the last use of (cases) is at the root of the proof tree. So we have

(1) $\Gamma \cup \{ \mathsf{some} \ x \ x \}\ {\vdash }\ {\varphi} $
(2) $\Gamma \cup \{\mathsf{all} \ y \ (r \ \mathsf{all} \ x)\}\ {\vdash }\ {\varphi} ,$

where both derivations have at most n uses of $({{\scriptstyle \mathrm{CASES}}})$ . By our induction hypotheses, (1) and (2) hold when ${\vdash }$ is replaced by $\models $ . Let ${\cal M}\models \Gamma $ . Then we have two cases. If $[\![ x ]\!] \neq \emptyset $ , then ${\cal M}\models {\mathsf{some} \ x \ x}$ , so ${\cal M}\models {\varphi} $ . And when $[\![ x ]\!] = \emptyset $ , we have ${[\![ {{r \ \mathsf{all} \ x}} ]\!]} = M$ . So ${\cal M}\models \mathsf{all} \ y \ ({r \ \mathsf{all} \ x})$ , and thus ${\cal M}\models {\varphi} $ .□

Example 2.17. Here is a sample derivation.

$$ \begin{align*} \{\mathsf{some} \ c \ d, \mathsf{all} \ a \ x, \mathsf{all} \ a \ y, \mathsf{all} \ (r \ \mathsf{all} \ a) \ x, \mathsf{all} \ (r \ \mathsf{all} \ a) \ y\}\ {\vdash}\ \mathsf{some} \ x \ y. \end{align*} $$

Let $\Gamma $ be the theory on the left. We show that

1. $\Gamma \cup \{\mathsf{some} \ a \ a \} \ {\vdash }\ {\mathsf{some} \ x \ y}.$
2. $\Gamma \cup \{\mathsf{all} \ {c} \ ({r \ \mathsf{all} \ a})\}\ {\vdash }\ {\mathsf{some} \ x \ y}.$

The first is easy from $(\mathsf{all} \ a \ x)$ and $(\mathsf{all} \ a \ y)$ . The second comes from

(Here ( barb ) abbreviates ( ${\scriptstyle \mathrm{BARBARA}}$ ).) Note that the premise $(\mathsf{all} \ {c} \ (r \ \mathsf{all} \ a))$ was used twice.

Example 2.18. We show that $\Gamma _n\ {\vdash }\ {\mathsf{some} \ a \ a }$ , where $\Gamma _n$ is the theory in §2.2.

First, note that $\Delta \cup \{{\mathsf{some} \ a \ a }\}\ {\vdash}\ {\mathsf{some} \ a \ a }$ for any theory $\Delta $ . So by n applications of $({{\scriptstyle \mathrm{CASES}}})$ , to show that $\Gamma _n\ {\vdash}\ {\mathsf{some} \ a \ a }$ , it suffices to show that $\Gamma _n\cup \{\mathsf{all} \ b \ (r_i \ \mathsf{all}\ a): 1\leq i\leq n\}\ {\vdash}\ {\mathsf{some} \ a \ a }$ . Let $\Gamma _n^*$ be the theory on the left.

Let $\psi _{i} = \mathsf{all} \ (r_i \ \mathsf{all} \ (r_i \ \mathsf{all} \ a)) \ (r_i \ \mathsf{all} \ b)$ . By $({\scriptstyle \mathrm{ANTI}})$ , $\Gamma _{n}^{*}\ {\vdash}\ \psi _i$ for all $1\leq i\leq n$ . Repeatedly applying $({\scriptstyle \mathrm{BARBARA}})$ to the sequence $\psi _1, {\varphi} _1,\psi _2, {\varphi} _2,\dots , \psi _n,\omega $ , we find that $\Gamma _{n}^{*}\ {\vdash}\ \mathsf{all} \ (r_1 \ \mathsf{all} \ (r_1 \ \mathsf{all} \ a)) \ a$ . So by $\alpha $ , $({\scriptstyle \mathrm{DARII}})$ , and $({{\scriptstyle \mathrm{SOME}}_{2}})$ , $\Gamma _{n}^{*} \ {\vdash}\ {\mathsf{some} \ a \ a }$ .

Theorem 2.19 (Completeness). If $\Gamma \models {\varphi} $ , then $\Gamma \ {\vdash} \ {\varphi}. $

Proof. By Theorem 2.3, if $\Gamma \models {\mathsf{all } \ x \ y}$ , then already $\Gamma \ {\vdash }_0\ {\mathsf{all } \ x \ y}$ . So we may assume that $\varphi $ has the form $({\mathsf{some} \ x \ y})$ . We prove the contrapositive, so assume $\Gamma \not\vdash \varphi $ . Let ${\mathbb {T}}$ be a set of terms, closed under subterms, which contains x, y, and all subterms of sentences in $\Gamma $ . By Zorn’s LemmaFootnote ² , let $\Gamma ^*\supseteq \Gamma $ be a maximal extension, such that $\Gamma ^*\not\vdash \varphi $ . The sentences in $\Gamma ^*$ may contain any terms in the language.

Assume for contradiction that $\Gamma ^*$ does not determine existentials for ${\mathbb {T}}$ . Then there are terms $x,y\in {\mathbb {T}}$ and a verb $r\in {\textbf {V}}$ such that $\Gamma ^*\not\vdash_0\ {\mathsf{some} \ x \ x}$ and $\Gamma ^*\not\vdash_0\ \mathsf{all} \ {y} \ (r \ \mathsf{all} \ x)$ . In particular, $\Gamma ^*$ does not contain either of these sentences. By maximality, we have $\Gamma ^*\cup \{{\mathsf{some} \ x \ x}\}\ {\vdash}\ \varphi $ and $\Gamma ^* \cup \{\mathsf{all} \ y \ (r \ \mathsf{all} \ x)\} \ {\vdash}\ \varphi $ . By $({{\scriptstyle \mathrm{CASES}}})$ , $\Gamma ^*\ {\vdash}\ \varphi $ , contradiction. Thus $\Gamma ^*$ determines existentials for ${\mathbb {T}}$ .

Now since $\Gamma ^*\not\vdash \varphi $ , we clearly have $\Gamma ^*\not\vdash _0\varphi $ . So by Theorem 2.6, $\Gamma \not \models \varphi $ . This is what was to be shown. □

The proofs of Theorem 2.3 and Theorem 2.6 show that if $\Gamma \not\vdash \varphi $ , then either ${\cal M}(\Gamma ,{\mathbb {T}})$ or ${\cal M}'(\Gamma ^*,{\mathbb {T}})$ are countermodels, depending on the form of $\varphi $ . Both of these models have size $O(n^2)$ , where n is the complexity of $\Gamma \cup \{\varphi \}$ .

Remark. Since ${\vdash }_0$ is already complete for sentences of the form $({\mathsf{all} \ a \ b})$ , we only need to use ( cases ) in proofs of sentences of the form $({\mathsf{some} \ a \ b})$ . Using Lemma 2.10, it is possible to show that ( cases ) is equivalent over ${\vdash }_0$ to the following rule:

So the system with rules $({\scriptstyle \mathrm{AX}})$ , $({\scriptstyle \mathrm{BARBARA}})$ , $({\scriptstyle \mathrm{ANTI}})$ , ( ${\scriptstyle \mathrm{SOME}}_1$ ), ( ${\scriptstyle \mathrm{SOME}}_2$ ), (darii), and ( ${{\scriptstyle \mathrm{CASES}}}^*$ ) is also sound and complete for ${\cal L}_{2}$ . We chose to emphasize ( cases ) rather than $({{\scriptstyle \mathrm{CASES}}}^*)$ because it made the completeness proof quicker, and because we will use ( cases ) again in §3.2.

2.4 Completeness using the ( chains ) schema

In Theorem 2.11, we proved that there are no syllogistic proof systems for ${\cal L}_{2}$ which are sound and complete. We have just seen that ${\cal L}_{2}$ has a sound and complete proof system, but one which is not syllogistic in our sense. In this section, we give another proof system, which this time makes use of a schema of rules with arbitrarily long (but finite) premise sets.

Definition 2.20. Let a and b be nouns. A chain linking a to b is a sequence C of sentences

$$ \begin{align*} C = (\mathsf{all} \ {a} \ {u_1}, \mathsf{all} \ {v_1} \ {u_2},\dots,\mathsf{all} \ {v_i} \ u_{i+1},\dots,\mathsf{all} \ {v_m} \ b ), \end{align*} $$

such that for all $1\leq i \leq m$ , either

1. $u_i = (\vec {r} \ \mathsf{all} \ {z_i})$ and $v_{i} = (\vec {r}\ \mathsf{all} \ (r\ \mathsf{all} \ t_i))$ , where ${\vec {r}}$ is a sequence of even length, or
2. $u_i = (\vec {r} \ \mathsf{all} \ (r\ \mathsf{all}\ {t_i}))$ and $v_{i} = (\vec {r} \ \mathsf{all} \ {z_i})$ , where ${\vec {r}}$ is a sequence of odd length.

We say that this chain has length $(m+1)$ , and the terms $t_1,\dots ,t_m$ are the missing link terms in C.

Note that a chain of length $1$ linking a to b is just the single sentence $({\mathsf{all} \ a \ b})$ and has no missing link terms. Here are two chains of length $2$ linking a to b:

$$ \begin{align*} &(\mathsf{all} \ a \ z, \mathsf{all} \ (r \ \mathsf{all} \ t) \ b)\\ &(\mathsf{all} \ a \ (s\ \mathsf{all} \ (r \ \mathsf{all} \ t) ), \mathsf{all} \ (s\ \mathsf{all} \ z) \ b). \end{align*} $$

In both of these chains, t is the missing link term.

Returning to the definition, we emphasize that the terms denoted $z_1,\dots ,z_m$ may be arbitrary (they need not be nouns) and are not missing link terms. The sequence ${\vec {r}}$ is also arbitrary.

Lemma 2.21. Suppose $C = (\mathsf{all} \ a \ {u_1}, \mathsf{all} \ {v_1} \ {u_2},\dots ,\mathsf{all} \ {v_m} \ b)$ is a chain linking a to b, ${\cal M}\models C$ , and $[\![ t ]\!] = \emptyset $ for every missing link term t in C. Then $[\![ a ]\!]\subseteq [\![ b ]\!]$ .

Proof. Since ${\cal M}$ satisfies all the sentences in C, $[\![ a ]\!]\subseteq {[\![ u_1 ]\!]}$ , ${[\![ v_m ]\!]}\subseteq [\![ b ]\!]$ , and ${[\![ v_i ]\!]}\subseteq {[\![ {u_{i+1}} ]\!]}$ for all i. So it suffices to show that $[\![ u_i ]\!]\subseteq {[\![ {v_i} ]\!]}$ for all i.

Let $t_i$ be the missing link term for $u_i$ and $v_i$ . Since ${[\![ {t_i} ]\!]} = \emptyset $ , ${[\![r \ \mathsf{all} \ t_i]\!]} = {\cal M}$ . So ${[\![ {z_i} ]\!]}\subseteq [\![ r \ \mathsf{all} \ t_i]\!]$ for any term $z_i$ . This is the desired inclusion when ${\vec {r}}$ is the empty sequence. The result then follows by induction on the length of ${\vec {r}}$ , using the fact that if $[\![ x ]\!]\subseteq [\![ y ]\!]$ , then ${[\![ {{r \ \mathsf{all} \ y}} ]\!]}\subseteq {[\![ {{r \ \mathsf{all} \ x}} ]\!]}$ . □

Definition 2.22. Let x and y be terms. An $(x,y)$ chain system is a sequence of chains $C_1,\dots ,C_l$ such that for every missing link term t in every chain $C_n$ , there exist $m,m'<n$ such that $C_m$ links t to x and $C_{m^{\prime }}$ links t to y. When $x = y$ , we may take $m = m'$ .

If chains $C_m$ and $C_{m'}$ link t to x and y, respectively, we can think of these chains as witnessing that we are allowed to use t as a missing link term later in the $(x,y)$ chain system. Of course, the first chain in an $(x,y)$ chain system must have length $1$ , since there are no available missing link terms from previous chains.

Lemma 2.23. Let $C_1,\dots ,C_l$ be an $(x,y)$ chain system. Suppose ${\cal M}\models C_i$ for all i, and suppose that $[\![ x ]\!]\cap [\![ y ]\!] = \emptyset $ in ${\cal M}$ . Then $[\![ a ]\!]\subseteq {[\![ {b} ]\!]}$ whenever some $C_i$ links a to b.

Proof. By induction on l. When $l = 0$ , there are no chains, so the conclusion is vacuously satisfied. Now suppose $C_1,\dots ,C_{l+1}$ is an $(x,y)$ chain system. By induction, the conclusion holds for the $(x,y)$ chain system $C_1,\dots ,C_l$ . So suppose $C_{l+1}$ links a to b. For any missing link term t in $C_{l+1}$ , there are chains $C_i$ and $C_j$ with $i,j\leq l$ , linking t to x and t to y. So $[\![ t ]\!]\subseteq [\![ x ]\!]\cap [\![ y ]\!] = \emptyset $ . By Lemma 2.21, $[\![ a ]\!]\subseteq [\![ b ]\!]$ , as desired. □

We introduce the new rule schema

where $C_1,\dots ,C_l$ is an $(x,y)$ chain system, some $C_m$ links a to x, and some $C_n$ links b to y.

In this section, we write ${\vdash }$ for the proof system ${\vdash }_0$ augmented by the rule schema (chains).

Theorem 2.24. The $({{\scriptstyle \mathrm{CHAINS}}})$ schema is sound.

Proof. Let $({\mathsf{some} \ a \ b}), C_1,\dots ,C_l$ be the premises of an instance of (chains), and suppose that ${\cal M}$ satisfies these premises. Suppose towards a contradiction that $[\![ x ]\!] \cap [\![ y ]\!] = \emptyset $ , so Lemma 2.23 applies. Since some $C_m$ links a to x and some $C_n$ links b to y, we use Lemma 2.23 to see that $[\![ a ]\!]\cap [\![ b ]\!]\subseteq [\![ x ]\!]\cap [\![ y ]\!] = \emptyset $ . But this contradicts the assumption that ${\cal M}\models {\mathsf{some} \ a \ b}$ . □

Example 2.25. We show that $\Gamma _n\ {\vdash}\ {\mathsf{some} \ a \ a }$ , where $\Gamma _n$ is the theory in §2.2. We will find an $(a,a)$ chain system which contains a chain linking $(r_1 \ \mathsf{all} \ (r_1 \ \mathsf{all} \ a))$ to a.

$C_1 = ({\mathsf{all} \ a \ a})$ is a chain of length $1$ linking a to a. This allows a to be used as a missing link term in $C_2$ . Let $\beta $ be the sentence

$$ \begin{align*} \mathsf{all} \ (r_1\ \mathsf{all} \ (r_1\ \mathsf{all} \ a)) \ (r_1\ \mathsf{all} \ (r_1 \ {\mathsf{ all}} \ a)). \end{align*} $$

Then

$$ \begin{align*} C_2 = (\beta, {\varphi}_1, \ldots, {\varphi}_{n-1}, \omega) \end{align*} $$

is a chain linking $(r_1 \ \mathsf{all} \ (r_1 \ \mathsf{all} \ a))$ to a, in which the only missing link term is a.

Let’s check that $C_2$ is a chain. First, $u_1 = (r_1 \ \mathsf{all} \ (r_1 \ \mathsf{all} \ a))$ and $v_1 = (r_1 \ \mathsf{all} \ c )$ , so this is alternative $2$ in Definition 2.20, with $t = a$ , $z = c$ , $r = r_1$ , and ${\vec {r}} = r_1$ . All of the rest of the links from $u_i$ to $v_i$ are justified in the same way.

Then we have a derivation:

This shows that $\Gamma _n\ {\vdash}\ {\mathsf{some} \ a \ a }$ , because $\alpha \in \Gamma _n$ , and each sentence in $C_1, C_2$ is either $\beta $ , which is an instance of $({\scriptstyle \mathrm{AX}})$ , or an element of $\Gamma _n$ .

Theorem 2.26 (Completeness). If $\Gamma \models {\varphi} $ , then $\Gamma \ {\vdash}\ {\varphi} $ .

For any term $t\in {\mathbb {T}}$ , let $\Delta _t = \{\mathsf{all} \ z \ (r \ \mathsf{all} \ t): z \in \mathbb {T}, r \in {\textbf {V}}\}$ . In order to extend $\Gamma $ to a theory which determines existentials for ${\mathbb {T}}$ , we define an increasing sequence of theories by induction. Set $\Gamma _0 = \Gamma $ , and given $\Gamma _n$ , define

$$ \begin{align*} \Gamma_{n+1} = \Gamma_n\cup \{\mathsf{all} \ z \ (r \ \mathsf{all} \ t) : t,z\in {\mathbb{T}}, r\in {\textbf{V}}, \text{and }\Gamma_n\cup \{\mathsf{some} \ t \ t\}\ {\vdash}_0\ \varphi\}. \end{align*} $$

So $\Gamma _{n+1}$ includes $\Delta _t$ for all terms $t\in {\mathbb {T}}$ such that $\Gamma _n\cup \{{\mathsf{some} \ t \ t}\}\ {\vdash }_0\ \varphi $ . Let $\Gamma _{\omega } = \bigcup _{n\in \omega } \Gamma _n$ , and let

$$ \begin{align*} \Gamma^* = \Gamma_{\omega}\cup \{{\mathsf{some} \ t \ t} : t\in {\mathbb{T}}\text{ and }\Delta_t\not\subseteq \Gamma_{\omega}\}. \end{align*} $$

Claim 2.27. There is some n such that $\Gamma _n\ {\vdash }_0\ \varphi $ . □

Proof. $\Gamma ^*$ determines existentials for ${\mathbb {T}}$ , and by Theorem 2.6, $\Gamma ^*\ {\vdash }_0\ \varphi $ . By Lemma 2.10, there is a sentence $({\mathsf{some}\ {a_1} \ {a_2}})\in \Gamma ^*$ such that $\Gamma ^*\ {\vdash }_0\ \mathsf{all} \ {a_i} \ x$ and $\Gamma ^*\ {\vdash }_0\ \mathsf{all} \ {a_j} \ y$ for some $i,j\in \{1,2\}$ . Since ${\vdash }_0$ -proofs of all-sentences are finite and never contain some-sentences, there is already some n such that $\Gamma _n\ {\vdash }_0\ \mathsf{all} \ {a_i} \ x$ and $\Gamma _n\ {\vdash }_0\ \mathsf{all} \ {a_j} \ y$ , and we have $\Gamma _n\cup \{\mathsf{some} \ a_1 \ a_2\}\ {\vdash }_0\ \varphi $ .

It remains to show that the sentence $({\mathsf{some} \ a_1 \ a_2})$ belongs to $\Gamma \subseteq \Gamma _n$ , since then $\Gamma _n\ {\vdash }_0\ \varphi $ . Suppose not. Then $({\mathsf{some} \ a_1 \ a_2})$ is not in $\Gamma _{\omega }$ , since the sets $\Gamma _n$ only add sentences of the form $(\mathsf{all} \ z \ (r \ \mathsf{all} \ t))$ to $\Gamma $ . So $({\mathsf{some} \ a_1 \ a_2})$ is a sentence $({\mathsf{some} \ t \ t})$ such that $\Delta _t\not \subseteq \Gamma _{\omega }$ . But then $\Gamma _n\cup \{{\mathsf{some} \ t \ t}\}\ {\vdash }_0\ \varphi $ , so $\Delta _t\subseteq \Gamma _{n+1}\subseteq \Gamma _{\omega }$ , which is a contradiction.□

Claim 2.28. If there is some n such that $\Gamma _n\ {\vdash }_0\ {\varphi} $ , then $\Gamma \ {\vdash}\ {\varphi} $ .

Proof. Assume $\Gamma \not\vdash {\varphi} $ . Then we will prove the following two claims for all n, by induction:

$(1)_n$ For every $k\geq 1$ and every $\Gamma _{n}$ -sequence of terms $t_1,\dots ,t_k$ , there is an $(x,y)$ chain system $C_1,\dots ,C_{\ell }$ , such that $C_{\ell }$ links $t_1$ to $t_k$ , and such that for all $1\leq i\leq \ell $ and all $\psi \in C_i$ , $\Gamma \ {\vdash}\ \psi $ .
$(2)_n$ $\Gamma _n\not \vdash_0\ {\varphi} $ .

So assume $(1)_m$ and $(2)_m$ hold for all $m<n$ . We will first prove $(1)_n$ by induction on k.

In the base case, when $k = 1$ , we have $t_1 = t_k$ , and $\Gamma \ {\vdash}\ \mathsf{all} \ {t_1} \ {t_k}$ by $({\scriptstyle \mathrm{AX}})$ . The chain $(\mathsf{all} \ {t_1} \ {t_k})$ has no missing link terms and links $t_1$ to $t_k$ . So we have the required $(x,y)$ chain system, consisting of just this one chain.

Now suppose $k> 1$ , and fix a $\Gamma _{n}$ -sequence $t_1,\dots ,t_k$ . Let $C_1,\dots ,C_{\ell }$ be the $(x,y)$ chain system obtained by induction for the $\Gamma _n$ -sequence $t_1,\dots , t_{k-1}$ . Then $C_{\ell }$ links $t_1$ to $t_{k-1}$ , so the last sentence in $C_{l}$ is $(\mathsf{all} \ {c} \ {t_{k-1}})$ for some term c. By the definition of $\Gamma _n$ -sequence, there is a sentence $(\mathsf{all} \ d \ e)\ \in \Gamma _n$ and a sequence of verbs ${\vec {r}}$ such that $(\mathsf{all} \ {t_{k-1}} \ {t_k}) = {\text {Anti}}({\vec {r}},(\mathsf{all} \ d \ e))$ .

If $(\mathsf{all} \ d \ e)\ \in \Gamma $ , then by repeated applications of $({\scriptstyle \mathrm{ANTI}})$ , $\Gamma \ {\vdash}\ \mathsf{all} \ {t_{k-1}} \ {t_k}$ . Since also $\Gamma \ {\vdash}\ \mathsf{all} \ c \ {t_{k-1}}$ by induction, we have $\Gamma \ {\vdash}\ \mathsf{all} \ c\ t_k$ by $({\scriptstyle \mathrm{BARBARA}})$ . Replacing the last sentence of $C_{\ell }$ with $(\mathsf{all} \ c \ t_k)$ , we are done.

If $(\mathsf{all} \ d \ e)\ \notin \Gamma $ , then $(\mathsf{all} \ d \ e)\in \Gamma _{m+1}\setminus \Gamma _{m}$ for some $0\leq m < n$ . It follows that $e = (r \ \mathsf{all} \ t)$ for some term t such that $\Gamma _{m}\cup \{{\mathsf{some} \ t \ t}\}\ {\vdash }_0\ {\varphi} $ . By Lemma 2.10, there is a sentence $({\mathsf{some} \ p \ q})$ in $\Gamma _{m}\cup \{{\mathsf{some} \ t \ t}\}$ such that for all $w\in \{x,y\}$ there is some $v\in \{p,q\}$ such that $\Gamma _{m}\ {\vdash }_0\ \mathsf{all} \ v \ w$ . If $({\mathsf{some} \ p \ q})\ \in \Gamma _{m}$ , then $\Gamma _{m}\ {\vdash }_0\ {\varphi} $ , contradicting $(2)_m$ . Otherwise, the sentence $({\mathsf{some} \ p \ q})$ must be $({\mathsf{some} \ t \ t})$ . That is, $p = q = t$ , so $\Gamma _{m}\ {\vdash }_0\ \mathsf{all} \ t \ x$ and $\Gamma _{m}\ {\vdash }_0\ \mathsf{all} \ t\ y$ .

By Lemma 2.9, there are $\Gamma _{m}$ -sequences linking t to x and t to y. By $(1)_m$ , there are $(x,y)$ chain systems $C^{\prime }_1,\dots ,C^{\prime }_{\ell ^{\prime }}$ and $C^{\prime \prime }_1,\dots ,C^{\prime \prime }_{\ell ^{\prime \prime }}$ such that $C^{\prime }_{\ell ^{\prime }}$ links t to x, $C^{\prime \prime }_{\ell ^{\prime \prime }}$ links t to y, and for every sentence $\psi $ in every chain, $\Gamma \ {\vdash}\ \psi $ . Let $C_{\ell }^*$ be the chain $C_{\ell }$ with the sentence $(\mathsf{all} \ {t_k} \ {t_k})$ appended. Then $C_{\ell }^*$ links $t_1$ to $t_k$ . We either have $t_{k-1} = (\vec {r}\ \mathsf{all} \ d)$ and $t_k = (\vec {r} \ \mathsf{all} \ (r \ \mathsf{all} \ t))$ where ${\vec {r}}$ has even length, or $t_{k-1} = (\vec {r} \ \mathsf{all} \ (r \ \mathsf{all} \ t))$ and $t_k = (\vec {r} \ \mathsf{all} \ d)$ where ${\vec {r}}$ has odd length, so the missing link terms in $C_{\ell }^*$ are those in $C_{\ell }$ , together with t. Also $\Gamma \ {\vdash}\ \psi $ for every sentence $\psi $ in $C^*_{\ell }$ , by our assumption about $C_{\ell }$ and $({\scriptstyle \mathrm{AX}})$ . So $C^{\prime }_1,\dots ,C^{\prime }_{\ell ^{\prime }},C^{\prime \prime }_1,\dots ,C^{\prime \prime }_{\ell ^{\prime \prime }},C_1,\dots ,C_{\ell -1},C_{\ell }^*$ is our desired $(x,y)$ chain system.

Having established $(1)_n$ , we prove $(2)_n$ . Assume for contradiction that $\Gamma _n\ {\vdash }_0\ {\varphi} $ . By Lemma 2.10, there is a sentence ( ${\mathsf{some} \ a_1 \ a_2}$ ) in $\Gamma _n$ such that $\Gamma _n\ {\vdash }_0\ \mathsf{all} \ {a_i}\ x$ and $\Gamma _n\ {\vdash }_0\ \mathsf{all} \ {a_j} \ y$ for some $i,j\in \{1,2\}$ . By Lemma 2.10, we thus have $\Gamma _n$ -sequences from $a_i$ to x and from $a_j$ to y. Apply $(1)_n$ to these sequences to obtain $(x,y)$ chain systems $C_1,\dots ,C_{\ell }$ and $C^{\prime }_1,\dots ,C^{\prime }_{\ell ^{\prime }}$ such that $C_l$ links $a_i$ to x and $C^{\prime }_{\ell ^{\prime }}$ links $a_j$ to y, and all sentences in all chains are ${\vdash }$ -provable from $\Gamma $ . Then we have an instance of chains

The sentence $({\mathsf{some} \ a_1 \ a_2})$ belongs to $\Gamma _n$ , hence to $\Gamma $ . Applying $({\scriptstyle \mathrm{SOME}}_1)$ or $({\scriptstyle \mathrm{SOME}}_2)$ as needed, $\Gamma \ {\vdash}\ {\mathsf{some} \ a_i \ a_j}$ . So this deduction is the root of a proof tree proving $({\mathsf{some} \ x \ y})$ from $\Gamma $ .□

Claims 2.27 and 2.28 complete the proof of Theorem 2.26.

2.5 Completeness and ${\scriptstyle \mathrm{PTIME}}$ decidability for the extended language ${\cal L}_{2}^+$

We have seen that ${\cal L}_{2}$ has no sound and complete syllogistic proof system. And we have seen proof systems which go beyond the “purely syllogistic” in earlier sections. But this section goes in a different direction. We show that if we enhance the syntax of ${\cal L}_{2}$ in a certain way, then we are able to find a boundedly complete syllogistic proof system for the larger language.

We add to ${\cal L}_{2}$ a new four-place sentence former

$$ \begin{align*} (\mathsf{all} \ a \ b)\lor (\mathsf{some} \ x \ y) \end{align*} $$

with the evident semantics

$$ \begin{align*} {\cal M}\models ({\mathsf{all} \ a \ b})\lor (\mathsf{some} \ x \ y) {\quad \text{iff}\quad} [\![ a ]\!]\subseteq [\![ b ]\!] \text{ or } [\![ x ]\!] \cap [\![ y ]\!] \neq \emptyset. \end{align*} $$

We call the larger language ${\cal L}_{2}^+$ . Note that we do not allow the disjunction of arbitrary sentences. Rather, there is a new kind of sentence, which is the disjunction of exactly one sentence $({\mathsf{all} \ a \ b})$ and one sentence $({\mathsf{some} \ x \ y})$ . For a proof system, we take the rules in Figure 3, and we write ${\vdash }$ for provability in this system.

Fig. 3 Rules in §2.5. These rules are added on top of the rules in Figure 2.

Lemma 2.29. The proof system is sound.

Proof. We argue soundness for $({{\scriptstyle \mathrm{EMPTY}}_1})$ , $({{\scriptstyle \mathrm{EMPTY}}_2})$ , and $({{\scriptstyle \mathrm{DD}}})$ , since soundness of the other rules is clear from the meaning of disjunction. Fix a model ${\cal M}$ .

For $({{\scriptstyle \mathrm{EMPTY}}_1})$ , either $[\![ a ]\!]\neq \emptyset $ , or $\emptyset = [\![ a ]\!]\subseteq [\![ b ]\!]$ . In either case, ${\cal M}\models (\mathsf{all }\ a \ b)\lor (\mathsf{some }\ a \ a)$ .

For $({{\scriptstyle \mathrm{EMPTY}}_2})$ , either $[\![ a ]\!] \neq \emptyset $ , or $[\![ a ]\!] = \emptyset $ . In the latter case, $[\![ b ]\!]\subseteq [\![ {{r \ \mathsf{all} \ a}} ]\!] = M$ . In either case, ${\cal M}\models (\mathsf{all} \ b \ (r \ \mathsf{all} \ a))\lor (\mathsf{some}\ a \ a)$ .

For $({{\scriptstyle \mathrm{DD}}})$ , suppose that ${\cal M}$ satisfies the premises of the rule, and assume for contradiction that $[\![ x ]\!]\cap [\![ y ]\!] = \emptyset $ . Then ${\cal M}\models \mathsf{all} \ t \ x$ and ${\cal M}\models \mathsf{all} \ u \ y$ . So $[\![ t ]\!]\cap [\![ u ]\!]\subseteq [\![ x ]\!]\cap [\![ y ]\!] = \emptyset $ , contradicting ${\cal M}\models {\mathsf{some} \ t \ u}$ . □

Example 2.30. The rules $({{\scriptstyle \mathrm{DD}}})$ and its companion $({{\scriptstyle \mathrm{DD}}'})$ stand for “double darii”. To see the connection between these rules and $({\scriptstyle \mathrm{DARII}})$ , we note that $({\scriptstyle \mathrm{DARII}})$ is redundant in this system:

Note also that $({{\scriptstyle \mathrm{DD}}})$ is the only new rule in our proof system which produces a conclusion in the original syntax of ${\cal L}_{2}$ . It is responsible, together with $({{\scriptstyle \mathrm{EMPTY}}_2})$ , for the reasoning captured by $({{\scriptstyle \mathrm{CASES}}})$ and $({{\scriptstyle \mathrm{CHAINS}}})$ in the previous two sections.

Example 2.31. We show that $\Gamma _n\ {\vdash}\ {\mathsf{some} \ a \ a }$ , where once again $\Gamma _n$ is from §2.2. For each $1\leq i < n$ , we have the derivation:

and similarly, we have:

By n applications of $({{\scriptstyle \mathrm{BARB}}'})$ , we obtain $(\mathsf{all} \ (r_1^2 \ \mathsf{all} \ a) \ a) \lor (\mathsf{some} \ a\ a)$ . And then we conclude:

Completeness and ${{\rm PT}{\scriptstyle \mathrm{IME}}}$ -decidability

At this point, we turn to the completeness and ${{\rm PT}{\scriptstyle \mathrm{IME}}}$ -decidability of the logic. We are going to apply Theorem 1.5. For any set $\Delta $ , let $T(\Delta )$ be the set of subterms of sentences in $\Delta $ . Let $T^+(\Delta )$ be $T(\Delta )$ together with the terms $(r \ \mathsf{all} \ w)$ where $w\in T(\Delta )$ and where r occurs in $\Delta $ .

Let $g(\Delta )$ be the set consisting of

(i) All sentences $({\mathsf{all } \ x \ y})$ , where $x\in T(\Delta )$ and $y\in T^+(\Delta ).$
(ii) All sentences $({\mathsf{some} \ u \ v})$ , where $u, v\in T(\Delta )$ .
(iii) All sentences $(\mathsf{all} \ x \ y)\lor (\mathsf{some} \ u \ v)$ , where $x,u,v\in T(\Delta )$ and $y\in T^+(\Delta )$ .

Note that g is computable in ${{\rm PT}{\scriptstyle \mathrm{IME}}}$ .

Theorem 2.32. If $\Gamma \models {\varphi} $ , then $\Gamma \ {\vdash}_{g(\Gamma \cup \{ {\varphi} \})}\ {\varphi} $ . Hence the consequence relation for ${\cal L}_{2}^+$ is in ${{\rm PT}{\scriptstyle \mathrm{IME}}}$ .

Proof. As in §1.4, we are going to save on some notation below by writing T for $T(\Gamma \cup \{ {\varphi} \})$ , $T^+$ for $T^+(\Gamma \cup \{ {\varphi} \})$ , and A for $g(\Gamma \cup \{ {\varphi} \})$ .

We are going to do this entire proof in two parts. The first part handles the case that $ {\varphi} $ is either $(\mathsf{all} \ a \ b)\lor (\mathsf{some} \ x \ y)$ or else $(\mathsf{some} \ x \ y)$ . After that, we handle the relatively simpler case that $ {\varphi} $ is $(\mathsf{all} \ a \ b)$ .

So until further notice, we are in the first part of this theorem. Please note that x, y, a, and b are fixed throughout the rest of this proof.

Let $M = M_{xy}$ be the set of unordered pairs $\{t,u\}$ of terms from T such that there is some $z\in \{x,y\}$ such that for all $v\in \{t,u\}$ , $\Gamma \not\vdash_{ A} (\mathsf{all} \ v \ z )\lor (\mathsf{some} \ x\ y)$ .

We allow $t= u$ , and it follows that whenever M contains $\{t,u\}$ , then it also contains $\{t\} = \{t,t\}$ . The point of the definition of M will become clearer after we see the Truth Lemma and Claim 2.35 below: we are building a model which is guaranteed to have $[\![ x ]\!]\cap [\![ y ]\!] = \emptyset $ .

We define a model ${\cal M}$ with domain M by setting

$$ \begin{align*} \begin{aligned} \{t,u\}\in [\![ p ]\!] &{\quad \text{iff}\quad} \text{either } \Gamma \ {{\vdash}_{ A}}\ ({\mathsf{all} \ t \ p})\lor (\mathsf{some} \ x \ y) \text{, or } \Gamma \ {{\vdash}_{ A}}\ {(\mathsf{all} \ u \ p)}\lor (\mathsf{some} \ x \ y)\\\{t,u\}[\![ r ]\!]\{v,w \} & \quad \text{iff}\quad \text{for some}\ c\in \{t,u\}\ \text{and } d\in\{v,w \},\\& \quad \hphantom{\text{iff}}\quad \Gamma\ \vdash_{ A}\ (\mathsf{all} \ c \ (r \ \mathsf{all} \ d))\lor (\mathsf{some} \ x \ y). \end{aligned} \end{align*} $$

Claim 2.33 (Truth Lemma). In ${\cal M}$ ,we have the following for all terms $z\in T$ ,

$$ \begin{align*} {[\![ {z} ]\!]} = \{\{t,u\}\in M : \Gamma \ {{\vdash}_{ A}}\ (\mathsf{all} \ t \ z)\lor (\mathsf{some} \ x \ y) \text{ or }\Gamma \ {{\vdash}_{ A}}\ (\mathsf{all} \ u \ z)\lor (\mathsf{some} \ x \ y)\}.\\ \end{align*} $$

□

Proof. By induction on z. For a noun in ${\mathbf {N}}$ , this is by definition of the model. So we assume our statement for z and prove it for $({r \ \mathsf{all} \ z})\in T$ . Note that $z\in T$ , since T is closed under subterms.

Fix $\{t,u\}\in M$ , and suppose (without loss of generality) $\Gamma\ {\vdash}_{ A}\ (\mathsf{all} \ t \ (r \ \mathsf{all} \ z))\lor (\mathsf{some} \ x \ y)$ . We show that $\{t,u\}\in {[\![ r \ \mathsf{all} \ z ]\!]}$ . Let $\{v,w \}\in M$ be an element of ${[\![ {z} ]\!]}$ . By the induction hypothesis, we have (without loss of generality) $\Gamma\ {{\vdash }_{ A}}\ {(\mathsf{all} \ v \ z)}\lor (\mathsf{some} \ x \ y)$ . Then by $({{\scriptstyle \mathrm{ANTI}}'})$ , $\Gamma\ {{\vdash }_{ A}}\ (\mathsf{all} \ (r \ \mathsf{all} \ z) \ (r \ \mathsf{all} \ v))\lor (\mathsf{some} \ x \ y)$ . Note that the sentence $(\mathsf{all} \ (r \ \mathsf{all} \ z) \ (r \ \mathsf{all} \ v))\lor (\mathsf{some} \ x \ y)$ belongs to A because $({r \ \mathsf{all} \ z})$ , x, and y belong to T and ${r \ \mathsf{all} \ v}$ to $T^+$ . Moreover, $(\mathsf{all} \ {t} \ (r \ \mathsf{all} \ v))\lor (\mathsf{some} \ x \ y)$ again belongs to A. By $({{\scriptstyle \mathrm{BARB}}'})$ , $\Gamma\ {{\vdash }_{ A}}\ (\mathsf{all} \ t \ ({r \ \mathsf{all} \ v}))\lor (\mathsf{some} \ x \ y)$ , and hence $\{t,u\}[\![ r ]\!] \{v,w \}$ . So $\{t,u\}\in [\![ r \ \mathsf{all} \ z ]\!]$ .

Conversely, suppose $\{t,u\}\in {[\![ {r \ \mathsf{all} \ z} ]\!]}$ .

Case 1: $\{z\}\in M$ . Notice that $\Gamma\ {{\vdash }_{ A}}\ {(\mathsf{all} \ z \ z)}\lor (\mathsf{some} \ x \ y)$ by $({\scriptstyle \mathrm{AX}})$ and $({{\scriptstyle \mathrm{WL}}})$ . By induction, $\{z\}\in {[\![ {z} ]\!]}$ . Hence $\{t,u\}[\![ r ]\!]\{z\}$ . So by the definition of the model, we have either $\Gamma\ {{\vdash }_{ A}}\ (\mathsf{all} \ t \ (r \ \mathsf{all} \ z)) \lor (\mathsf{some} \ x \ y)$ or $\Gamma\ {\vdash}_{ A}\ (\mathsf{all} \ u \ (r \ \mathsf{all} \ z))\lor (\mathsf{some} \ x \ y)$ , as desired.

Case 2: $\{z\}\notin M$ . Then $\Gamma\ {{\vdash }_{ A}}\ {(\mathsf{all} \ z \ x)}\lor (\mathsf{some} \ x \ y)$ and $\Gamma\ {{\vdash }_{ A}}\ {(\mathsf{all} \ z \ y)}\lor (\mathsf{some} \ x \ y)$ . So we have a proof from $\Gamma $ :

As before, all sentences shown belong to A. We are done. Incidentally, the same argument shows that also

$\Gamma\ {{\vdash }_{ A}}\ (\mathsf{all} \ u \ (r \ \mathsf{all} \ z))\lor (\mathsf{some} \ x \ y)$ .□

We conclude the first part of our proof of Theorem 2.32 with two claims. Together with the assumption that $\Gamma \models {\varphi} $ , they show that $\Gamma\ {{\vdash }_{ A}}\ {\varphi} $ , where $ {\varphi} $ is the sentence in the statement of our theorem. In this part of the proof, recall that $ {\varphi} $ is either $(\mathsf{all} \ a \ b)\lor (\mathsf{some} \ x \ y)$ or $({\mathsf{some} \ x \ y})$ .

Claim 2.34. Either $\Gamma\ {{\vdash }_{ A}}\ {\varphi} $ , or ${\cal M} \models \Gamma $ .

Proof. Let $\psi $ be a sentence in $\Gamma $ . We check that either $\Gamma\ {{\vdash }_{ A}}\ {\varphi} $ , or ${\cal M} \models \psi $ .

Case 1: $\psi $ is $(\mathsf{all} \ c\ d)$ . By $({{\scriptstyle \mathrm{WL}}})$ , $\Gamma\ {{\vdash }_{ A}}\ {(\mathsf{all} \ c \ d)}\lor (\mathsf{some} \ x \ y)$ . For any $\{t,u\}\in {[\![ {c} ]\!]}$ , by the Truth Lemma (without loss of generality) $\Gamma\ {{\vdash }_{ A}}\ {(\mathsf{all} \ t \ c)}\lor (\mathsf{some} \ x \ y)$ . By $({{\scriptstyle \mathrm{BARB}}'})$ , $\Gamma\ {{\vdash }_{ A}}\ {(\mathsf{all} \ t \ d)}\lor (\mathsf{some} \ x \ y)$ . So $\{t,u\}\in {[\![ d ]\!]}$ by the Truth Lemma again. So in this case, we have ${\cal M}\models \psi $ .

Case 2: $\psi $ is $({\mathsf{some} \ c \ d})$ . There are two subcases, depending on whether or not $\{c,d\}$ belongs to M. If it does, then $\{c,d\}\in {[\![ {c} ]\!]}\cap {[\![ {d} ]\!]}$ , so ${\cal M}\models \psi $ . So we assume that $\{c,d\}\notin M$ . Then there are $e,f\in \{c,d\}$ such that $\Gamma\ {{\vdash }_{ A}}\ (\mathsf{all} \ e\ x)\lor (\mathsf{some} \ x \ y)$ and $\Gamma\ {{\vdash }_{ A}}\ {(\mathsf{all} \ f \ y)}\lor (\mathsf{some} \ x \ y)$ . Applying $({{\scriptstyle \mathrm{SOME}}_1})$ and $({{\scriptstyle \mathrm{SOME}}_2})$ as needed, $\Gamma\ {{\vdash }_{ A}}\ {\mathsf{some} \ e \ f}$ . Then by $({{\scriptstyle \mathrm{DD}}})$ , $\Gamma\ {{\vdash }_{ A}}\ {\mathsf{some} \ x \ y}$ . If $ {\varphi} $ is $({\mathsf{some} \ x \ y})$ , then we are immediately done. And if $ {\varphi} $ is $({\mathsf{all} \ a \ b})\lor (\mathsf{some} \ x \ y)$ , then we are done after applying $({{\scriptstyle \mathrm{WR}}})$ .

Case 3: $\psi $ is $(\mathsf{all} \ s \ t)\lor (\mathsf{some} \ c \ d)$ . This is a combination of the two previous arguments.

Again there are two subcases, depending on whether or not $\{c,d\}$ belongs to M. If it does, then $\{c,d\}\in {[\![ {c} ]\!]}\cap {[\![ {d} ]\!]}$ , so ${\cal M}\models \psi $ . So we assume that $\{c,d\}\notin M$ . Then there are $e,f\in \{c,d\}$ such that $\Gamma\ {{\vdash }_{ A}}\ {(\mathsf{all} \ e \ x)}\lor (\mathsf{some} \ x \ y)$ and $\Gamma\ {{\vdash }_{ A}}\ {(\mathsf{all} \ f \ y)}\lor (\mathsf{some} \ x \ y)$ . Applying $({{\scriptstyle \mathrm{SOME}}_1^{\prime }})$ and $({{\scriptstyle \mathrm{SOME}}_2^{\prime }})$ as needed, $\Gamma\ {{\vdash }_{ A}}\ {(\mathsf{all} \ s \ t)}\lor (\mathsf{some} \ e \ f )$ . Then by $({{\scriptstyle \mathrm{DD}}'})$ , $\Gamma\ {{\vdash }_{ A}}\ {(\mathsf{all} \ s \ t)}\lor (\mathsf{some} \ x \ y)$ . For any $\{u,v\}\in {[\![ {s} ]\!]}$ , by the Truth Lemma (without loss of generality) $\Gamma\ {{\vdash }_{ A}}\ {(\mathsf{all} \ u \ s)}\lor (\mathsf{some} \ x \ y)$ . By $({{\scriptstyle \mathrm{BARB}}'})$ , $\Gamma\ {{\vdash }_{ A}}\ {(\mathsf{all} \ u \ t)}\lor (\mathsf{some} \ x \ y)$ . So $\{u,v\}\in [\![ t ]\!]$ by the Truth Lemma again. So we have ${\cal M}\models \mathsf{all} \ s \ t$ , and ${\cal M}\models \psi $ . □

Claim 2.35. If ${\cal M} \models {\varphi} $ , then $\Gamma\ {{\vdash }_{ A}}\ {\varphi} $ .

Proof. Case 1: $ {\varphi} $ is $({\mathsf{some} \ x \ y})$ . In this case, we claim that ${\cal M} \not \models {\varphi} $ . The reason is that by the Truth Lemma and the definition of M,

$$ \begin{align*} \{t,u\}\in M \quad\text{iff}\quad \{t,u\}\notin[\![ x ]\!]\cap[\![ y ]\!]. \end{align*} $$

Case 2: $ {\varphi} $ is $(\mathsf{all} \ a \ b)\lor (\mathsf{some} \ x \ y)$ . By Case 1, we assume that ${\cal M}\models {\mathsf{all} \ a \ b}$ . Consider $\{a\}$ . If $\{a\}\notin M$ , then $\Gamma\ {{\vdash }_{ A}}\ (\mathsf{all} \ a \ x)\lor (\mathsf{some} \ x \ y)$ and $\Gamma\ {{\vdash }_{ A}}\ {(\mathsf{all} \ a \ y)}\lor (\mathsf{some} \ x \ y)$ . Then we have the following proof of $ {\varphi} $ from $\Gamma $ :

All sentences shown belong to A. So we have the desired conclusion $\Gamma\ {{\vdash }_{ A}}\ {\varphi} $ . On the other hand, if $\{a\}\in M$ , then since $\{a\}\in [\![ a ]\!]$ and ${\cal M}\models {\mathsf{all} \ a \ b}$ , we have $\{a\}\in [\![ b ]\!]$ . By the Truth Lemma, we again have $\Gamma\ {{\vdash }_{ A}}\ (\mathsf{all} \ a \ b)\lor (\mathsf{some} \ x \ y)$ .□

This concludes the first part of the proof of Theorem 2.32. The second part is when $ {\varphi} $ is a sentence $(\mathsf{all} \ c \ d)$ . In this part, we repeat the construction and proof above, with the following adjustments:

1. We let M be the set of all unordered pairs of terms from T, with no restriction.
2. We drop the disjunct $\lor ({\mathsf{some} \ x \ y})$ from all sentences which appear in the proof, including the definition of ${\cal M}$ and the statement of the Truth Lemma.
3. In the proof of the Truth Lemma, Case 2 does not occur, since $\{z\}\in M$ .
4. In the proof of Claim 2.34, the subcases where $\{c,d\}\notin M$ do not occur.
5. We replace the proof of Claim 2.35 with the following argument. Recall that $ {\varphi} $ is $(\mathsf{all} \ c \ d)$ . We assume that ${[\![ {c} ]\!]}\subseteq {[\![ {d} ]\!]}$ , and we need to show that $\Gamma\ {{\vdash }_{ A}}\ \mathsf{all} \ c \ d$ . By the Truth Lemma, $\{c\}\in {[\![ {c} ]\!]}$ . Thus, $\{c\}\in {[\![ {d} ]\!]}$ . By the Truth Lemma again, $\Gamma\ {{\vdash }_{ A}}\ \mathsf{all} \ c \ d$ .

3 ${\cal L}_{3}$ and ${\cal L}_{3.5}$ : Adding the term former $({\textit{r} \ \mathsf{some} \ x})$ to ${{\cal L}_{1}}$ and ${\cal L}_{2}$

In this section, we study ${\cal L}_{3}$ , the language with term formers $({r \ \mathsf{all} \ x})$ and $({\textit{r} \ \mathsf{some} \ x})$ , and with sentence former $({\mathsf{all } \ x \ y})$ . We also study the larger language ${\cal L}_{3.5}$ which adds the sentence former $({\mathsf{some} \ x \ y})$ .

The language ${\cal L}_{3.5}$ has essentially already been studied by McAllester and Givan in [Reference McAllester and Givan2], but that paper is primarily concerned with complexity results rather than completeness results. What McAllester and Givan would call a quantifier-free atomic formula without constants is exactly what we call a sentence of ${\cal L}_{3.5}$ . What they would call a quantifier-free literal without constants is either an ${\cal L}_{3.5}$ sentence $ {\varphi} $ or its negation $\neg {\varphi} $ . They show that the satisfiability problem for sets $\Gamma $ of literals which determine existentials is in ${{\rm PT}{\scriptstyle \mathrm{IME}}}$ . And from this, they derive that the satisfiability problem for sets $\Gamma $ of literals (which perhaps do not determine existentials) of literals is in ${\rm NPT}{\scriptstyle \mathrm{IME}}$ . Thus, their result implies that the consequence relation for ${\cal L}_{3.5}$ is in ${{\rm C}{\scriptstyle \mathrm{O-}}}{\rm NPT}{\scriptstyle \mathrm{IME}}$ . They prove a matching hardness result as well, and so the consequence relation for ${\cal L}_{3.5}$ is ${{\rm C}{\scriptstyle \mathrm{O-}}}{\rm NPT}{\scriptstyle \mathrm{IME}}$ complete.

We show that the consequence relation for ${\cal L}_{3}$ is ${{\rm C}{\scriptstyle \mathrm{O-}}}{\rm NPT}{\scriptstyle \mathrm{IME}}$ hard. Our proof is based on the one in [Reference McAllester and Givan2], and the result here is a slight improvement on [Reference McAllester and Givan2] because ${\cal L}_{3}$ is a little weaker than their language. As a corollary, if P $\neq $ NP, then there is no boundedly complete syllogistic proof system for ${\cal L}_{3}$ or any language larger than it.

In §§3.3 and 3.2 we formulate proof systems and obtain completeness results for ${\cal L}_{3}$ and ${\cal L}_{3.5}$ . We also reprove the ${{\rm C}{\scriptstyle \mathrm{O-}}}{\rm NPT}{\scriptstyle \mathrm{IME}}$ decidability of ${\cal L}_{3.5}$ by a polynomial-size countermodel construction.

3.1 Co-NPTime hardness of the consequence relation of ${\cal L}_{3}$

Theorem 3.1. The problem of deciding whether $\Gamma \not \models {\varphi} $ , for $\Gamma \cup \{ {\varphi} \}$ a finite set of ${\cal L}_{3}$ -sentences, is ${\rm NPT}{\scriptstyle \mathrm{IME}}$ hard.

Proof. We use a reduction from the one-in-three positive $3$ -SAT problem first studied by Schaefer [Reference Thomas11]. This problem is defined as follows. We are given a set $\mathcal {S}$ of clauses of the form $U\lor V\lor W$ , where U, V, and W are distinct. (Note that negation is not used.) The problem is to find a truth assignment f to the variables making exactly one variable in each clause ${\sf T}$ and the other two variables ${\sf F}$ . We call this a $1$ -valued assignment for ${\cal S}$ . This problem was shown to be ${\rm NPT}{\scriptstyle \mathrm{IME}}$ complete in Schaefer [Reference Thomas11].

We define a set $\Gamma = \Gamma ({\cal S})$ below, in two steps. We use nouns which correspond to the variables of ${\cal S}$ , writing u for the noun corresponding with U, etc. $\Gamma $ also uses a number of other nouns and verbs. It is defined as follows:

(1) For each clause $c\in {\cal S}$ , say $c = U\lor V\lor W$ , put the following sentences in $\Gamma $ :
$$ \begin{align*} \begin{array}{ll} \mathsf{all} \ \mathsf{start} \ {(r^1_c \ \mathsf{all} \ u)} & \mathsf{all} \ {(r^1_c \ \mathsf{some} \ u)} \ y_c \\ \mathsf{all} \ {y_c} \ (r^2_c \ \mathsf{all} \ v) & \mathsf{all} \ {(r^2_c \ \mathsf{some} \ v) \ {z_c}} \\ \mathsf{all} \ {z_c} \ {(r^3_c \ \mathsf{all}\ w)} & \mathsf{all} \ {(r^3_c\ \mathsf{some} \ w)} \ \mathsf{finish}. \\ \end{array} \end{align*} $$
Here $\mathsf{start}$ and $\mathsf{finish}$ are new nouns (not varying with the clause), $y_c$ and $z_c$ are also new nouns (these do vary with c), and $r^1_2$ , $r^2_c$ , and $r^3_c$ are new verbs.
(2) Let P and Q be any two distinct variables which occur together in some clause c. Then add to $\Gamma $ the sentence $ {\varphi} _{p,q}$ :
$$ \begin{align*} \mathsf{all} \ (r_{p,q} \ \mathsf{all} \ p) \ (\mathsf{r}_{p,q}^{\prime} \text{ some } q). \end{align*} $$
Here $r_{p,q}$ and $r^{\prime }_{p,q}$ are new verbs. (By symmetry, we also add $ {\varphi} _{q,p}$ .)

So if ${\cal S}$ has k clauses, then the first point will add $2 + 2k$ new nouns and $3k$ new verbs. The second clause will add at most $2\cdot \binom {3k}{2} < 18 k^2$ new verbs.

Claim 3.2. ${\cal S}$ has a $1$ -valued assignment iff $\Gamma \not \models \mathsf{all} \ {\mathsf{start}} \ {\mathsf{finish}}$ . □

Proof. In one direction, assume that ${\cal M}\models \Gamma $ and ${\cal M}\not \models \mathsf{all} \ {\mathsf{start}} \ \mathsf{finish}$ . Define a truth assignment f by $f(U) = {\sf F}$ iff $[\![ u ]\!] \neq \emptyset $ . Consider a clause $c = U\lor V\lor W$ of ${\cal S}$ . If $f(U) = f(V) = f(W) = {\sf F}$ , then $[\![ u ]\!]$ , $[\![ v ]\!]$ , and ${[\![ {w} ]\!]}$ are all nonempty. By the sentences in (1),

$$ \begin{align*} {[\![ {\mathsf{start}} ]\!]} \subseteq {[\![ {y_c} ]\!]} \subseteq {[\![ {z_c} ]\!]} \subseteq {[\![ {\mathsf{finish}} ]\!]}. \end{align*} $$

But this contradicts that ${\cal M}\not \models \mathsf{all} \ {\mathsf{start}} \ {\mathsf{finish}}$ . Thus we know that at least one variable in c is assigned the value ${\sf T}$ by f. We claim that only one variable can be ${\sf T}$ . For suppose towards a contradiction that (for example) $U \neq V$ but $f(U) = f(V) = {\sf T}$ . Then $[\![ u ]\!] = [\![ v ]\!] = \emptyset $ . So $[\![ r_{p,q} \ \mathsf{all} \ u]\!] = M$ and ${[\![ r^{\prime }_{p,q} \ \mathsf{some } v ]\!]} = \emptyset $ . By the sentence $ {\varphi} _{p,q}$ in point (2), M is empty. But this is impossible, since ${\cal M}\not \models \mathsf{all} \ {\mathsf{start}} \ {\mathsf{finish}}$ .

Conversely, suppose f is $1$ -valued on ${\cal S}$ . We must find a model ${\cal M} \models \Gamma $ where ${\cal M}\not \models \mathsf{all} \ {\mathsf{start}} \ {\mathsf{finish}}$ . Let M be the set of variables U such that $f(U) = {\sf F}$ , together with $\mathsf{start}$ and $\mathsf{finish}$ . For a variable X, define $[\![ x ]\!] = \emptyset $ if $f(X) = {\sf T}$ , and $[\![ x ]\!] = \{x\}$ if $f(X) = {\sf F}$ . We also take ${[\![ {\mathsf{start}} ]\!]} = \{\mathsf{start}\}$ , and ${[\![ {\mathsf{finish}} ]\!]} = \{\mathsf{finish}\}$ .

We still need to define ${[\![ {y_c} ]\!]}$ , ${[\![ {z_c} ]\!]}$ , ${[\![ {r^1_c} ]\!]}$ , ${[\![ {r^2_c} ]\!]}$ , ${[\![ {r^2_c} ]\!]}$ for all clauses c, and also ${[\![ {r_{p,q}} ]\!]}$ and ${[\![ {r^{\prime }_{p,q}} ]\!]}$ when P and Q are distinct variables in the same clause.

Suppose that P and Q are distinct variables which happen to belong to the same clause. We must arrange that ${\cal M}\models {\varphi} _{p,q}$ . Set ${[\![ {r_{p,q}} ]\!]} = \emptyset $ and ${[\![ {r^{\prime }_{p,q}} ]\!]} = M\times M$ . We know that either $f(P) = {\sf F}$ or $f(Q) = {\sf F}$ (or both). In the first case, $[\![ r_{p,q} \ \mathsf{all} \ p ]\!] = \emptyset $ , so ${\cal M}\models {\varphi} _{p,q}$ . In the second case, $[\![ r^{\prime }_{p,q}\ \mathsf{some } \ q ]\!] = M$ so again ${\cal M}\models {\varphi} _{p,q}$ .

Finally, we consider the sentences in (1). There are three cases.

If $f(U) = {\sf T}$ , $f(V) = {\sf F}$ , and $f(W) = {\sf F}$ , then we have $[\![ u ]\!] = \emptyset $ , $[\![ v ]\!] = \{v\}$ , and ${[\![ {w} ]\!]} = \{w\}$ . We set ${[\![ {y_c} ]\!]} = \emptyset $ , ${[\![ {z_c} ]\!]} = \emptyset $ , ${[\![ {r^1_c} ]\!]} = M\times M$ , ${[\![ {r^2_c} ]\!]} = \emptyset $ , and ${[\![ {r^3_c} ]\!]} = \emptyset $ .

If $f(U) = {\sf F}$ , $f(V) = {\sf T}$ , and $f(W) = {\sf F}$ , we have $[\![ u ]\!] = \{u\}$ , $[\![ v ]\!] = \emptyset $ , and ${[\![ {w} ]\!]} = \{w\}$ . We set ${[\![ {y_c} ]\!]} = M$ , ${[\![ {z_c} ]\!]} = \emptyset $ , ${[\![ {r^1_c} ]\!]} = M\times M$ , ${[\![ {r^2_c} ]\!]} = \emptyset $ , and ${[\![ {r^3_c} ]\!]} = \emptyset $ .

If $f(U) = {\sf F}$ , $f(V) = {\sf F}$ , and $f(W) = {\sf T}$ , we have $[\![ u ]\!] = \{u\}$ , $[\![ v ]\!] = \{v\}$ , and ${[\![ {w} ]\!]} = \emptyset $ . We set ${[\![ {y_c} ]\!]} = M$ , ${[\![ {z_c} ]\!]} = M$ , ${[\![ {r^1_c} ]\!]} = M\times M$ , ${[\![ {r^2_c} ]\!]} = M\times M$ , and ${[\![ {r^3_c} ]\!]} = \emptyset $ .

In all cases, the resulting model ${\cal M}$ satisfies all sentences in (1), hence all sentences in $\Gamma $ . And in all cases, ${\cal M}\not \models \mathsf{all} \ {\mathsf{start}} \ {\mathsf{finish}}$ .□

The claim concludes the proof Theorem 3.1.

3.2 Completeness and Co-NPTime decidability for ${\cal L}_{3.5}$

We first present a sound and complete proof system for ${\cal L}_{3.5}$ , because it is actually a bit simpler than ${\cal L}_{3}$ , and mirrors more closely our work from §2. The rules are in Figure 4. We return to ${\cal L}_{3}$ in §3.3 below.

Fig. 4 The logic of §3.2. We also use the rules in Figure 2.

Since the consequence relations for ${\cal L}_{3}$ and ${\cal L}_{3.5}$ are ${{\rm C}{\scriptstyle \mathrm{O-}}}{\rm NPT}{\scriptstyle \mathrm{IME}}$ hard, by Theorem 1.5 we cannot hope for a boundedly complete syllogistic proof system for these languages (unless P $=$ NP). We regard it as unlikely that they admit any sound and complete syllogistic proof system. Instead, we settle for a proof system with ( cases ) from §2.3, as well as a variant, $({\scriptstyle \mathrm{CASES}}_2)$ . Figure 4 gives proof rules for this logic.

In this section, we write ${\vdash }$ for provability in the system with rules $({\scriptstyle \mathrm{AX}})$ , $({\scriptstyle \mathrm{BARBARA}})$ , $({\scriptstyle \mathrm{ANTI}})$ , $({\scriptstyle \mathrm{SOME}}_1)$ , $({\scriptstyle \mathrm{SOME}}_2)$ , (darii), (mono), (sa), (ss), ( cases ), and $({\scriptstyle \mathrm{CASES}}_2)$ . Given a theory $\Gamma $ , we write $x\leq y$ when $\Gamma\ {\vdash}\ {\mathsf{all } \ x \ y}$ . We say that $\Gamma $ determines existentials for a set of terms ${\mathbb {T}}$ if for every $x\in {\mathbb {T}}$ , either $\Gamma\ {\vdash}\ {\mathsf{some} \ x \ x}$ , or else $\Gamma\ {\vdash}\ {\mathsf{all } \ x \ y}$ and $\Gamma\ {\vdash}\ \mathsf{all} \ y \ (r \ \mathsf{all} \ x)$ for all terms $y\in {\mathbb {T}}$ and all verbs r.

The canonical model

Let $\Gamma $ be a theory, and let ${\mathbb {T}}$ be a set of terms, closed under subterms as usual. Define

$$ \begin{align*} M = \{{\langle {x,y,Q} \rangle}\in {\mathbb{T}}\times {\mathbb{T}} \times \{\forall,\exists\}: \Gamma\ {\vdash}\ {\mathsf{some} \ x \ y}\}. \end{align*} $$

We define a model ${\cal M}(\Gamma ,{\mathbb {T}})$ with domain M by setting

$$ \begin{align*} {\langle {x,y,Q} \rangle}\in [\![ p ]\!] & {\quad \text{iff}\quad} x\leq p \text{ or } y\leq p \\ {\langle {x_1, y_1, Q_1} \rangle} [\![ r ]\!] {\langle {x_2,y_2, Q_2} \rangle} & {\quad \text{iff}\quad} \text{for some}\ z_1\in \{x_1,y_1\} \text{ and } z_2\in \{x_2,y_2\}, \text{ either}\\ &\qquad\quad \text{(a) }z_1\leq (r \ \mathsf{all} \ {z_2}), \text{or}\\ & \qquad\quad \text{(b) } Q_2 = \exists, x_2 = y_2, \text{and } z_1\leq (r\ \mathsf{ some } \ {z_2}). \end{align*} $$

Lemma 3.3 (Truth Lemma). If $\Gamma $ determines existentials for ${\mathbb {T}}$ , then in ${\cal M}'(\Gamma ,{\mathbb {T}})$ , for all $t\in {\mathbb {T}}$ ,

(8)

$$ \begin{align} [\![ t ]\!] = \{{\langle {x,y,Q} \rangle}\in M: x\leq p\text{ or }y\leq p\}. \end{align} $$

Proof. The proof is by induction on t. For a noun p, this is by definition of the model.

The induction step for $(r \ \mathsf{all} \ t).$ Since ${\mathbb {T}}$ is closed under subterms, $t\in {\mathbb {T}}$ . Take some ${\langle {c_1,c_2,Q} \rangle }$ such that $c_i\leq (r \ \mathsf{all} \ t)$ for some $i\in \{1,2\}$ . We show that ${\langle {c_1,c_2,Q} \rangle }\in {[\![ r\ \mathsf{ all }\ t ]\!]}$ . For this, suppose ${\langle {d_1,d_2,Q'} \rangle }\in [\![ t ]\!]$ . By induction hypothesis, $d_j\leq t$ for some $j\in \{1,2\}$ . Using $({\scriptstyle \mathrm{ANTI}})$ , $(r \ \mathsf{all} \ {t})\leq (r \ \mathsf{all} \ {d_j})$ , so $c_i\leq (r \ \mathsf{all} \ {d_j})$ . Thus, ${\langle {c_1,c_2,Q} \rangle }[\![ r ]\!]{\langle {d_1,d_2,Q'} \rangle }$ . Since ${\langle {d_1,d_2,Q'} \rangle }$ was arbitrary, we have shown that ${\langle {c_1,c_2,Q} \rangle }\in {[\![ {r \ \mathsf{all} \ {t}} ]\!]}$ , as desired.

In the other direction, suppose that ${\langle {c_1,c_2,Q} \rangle }\in {[\![ r \ \mathsf{all} \ t]\!]}$ . If $\Gamma\ {\vdash}\ {\mathsf{some} \ t \ t}$ , then ${\langle {t,t,\forall } \rangle }\in M$ . By induction, since $t\leq t$ , ${\langle {t,t,\forall } \rangle }\in [\![ t ]\!]$ , so ${\langle {c_1,c_2,Q} \rangle }[\![ r ]\!]{\langle {t,t,\forall } \rangle }$ . Since the last component in ${\langle {t,t,\forall } \rangle }$ is $\forall $ rather than $\exists $ , case (a) in the definition of $[\![ r ]\!]$ holds, and $c_i\leq (r \ \mathsf{all} \ {t})$ for some $i\in \{1,2\}$ .

It remains to consider the case when $\Gamma \not {\vdash }\ {\mathsf{some} \ t \ t}$ . But since $\Gamma $ determines existentials, $\Gamma\ {\vdash}\ \mathsf{all} \ y \ (r \ \mathsf{all} \ t)$ for all terms y, and in particular $c_1\leq (r \ \mathsf{all} \ t)$ .

The induction step for $({\textit{r} \ \mathsf{some} \ t}).$ Again, since ${\mathbb {T}}$ is closed under subterms, $t\in {\mathbb {T}}$ . Take some ${\langle {c_1,c_2,Q} \rangle }$ such that $c_i\leq ({r}\ \mathsf{ some }\ t)$ . The fact that ${\langle {c_1,c_2,Q} \rangle }\in M$ implies that $\Gamma\ {\vdash}{\mathsf{some} \ c_1 \ c_2}$ . By $({\scriptstyle \mathrm{SOME}}_2)$ and $({\scriptstyle \mathrm{SOME}}_1)$ , $\Gamma\ {\vdash}\ {\mathsf{some} \ c_i \ c_i}$ , and by (darii), $\Gamma\ {\vdash}\ {\mathsf{some} \ c_i \ {({\textit{r} \ \mathsf{some} \ t})}}$ . But then by (ss), $\Gamma\ {\vdash}\ {\mathsf{some} \ t \ t}$ , and hence ${\langle {t,t,\exists } \rangle }\in M$ . By case (b) in the definition of $[\![ r ]\!]$ , ${\langle {c_1,c_2,Q} \rangle }[\![ r ]\!]{\langle {t,t,\exists } \rangle }$ . By induction ${\langle {t,t,\exists } \rangle }\in [\![ t ]\!]$ , so ${\langle {c_1,c_2,Q} \rangle }\in {[\![ {{\textit{r} \ \mathsf{some} \ t}} ]\!]}$ .

In the other direction, suppose that ${\langle {c_1,c_2,Q} \rangle } \in {[\![ {{\textit{r} \ \mathsf{some} \ t}} ]\!]}$ . Then we have ${\langle {c_1,c_2,Q} \rangle } [\![ r ]\!]{\langle {d_1,d_2,Q'} \rangle }$ for some ${\langle {d_1,d_2,Q'} \rangle }\in [\![ t ]\!]$ . Since ${\langle {d_1,d_2,Q'} \rangle }\in M$ , $\Gamma\ {\vdash}\ \mathsf{some}\ {d_1} \ {d_2}$ . And also $d_k\leq t$ for some $k\in \{1,2\}$ , by induction.

We first consider case (a) in the definition of $[\![ r ]\!]$ : there are i and j so that $c_i\leq (r \ \mathsf{all} \ {d_j})$ . From $\Gamma\ {\vdash}\ {\mathsf{some} \ d_1 \ d_2}$ and $d_k\leq t$ , using (darii), $({\scriptstyle \mathrm{SOME}}_1)$ , and $({\scriptstyle \mathrm{SOME}}_2)$ , $\Gamma\ {\vdash}\ \mathsf{some} \ {d_j} \ t$ . By (sa), $(r \ \mathsf{all} \ {d_j})\leq (\textit{r} \ \mathsf{some} \ t)$ , so by $({\scriptstyle \mathrm{BARBARA}})$ , $c_i\leq ({\textit{r} \ \mathsf{some} \ t})$ .

In case (b), $Q' = \exists $ , $d_1 = d_2$ , and for some i, $c_i\leq (\textit{r} \ \mathsf{some } \ {d_1})$ . By (mono), $(\textit{r} \ \mathsf{some} \ {d_1})\leq ({\textit{r} \ \mathsf{some} \ t})$ , so $c_i\leq ({\textit{r} \ \mathsf{some} \ t})$ . □

Lemma 3.4. Suppose that ${\mathbb {T}}$ contains all subterms of sentences in $\Gamma $ and $\Gamma ^*\supseteq \Gamma $ is a theory which determines existentials for ${\mathbb {T}}$ . Then ${\cal M}(\Gamma ^*,{\mathbb {T}})\models \Gamma $ .

Proof. For a sentence $({\mathsf{all } \ x \ y})\in \Gamma $ , we have $\Gamma ^*\ {\vdash }\ {\mathsf{all } \ x \ y}$ . We are going to apply Lemma 3.3 to ${\cal M}(\Gamma ^*,{\mathbb {T}})$ (that is, the $\leq $ symbol here is for provability in $\Gamma ^*$ ). If ${\langle {c_1,c_2,Q} \rangle }\in [\![ x ]\!]$ in ${\cal M}(\Gamma ^*,{\mathbb {T}})$ , then $c_i\leq x$ for some $i\in \{1,2\}$ . But then also $c_i\leq y$ , so ${\langle {c_1,c_2,Q} \rangle }\in [\![ y ]\!]$ .

For a sentence $({\mathsf{some} \ x \ y})\in \Gamma $ , we have $\Gamma ^*\ {\vdash }\ {\mathsf{some} \ x \ y}$ , so ${\langle {x,y,\forall } \rangle }\in M$ . And ${\langle {x,y,\forall } \rangle }\in [\![ x ]\!]\cap [\![ y ]\!]$ in ${\cal M}(\Gamma ^*,{\mathbb {T}})$ . □

Theorem 3.5. $\Gamma \models {\varphi} $ iff $\Gamma\ {\vdash} {\varphi} $ .

Proof. The soundness is easy, with soundness of $({\scriptstyle \mathrm{CASES}}_2)$ following just as in the proof of Lemma 2.16.

The argument for completeness is the same as we saw in the proof of Theorem 2.19. Let ${\mathbb {T}}$ be the set of all subterms of sentences in $\Gamma \cup \{ {\varphi} \}$ . We assume that $\Gamma \not {\vdash } {\varphi} $ and show that $\Gamma \not \models {\varphi} $ . We may pass to a maximal extension $\Gamma ^*\supseteq \Gamma $ with the property that $\Gamma ^*\not {\vdash }\ {\varphi} $ . It follows from $({{\scriptstyle \mathrm{CASES}}})$ and $({\scriptstyle \mathrm{CASES}}_2)$ that $\Gamma ^*$ determines existentials for ${\mathbb {T}}$ , just as in the proof of Theorem 2.19. By Lemma 3.4, ${\cal M}(\Gamma ^*,{\mathbb {T}})\models \Gamma $ . We claim that $ {\varphi} $ is false in this model.

Case 1: $ {\varphi} $ is $({\mathsf{all } \ x \ y})$ . Since $\Gamma ^*$ determines existentials for ${\mathbb {T}}$ and $\Gamma ^*\not {\vdash }\ {\mathsf{all } \ x \ y}$ , we have $\Gamma ^*{\vdash }\ {\mathsf{some} \ x \ x}$ . So ${\langle {x,x,\forall } \rangle }\in M$ . By Lemma 3.4, ${\langle {x,x,\forall } \rangle }\in [\![ x ]\!]\setminus [\![ y ]\!]$ , since $x\leq x$ but $x\not \leq y$ .

Case 2: $ {\varphi} $ is $({\mathsf{some} \ x \ y})$ . Suppose towards a contradiction that ${\cal M}(\Gamma ^*,{\mathbb {T}})\models {\varphi} $ . Specifically, let ${\langle {d_1,d_2,Q} \rangle }\in [\![ x ]\!] \cap [\![ y ]\!]$ . Then $\Gamma ^*\ {\vdash }\ {\mathsf{some} \ d_1 \ d_2}$ , and also there are i and j such that $d_i\leq x$ and $d_j\leq y$ . Using $({\scriptstyle \mathrm{SOME}}_1)$ , $({\scriptstyle \mathrm{SOME}}_2)$ , and (darii), $\Gamma ^*\ {\vdash } {\mathsf{some} \ x \ y}$ . □

The proof shows that if $\Gamma \not {\vdash }\ {\varphi} $ , then there is a countermodel of size $O(n^2)$ , where n is the complexity of $\Gamma \cup \{ {\varphi} \}$ . Since ${\cal M}\models \Gamma $ and ${\cal M} \not \models {\varphi} $ can be checked in time polynomial in the size of ${\cal M}$ and the complexity of $\Gamma \cup \{ {\varphi} \}$ , this shows that the consequence relation for ${\cal L}_{3.5}$ is in ${{\rm C}{\scriptstyle \mathrm{O-}}}{\rm NPT}{\scriptstyle \mathrm{IME}}$ .

3.3 Completeness for ${\cal L}_{3}$

In ${\cal L}_{3}$ , we do not have the sentence former $({\mathsf{some} \ x \ y})$ , which was used in the previous section to formulate the condition that $\Gamma $ determines existentials. Nevertheless, we are able to follow the same strategy as for ${\cal L}_{3.5}$ to prove a completeness theorem. The key observation is that for a term x and a nonempty model ${\cal M}$ , $[\![ x ]\!]\neq \emptyset $ in ${\cal M}$ if and only if ${\cal M} \models \mathsf{all} \ (r \ \mathsf{all} \ x) \ (\textit{r} \ \mathsf{some} \ x)$ for every verb r. We use the collection of all sentences of this form as a replacement for $({\mathsf{some} \ x \ x})$ .

Figure 5 gives our proof system for this logic. The soundness of (mono) is immediate. For (mix), note that if ${{\cal N}}\models \mathsf{all} \ (r \ \mathsf{all} \ y) \ (r\ \mathsf{some} \ y)$ , then either $N = \emptyset $ , or $[\![ y ]\!] \neq \emptyset $ . If $N=\emptyset $ , the conclusion of the rule holds. And if $[\![ y ]\!] \neq \emptyset $ and ${{\cal N}}\models \mathsf{all} \ y \ (\textit{r} \ \mathsf{some} \ x)$ , then $[\![ x ]\!] \neq \emptyset $ also. Again, the conclusion of the rule follows. The soundness of the ( cases ) variants follow as in Lemma 2.16, using the above observation about sentences of the form $\mathsf{all} \ (r \ \mathsf{all} \ y) \ (\textit{r} \ \mathsf{some} \ y)$ .

Fig. 5 The logic of §3.3. We also use (ax), $({\scriptstyle \mathrm{BARBARA}})$ , $({\scriptstyle \mathrm{ANTI}})$ from Figure 2.

Note that in (mix), the verb s appearing in the conclusion may be different than the verb r appearing in the premises. And in ${\scriptstyle \mathrm{CASES}}'$ , the withdrawn premises are $(\mathsf{all} \ (r \ \mathsf{all} \ x) \ (\textit{r} \ \mathsf{some} \ x))$ and $(\mathsf{all} \ y \ (s \ \mathsf{ all} \ x))$ , where the verbs r and s may be different.

As usual, we write ${\vdash }$ for provability in this system, and given a theory $\Gamma $ , we write $x\leq y$ when $\Gamma\ {\vdash}\ {\mathsf{all } \ x \ y}$ .

Definition 3.6. Let $\Gamma $ be a theory and ${\mathbb {T}}$ a set of terms. A term $x\in {\mathbb {T}}$ is effectively nonempty (for $\Gamma $ ) if for all verbs r, $\Gamma\ {\vdash}\ \mathsf{all} \ (r \ \mathsf{all} \ x) \ (\textit{r} \ \mathsf{some} \ x)$ . And x is effectively empty (for $\Gamma $ ) if for all $y\in {\mathbb {T}}$ , $\Gamma\ {\vdash}\ {\mathsf{all } \ x \ y}$ and $\Gamma\ {\vdash}\ \mathsf{all} \ y \ (r \ \mathsf{all} \ x)$ for all verbs r.

$\Gamma $ effectively determines existentials for ${\mathbb {T}}$ if every $x\in {\mathbb {T}}$ is either effectively empty or effectively nonempty for $\Gamma $ .

The canonical model

Let $\Gamma $ be a theory, and let ${\mathbb {T}}$ be a set of terms. Define

$$ \begin{align*} M = \{{\langle {x,Q} \rangle} \in {\mathbb{T}}\times \{\forall,\exists \} :x\ \text{is effectively nonempty for } \Gamma \}. \end{align*} $$

We define a model ${\cal M}(\Gamma ,{\mathbb {T}})$ with domain M by setting

$$ \begin{align*} {\langle {x,Q} \rangle}\in [\![ p ]\!] & {\quad \text{iff}\quad} x\leq p \\ {\langle {x, Q} \rangle} [\![ r ]\!] {\langle {y,Q'} \rangle} & {\quad \text{iff}\quad} \text{either (a) } x\leq (r \ \mathsf{all} \ y), \\ & \qquad \quad \text{or (b) } Q' = \exists, \text{ and } x\leq (\textit{r} \ \mathsf{some}\ y). \end{align*} $$

Lemma 3.7 (Truth Lemma). Assume that $\Gamma $ effectively determines existentials for ${\mathbb {T}}$ . Then for all $x\in {\mathbb {T}}$ ,

$$ \begin{align*} [\![ x ]\!] = \{{\langle {y,Q} \rangle}\in M :y \leq x\}. \end{align*} $$

Proof. By induction on x. When x is a noun, this follows immediately from the definition.

Here is the induction step for $({r \ \mathsf{all} \ x})$ . Suppose that ${\langle {y,Q} \rangle }\in {[\![ {{r \ \mathsf{all} \ x}} ]\!]}$ . If x is effectively empty, then ${y} \leq ({r \ \mathsf{all} \ x})$ , and so we are done. If x is effectively nonempty, then ${\langle {x,\forall } \rangle }\in M$ . Since $x\leq x$ , by induction ${\langle {x,\forall } \rangle }\in {[\![ {X} ]\!]}$ , and ${\langle {y,Q} \rangle }[\![ r ]\!]{\langle {x,\forall } \rangle }$ . Then case (a) holds and again $y\leq ({r \ \mathsf{all} \ x})$ .

In the other direction, suppose that $y\leq ( {r \ \mathsf{all} \ x})$ . We show that for any ${\langle {y,Q} \rangle }\in M$ , ${\langle {y,Q} \rangle }[\![ r ]\!]{\langle {z,Q'} \rangle }$ for all ${\langle {z,Q'} \rangle }\in [\![ x ]\!]$ . By induction, $z \leq x$ . Using $({\scriptstyle \mathrm{ANTI}})$ , $ ( {r \ \mathsf{all} \ x}) \leq ({r \ \mathsf{all} \ z})$ . Thus, $y\leq ( {r \ \mathsf{all} \ z})$ , and so indeed ${\langle {y,Q} \rangle }[\![ r ]\!]{\langle {z,Q'} \rangle }$ .

Finally, we have the induction step for $({\textit{r} \ \mathsf{some}\ x})$ . Suppose that ${\langle {y,Q} \rangle }\in [\![ {{\text {r}\ \mathsf{some}\ x}} ]\!]$ . Then there is some ${\langle {z,Q'} \rangle }\in [\![ x ]\!]$ such that ${\langle {y,Q} \rangle }[\![ r ]\!]{\langle {z,Q'} \rangle }$ . So z is effectively nonempty, and by induction $z\leq x$ .

Case 1: $Q'=\exists $ . Then $y\leq ({\textit{r} \ \mathsf{some}\ z})$ . By (mono), $({r\ \mathsf{some}\ z})\leq ({r\ \mathsf{some} \ x})$ , so also $y\leq ({r\ \mathsf{some}\ x})$ , as was to be shown.

Case 2: $Q'=\forall $ . Then $y\leq ({r \ \mathsf{all} \ x})$ . By $({\scriptstyle \mathrm{ANTI}})$ , $({r \ \mathsf{all} \ x})\leq ({r \ \mathsf{all} \ z})$ , so $y\leq ({r \ \mathsf{all} \ z})$ . The fact that z is effectively nonempty means that $({r \ \mathsf{all} \ z})\leq ({\textit{r} \ \mathsf{some} \ z})$ . And by (mono), as observed above, $({\textit{r} \ \mathsf{some} \ z})\leq ({r\ \mathsf{some} \ x})$ . So again we have $y\leq ({\textit{r} \ \mathsf{some} \ x})$ .

In the other direction, let $y\in {\mathbb {T}}$ be such that $y\leq ({\textit{r} \ \mathsf{some}\ x})$ . Suppose that ${\langle {y,Q} \rangle }\in M$ . Then y is effectively nonempty. In particular, $({r \ \mathsf{all} \ y})\leq ({\textit{r} \ \mathsf{some} \ y})$ , and by (mix), for every verb s, $(s \ \mathsf{ all} \ x)\leq ({s\ \mathsf{some}\ x})$ , so x is effectively nonempty. Then ${\langle {x,\exists } \rangle }\in M$ . By case (b), ${\langle {y,Q} \rangle }[\![ r ]\!]{\langle {x,\exists } \rangle }$ , and by induction ${\langle {x,\exists } \rangle }\in [\![ x ]\!]$ , so ${\langle {y,Q} \rangle }\in [\![ {{\textit{r} \ \mathsf{some} \ x}} ]\!]$ . □

Lemma 3.8. Suppose ${\mathbb {T}}$ contains all subterms of sentences in $\Gamma $ and $\Gamma ^*\supseteq \Gamma $ is a theory which effectively determines existentials for ${\mathbb {T}}$ . Then ${\cal M}(\Gamma ^*,{\mathbb {T}})\models \Gamma $ .

Proof. For a sentence $({\mathsf{all } \ x \ y})\in \Gamma $ , we have $\Gamma ^*\ {\vdash } {\mathsf{all } \ x \ y}$ , so $x\leq y$ . If ${\langle {z,Q} \rangle }\in [\![ x ]\!]$ in ${\cal M}(\Gamma ^*,{\mathbb {T}})$ , then $z\leq x$ . But then also $z\leq y$ , so ${\langle {z,Q} \rangle }\in [\![ y ]\!]$ . □

Theorem 3.9. $\Gamma \models {\varphi} $ iff $\Gamma\ {\vdash} {\varphi} $ .

Proof. We discussed the soundness when we introduced the rules.

For completeness, let ${\mathbb {T}}$ be the set of all subterms of sentences in $\Gamma \cup \{ {\varphi} \}$ . We assume that $\Gamma \not {\vdash }\ {\varphi} $ and show that $\Gamma \not \models {\varphi} $ . We may pass to a maximal extension $\Gamma ^*\supseteq \Gamma $ with the property that $\Gamma ^*\not {\vdash }\ {\varphi} $ . It follows from $({\scriptstyle \mathrm{CASES}}')$ and $({\scriptstyle \mathrm{CASES}}_{2}')$ that $\Gamma ^*$ effectively determines existentials for ${\mathbb {T}}$ , just as in the proof of Theorem 2.19. By Lemma 3.8, ${\cal M}(\Gamma ^*,{\mathbb {T}})\models \Gamma $ .

We claim that $ {\varphi} $ is false in this model. Write $ {\varphi} $ as $({\mathsf{all } \ x \ y})$ . Since $\Gamma ^*$ effectively determines existentials for ${\mathbb {T}}$ and $\Gamma ^*\not {\vdash }\ {\mathsf{all } \ x \ y}$ , x is effectively nonempty for $\Gamma ^*$ . So ${\langle {x,\forall } \rangle }\in M$ . By Lemma 3.7, ${\langle {x,\forall } \rangle }\in [\![ x ]\!]\setminus [\![ y ]\!]$ , since $x\leq x$ but $x\not \leq y$ . □

The proof shows that if $\Gamma \not {\vdash }\ {\varphi} $ , then there is a countermodel of size $O(n)$ , where n is the complexity of $\Gamma \cup \{ {\varphi} \}$ .

4 ${\cal L}_{4}$ and ${\cal L}_{4.5}$ : Adding term complementation to ${{\cal L}_{1}}$ and ${\cal L}_{2}$

This section extends ${{\cal L}_{1}}$ and ${\cal L}_{2}$ by adding term complements. That is, we extend the syntax of ${\cal L}$ so that whenever t is a term, ${\overline {t}}$ is a term, and we extend the semantics so that in a model ${\cal M}$ , ${[\![ {{\overline {t}}} ]\!]} = {\cal M}\setminus [\![ t ]\!]$ . If we add term complements to ${{\cal L}_{1}}$ , we get ${\cal L}_{4}$ , and if we add term complements to ${\cal L}_{2}$ , we get ${\cal L}_{4.5}$ .

4.1 Co-NPTime hardness of the consequence relation of ${\cal L}_{4}$

We begin with a negative complexity-theoretic result, reducing $3$ -SAT to the relation $\Gamma \not \models \varphi $ in ${\cal L}_{4}$ .

Let $BV = \{P_i : i \in N\}$ be a set of boolean variables. Suppose we have an instance of $3$ -SAT, $c_1\land \dots \land c_k$ , where each clause $c_i$ has the form $U_i \lor V_i\lor W_i$ , where $U_i$ , $V_i$ and $W_i$ are literals: variables in $BV$ or their negations.

Then we consider the language with nouns

$$ \begin{align*} \{p : P\in BV\} \cup \{q\} \cup \{y_i,z_i: 1\leq i\leq k\} \end{align*} $$

and verbs $\{r_i: 1\leq i\leq k\}$ . Notice that we write p for the noun corresponding to the variable P. We will also write u for the literal U, where if U is a variable P, then u is the noun p, and if U is a negated variable $\lnot P$ , then u is the term ${\overline {p}}$ .

For each clause $c_i = U_i\lor V_i\lor W_i$ , we define the following sentences:

$$ \begin{align*} \psi^i_1 &= \mathsf{all} \ \overline{u_i} \ (r_i \ \mathsf{all} \ y_i) \\ \psi^i_2 &= \mathsf{all} \ \overline{v_i} \ (r_i \ \mathsf{all} \ \overline{y_i}) \\ \psi^i_3 &= \mathsf{all} \ (r_i \ \mathsf{all} \ {z_i}) \ w_i. \end{align*} $$

And we define

$$ \begin{align*} \Gamma &= \{ \psi^{i}_1,\psi^{i}_2,\psi^{i}_3: 1\leq i\leq k \} \\ \varphi &= \mathsf{all} \ q \ {\overline{q}}. \end{align*} $$

Lemma 4.1. $\Gamma \not \models \varphi $ if and only if $\Gamma $ has a nonempty model.

Proof. Suppose $\Gamma $ has a model ${\cal M}$ with nonempty domain M. Let ${\cal M}'$ be a model with domain M and the same interpretations of all of the nouns and verbs, except for q, which we interpret as all of M. Since the sentences in $\Gamma $ do not mention q, we still have ${\cal M}' \models \Gamma $ . And for any $x\in M$ , we have $x\in [\![ q ]\!] = M$ and $x\notin {[\![ {{\overline {q}}} ]\!]} = \emptyset $ , so $\Gamma \not \models \varphi $ . Conversely, suppose every model of $\Gamma $ is empty. Then in any model ${\cal M}$ of $\Gamma $ , we have $[\![ q ]\!] = \emptyset \subseteq {[\![ {{\overline {q}}} ]\!]} = \emptyset $ , so $\Gamma \models \varphi $ . □

Theorem 4.2. $\Gamma \not \models \varphi $ if and only if $c_1\land \dots \land c_k$ is satisfiable.

Proof. Suppose $\Gamma \not \models \varphi $ . By Lemma 4.1, $\Gamma $ has a model ${\cal M}$ with nonempty domain M. Let $x\in M$ . Define a truth assignment f for the proposition letters, by

$$ \begin{align*} f(P) = \begin{cases} {\sf T} &\text{if }x\in [\![ p ]\!]\\ {\sf F} &\text{if }x\notin [\![ p ]\!].\end{cases} \end{align*} $$

The truth assignment extends to literals in the natural way: $f(\lnot P) = {\sf T}$ if $f(P) = {\sf F}$ , and $f(\lnot P) = {\sf F}$ if $f(P) = {\sf T}$ . Note that if U is a literal with corresponding term u, then we have $x\in [\![ u ]\!]$ if and only if $f(U) = {\sf T}$ .

We check that each clause $c_i = U_i\lor V_i \lor W_i$ is satisfied. If $f(U_i) = {\sf T}$ or $f(V_i) = {\sf T}$ , then $c_i$ is satisfied. So suppose that $f(U_i) = f(V_i) = {\sf F}$ . Then, since ${[\![ {z_i} ]\!]}\subseteq {[\![ {y_i} ]\!]}\cup {[\![ {{\overline {y_i}}} ]\!]}$ ,

$$ \begin{align*} x\in {[\![ {{\overline{u_i}}} ]\!]} \cap {[\![ {{\overline{v_i}}} ]\!]} \subseteq {[\![ r_i\ \mathsf{all} \ y_i ]\!]} \cap {[\![ r_i \ \mathsf{all} \ \overline{y_i}]\!]}\subseteq {[\![ r_i \ \mathsf{all}\ {z_i}]\!]} \subseteq [\![ w_i ]\!]. \end{align*} $$

So $x\in [\![ w_i ]\!]$ , and $f(w_i) = {\sf T}$ , and $c_i$ is satisfied.

Conversely, suppose f is a truth assignment such that $c_1\land \dots \land c_k$ is satisfied. By Lemma 4.1, it suffices to build a nonempty model of $\Gamma $ . Let ${\cal M} = \{x\}$ , and for each proposition letter P, set

$$ \begin{align*} [\![ p ]\!] = \begin{cases} \{x\} & \text{if }f(P) = {\sf T}\\ \emptyset & \text{if }f(P) = {\sf F}. \end{cases} \end{align*} $$

Note that again, if U is a literal with corresponding term u, we have $x\in [\![ u ]\!]$ if and only if $f(U) = {\sf T}$ .

Consider a clause $c_i = U_i\lor V_i\lor W_i$ . We must define the interpretations of $y_i$ , $z_i$ , and $r_i$ , so that $\psi ^i_1$ , $\psi ^i_2$ , and $\psi ^i_3$ are satisfied for $1\leq i \leq k$ .

Case 1: $x\in [\![ w_i ]\!]$ . Set $[\![ r_i ]\!] = \{(x,x)\}$ . The interpretations of $y_i$ and $z_i$ are irrelevant. Indeed, $\psi ^i_1$ , $\psi ^i_2$ , and $\psi ^i_3$ are satisfied, since ${[\![r_i \ \mathsf{all } \ {y_i}]\!]} = {[\![ r_i \ \mathsf{all }\;{\overline {y_i}} ]\!]} = [\![ w_i ]\!] = M$ .

Case 2: $x\notin [\![ w_i ]\!]$ . Set ${[\![ z_i ]\!]} = \{x\}$ and $[\![ r_i ]\!] = \emptyset $ . Then $\psi ^i_3$ is satisfied, since ${[\![ r_i \ \mathsf{all} \ z_i ]\!]} = {[\![ {w_i} ]\!]} = \emptyset $ . Since the clause $c_i = U_i \lor V_i\lor W_i$ is satisfied, x must be in at least one of $[\![ u_i ]\!]$ or ${[\![ {v_i} ]\!]}$ . If it’s in both, we’re done (and the interpretation of $y_i$ is irrelevant), since ${[\![ {{\overline {u_i}}} ]\!]} = {[\![ {{\overline {v_i}}} ]\!]} = \emptyset $ . Otherwise, set

$$ \begin{align*} {[\![ {y_i} ]\!]} = \begin{cases} \emptyset & \text{if }x\notin [\![ u_i ]\!], x\in {[\![ {v_i} ]\!]}\\ \{x\} & \text{if }x\in [\![ u_i ]\!], x\notin {[\![ v_i ]\!]}.\end{cases} \end{align*} $$

In the first case, ${\cal M}\models \psi ^{i}_1$ , since $[\![ r_{i}\ \mathsf{all} \ y_i]\!] = M$ , and ${\cal M} \models \psi ^{i}_2$ , since $[\![ \overline {v_i}]\!] = \emptyset $ . In the second case, ${\cal M} \models \psi ^{i}_2$ , since $[\![\overline {u_i}]\!] = \emptyset $ , and ${\cal M} \models \psi ^{i}_2$ , since $[\![ r_i \ \mathsf{all} \ \overline {y_i}]\!] = M$ .□

4.2 Completeness for the extended languages ${\cal L}_{4}^+$ and ${\cal L}_{4.5}^+$

We enlarge our syntax of ${\cal L}_{4.5}$ from sentences $({\mathsf{all } \ x \ y})$ and $({\mathsf{some} \ x \ y})$ to expressions of the form $[x_1, \dots , x_n]$ and ${\langle {x_1,\dots ,x_n} \rangle }$ for $n\geq 1$ . Note that this is a departure from the languages we have studied previously, since here we have infinitely many sentence formers, of arbitrary finite length. We emphasize that $[x_1,\ldots , x_n]$ and ${\langle {x_1,\dots ,x_n} \rangle }$ are sentences, not terms. The terms of ${\cal L}_{4.5}^+$ are the same as the terms of ${\cal L}_{4.5}$ .

The semantics of this new language ${\cal L}_{4.5}^+$ is:

$$ \begin{align*} {\cal M}\models [x_1, \ldots, x_n] &{\quad \text{iff}\quad} \bigcap_{i=1}^n [\![ x_i ]\!] = \emptyset\\ {\cal M}\models{\langle {x_1,\ldots, x_n} \rangle} & {\quad \text{iff}\quad}\bigcap_{i=1}^n [\![ x_i ]\!] \neq \emptyset. \end{align*} $$

It is clear that ${\langle {x_1,\dots ,x_n} \rangle }$ generalizes $({\mathsf{some} \ x \ y})$ , since the latter sentence can be translated into ${\cal L}_{4.5}^+$ as ${\langle {x,y} \rangle }$ . To see that $[x_1,\dots ,x_n]$ generalizes $({\mathsf{all } \ x \ y})$ , note that $\bigcap _{i=1}^n[\![ x_i ]\!] = \emptyset $ if and only if for some j (equivalently, for all j), $\bigcap _{i\neq j}[\![ x_i ]\!]\subseteq [\![ \overline {x_j} ]\!]$ . So the sentence $({\mathsf{all } \ x \ y})$ can be translated into ${\cal L}_{4.5}^+$ as $[x,{\overline {y}}]$ .

Our proof rules are shown in Figure 6. $({\scriptstyle \mathrm{AX}})$ is a version of the axiom rule as we have seen it throughout the paper. (res) is named for resolution. (But please note that the sentence $[x_1, \ldots , x_n]$ is not interpreted disjunctively as in resolution.) The name $({{\scriptstyle \mathrm{REL}}})$ stands for relational, since it is the only rule of the system that mentions relations. We name (str) after structural rules of sequent calculi. The side condition on this rule is that each $x_i$ appears in the list $y_1,\ldots , y_n$ . It implies the usual rules of weakening, contraction, and exchange. (efq) and (raa) are our formulations of ex falso quodlibet and reductio ad absurdum. In this system, $[y_1,\ldots , y_n]$ and ${\langle {y_1,\ldots , y_n} \rangle }$ are contradictories. Given two derivations with contradictory conclusions, one may use (efq) to put these two together and conclude any sentence of the form $[x_1,\dots ,x_n]$ . Alternatively, one may use (raa) to withdraw all occurrences of any one assumption of the form $[x_1,\dots ,x_n]$ , and conclude the contradictory of that assumption, ${\langle {x_1,\dots ,x_n} \rangle }$ . Note the asymmetry between $[x_1,\dots ,x_n]$ and ${\langle {x_1,\dots ,x_n} \rangle }$ ; this is arranged so as to allow an easy proof-theoretic argument (Corollary 4.12) that $({\scriptstyle \mathrm{AX}})$ , (res), (rel), and (str) give a sound and complete proof system for the smaller language ${\cal L}_{4}^+$ , which only has sentences of the form $[x_1,\dots ,x_n]$ .

Fig. 6 Rules of the logical system for ${\cal L}_{4.5}^+$ . The side condition on the structural rule (str) is that each $x_i$ must be identical to some $y_j$ .

Example 4.3. Here is how the translations of (barbara) and $({{\scriptstyle \mathrm{SOME}}_1})$ are derived in this system:

And here are (darii) and $({\scriptstyle \mathrm{ANTI}})$ :

Lemma 4.4 (Soundness). If $\Gamma\ {\vdash}\ {\varphi} $ , then $\Gamma \models {\varphi} $ .

Proof. The soundness of $({\scriptstyle \mathrm{AX}})$ and (str) are clear. For (res), (rel), and (efq), fix a model ${\cal M}$ .

For (res), assume that $[\![ x ]\!]\cap (\bigcap _{i\leq n} {[\![ {y_i} ]\!]}) = \emptyset $ and $[\![ {{\overline {x}}} ]\!]\cap (\bigcap _{j\leq m}{[\![ {z_j} ]\!]}) = \emptyset $ . Suppose there is some $\ell \in (\bigcap _{i\leq n} {[\![ {y_i} ]\!]})\cap (\bigcap _{j\leq m}{[\![ {z_j} ]\!]})$ . Then either $\ell \in [\![ x ]\!]$ or $\ell \in [\![ {{\overline {x}}} ]\!]$ , which is a contradiction in either case.

For (rel), assume that $(\bigcap _{i\lt n} [\![\overline {x_i} ]\!])\cap {[\![ {x_n} ]\!]} = \emptyset $ , so $\bigcap _{i\lt n} {[\![ \overline {x_i} ]\!]} \subseteq {[\![ \overline {x_n} ]\!]}$ . Suppose there is some $\ell \in (\bigcap _{i\lt n} [\![ r \ \mathsf{all} \ {x_i} ]\!]) \cap [\![ \overline{ {r} \ \mathsf{all} \ {x_n}}]\!]$ . Then there is some $m\in {[\![ {x_n} ]\!]}$ such that it is not true that $\ell [\![ r ]\!] m$ . For all $i<n$ , since $\ell \in [\![ r \ \mathsf{all} \ {x_i}]\!]$ , $m\in [\![\overline {x_i}]\!]$ . But then $m\in [\![\overline {x_n}]\!]$ , and this is a contradiction.

For (efq), it is vacuously true that if ${\cal M} \models {\langle {y_1,\dots ,y_m} \rangle }$ and ${\cal M} \models [y_1,\dots ,y_m]$ , then ${\cal M} \models [x_1,\dots ,x_n]$ , since ${\langle {y_1,\dots ,y_m} \rangle }$ and $[y_1,\dots ,y_m]$ are contradictory.

The soundness of (raa) is by induction on the number of instances of (raa) in the proof tree. Assume that the last use of (raa) is at the root of the proof tree showing $\Gamma\ {\vdash}\ {\langle {x_1,\dots ,x_n} \rangle }$ . Then we have $\Gamma \cup \{[x_1,\dots ,x_n]\}\ {\vdash }\ {\langle {y_1,\dots ,y_m} \rangle }$ and $\Gamma \cup \{[x_1,\dots ,x_n]\}\ {\vdash }\ [y_1,\dots ,y_m]$ . By induction, these deductions are sound, so every model of $\Gamma \cup \{[x_1,\dots ,x_n]\}$ satisfies both ${\langle {y_1,\dots ,y_m} \rangle }$ and $[y_1,\dots ,y_m]$ . But these sentences are contradictory, so $\Gamma \cup \{[x_1,\dots ,x_n]\}$ has no models. In other words, every model of $\Gamma $ satisfies ${\langle {x_1,\dots ,x_n} \rangle }$ , as was to be shown. □

Definition 4.5. Let $\Gamma $ be a theory, and let ${\mathcal{S}}$ be a set of terms.

1. $\Gamma $ is inconsistent if it proves both $[x_1,\dots ,x_n]$ and ${\langle {x_1,\dots ,x_n} \rangle }$ for some list of terms $x_1,\dots ,x_n$ . Otherwise, $\Gamma $ is consistent .
2. ${\mathcal{S}}$ is $\Gamma $ -inconsistent if there is a list of terms $x_1,\dots ,x_n$ from ${\mathcal{S}}$ such that $\Gamma\ {\vdash}\ [x_1, \ldots , x_n]$ . Otherwise, ${\mathcal{S}}$ is $\Gamma $ -consistent.

Lemma 4.6. Let ${\mathcal{S}}$ be $\Gamma $ -consistent. Then for all x, either ${\mathcal{S}}\cup \{x\}$ or ${\mathcal{S}}\cup \{{\overline {x}}\}$ is $\Gamma $ -consistent.

Proof. Suppose not. Then by (str), there are $y_1,\ldots , y_n\in {\mathcal{S}}$ such that $\Gamma\ {\vdash}\ [x,y_1,\ldots , y_n]$ , and there are $z_1,\ldots , z_m\in {\mathcal{S}}$ such that $\Gamma\ {\vdash}\ [{\overline {x}},z_1,\ldots , z_m]$ . By (res), $\Gamma\ {\vdash}\ [y_1,\ldots , y_n,z_1,\ldots , z_m]$ . This contradicts the $\Gamma $ -consistency of ${\mathcal{S}}$ . □

Lemma 4.7. Let ${\mathcal{S}}$ be a maximal $\Gamma $ -consistent set of terms. Then for all x, exactly one of x or ${\overline {x}}$ belongs to ${\mathcal{S}}$ .

Proof. By Lemma 4.6 and maximality, either x or ${\overline {x}}$ belongs to ${\mathcal{S}}$ . Both cannot belong to ${\mathcal{S}}$ , since this would contradict $\Gamma $ -consistency, due to $({\scriptstyle \mathrm{AX}})$ . □

Lemma 4.8. Let ${\mathcal{S}}$ be maximal $\Gamma $ -consistent, and suppose that $({r \ \mathsf{all} \ x})\notin {\mathcal{S}}$ . Then there is some maximal $\Gamma $ -consistent ${\cal T}$ such that $x\in {\cal T}$ ; and whenever $({r \ \mathsf{all} \ y})\in {\mathcal{S}}$ , we have ${\overline {y}}\in {\cal T}$ .

Proof. Let ${\cal T}_0 = \{x\}\cup \{\overline {y} : (r \ \mathsf{all} \ y)\in \mathcal{S}\}$ . If ${\cal T}_0$ is $\Gamma $ -inconsistent, then by (str) there are $y_1, \ldots , y_n$ such that $(r \ \mathsf{all} \ {y_i})\in \mathcal{S}$ for all i and $\Gamma\ {\vdash}\ [{\overline {y_1}}, \ldots , {\overline {y_n}},x]$ . By (rel),

$$ \begin{align*} \Gamma\ {\vdash}\ [r \ \mathsf{all} \ y_1,\dots,r \ \mathsf{all} \ y_n, {\overline{{{r \ \mathsf{all} \ x}}}}]. \end{align*} $$

Since ${\mathcal{S}}$ contains each term $(r \ \mathsf{all} \ y_i)$ , by $\Gamma $ -consistency it does not contain $({\overline {{{r \ \mathsf{all} \ x}}}})$ . So by Lemma 4.7, $\Gamma $ contains $({r \ \mathsf{all} \ x})$ . This is a contradiction.

So ${\cal T}_0$ is $\Gamma $ -consistent. By Zorn’s Lemma, ${\cal T}_0$ has a maximal $\Gamma $ -consistent extension, say ${\cal T}$ .□

The canonical model of $\Gamma $

Let M be the set of all maximal $\Gamma $ -consistent sets of terms. We define a model $ {\cal M}(\Gamma )$ with domain M by defining:

$$ \begin{align*} {\mathcal{S}}\in [\![ p ]\!] &{\quad \text{iff}\quad} p\in {\mathcal{S}}\\ \mathcal{S}[\![r]\!]{\cal T} &{\quad \text{iff}\quad} \text{for some } z\in {\cal T}, ({r \ \mathsf{all} \ z}) \in {\mathcal{S}}. \end{align*} $$

Lemma 4.9 (Truth Lemma). In ${\cal M}(\Gamma )$ , for any term x, $[\![ x ]\!] = \{\mathcal{S} \in M : x \in \mathcal{S}\}$ .

Proof. By induction on x. When x is a noun, this is by the definition of the model.

Induction step for $({r \ \mathsf{all} \ x})$ : Suppose $({r \ \mathsf{all} \ x})\in {\mathcal{S}}$ , and suppose ${\cal T}\in [\![ x ]\!]$ . By induction $x\in {\cal T}$ , so by the definition of the model, ${\mathcal{S}}[\![ r ]\!]{\cal T}$ . Thus ${\mathcal{S}}\in {[\![ {{r \ \mathsf{all} \ x}} ]\!]}$ . Conversely, suppose ${\mathcal{S}}\in {[\![ {{r \ \mathsf{all} \ x}} ]\!]}$ , and assume for contradiction that $({r \ \mathsf{all} \ x})\notin {\mathcal{S}}$ . By Lemma 4.8, there is some ${\cal T}\in M$ such that $x\in {\cal T}$ , and whenever $({r \ \mathsf{all} \ y})\in {\mathcal{S}}$ , we have ${\overline {y}}\in {\cal T}$ , so $y\notin {\cal T}$ by Lemma 4.7. By induction, ${\cal T}\in [\![ x ]\!]$ , but it is not the case that ${\mathcal{S}}[\![ r ]\!]{\cal T}$ , which is a contradiction.

Induction step for ${\overline {x}}$ : For any ${\mathcal{S}}\in M$ , we have ${\mathcal{S}}\in [\![ {{\overline {x}}} ]\!]$ if and only if ${\mathcal{S}}\notin [\![ x ]\!]$ . By induction, this is equivalent to $x\notin {\mathcal{S}}$ . And by Lemma 4.7, $x\notin {\mathcal{S}}$ if and only if ${\overline {x}}\in {\mathcal{S}}$ . □

Lemma 4.10. If $\Gamma $ is consistent, then ${\cal M}(\Gamma )\models \Gamma $ .

Proof. Let $\varphi \in \Gamma $ . First, suppose $\varphi = [x_1,\ldots , x_n]$ . Suppose for contradiction that there exists ${\mathcal{S}}\in \bigcap _i[\![ x_i ]\!]$ . Then by the Truth Lemma, $x_i\in {\mathcal{S}}$ for all i, contradicting $\Gamma $ -consistency of ${\mathcal{S}}$ .

Now suppose $\varphi = {\langle {x_1,\dots ,x_n} \rangle }$ . We claim that the set $\{x_1,\ldots , x_n\}$ is $\Gamma $ -consistent. If not, then using $({{\scriptstyle \mathrm{STR}}})$ , $\Gamma\ {\vdash}\ [x_1,\ldots , x_n]$ . So $\Gamma $ is inconsistent, contradicting our assumption.

Since $\{x_1,\ldots , x_n\}$ is $\Gamma $ -consistent, we can extend it to a maximal $\Gamma $ -consistent set ${\mathcal{S}}$ , and by the Truth Lemma, ${\mathcal{S}}\in \bigcap _{i=1}^n [\![ x_i ]\!]$ . □

Theorem 4.11 (Completeness). For any sentence $\varphi $ , if $\Gamma \models \varphi $ , then $\Gamma\ {\vdash}\ \varphi $ .

Proof. Suppose $\varphi = [x_1,\dots ,x_n]$ . If $\Gamma $ is inconsistent, then by (efq), $\Gamma\ {\vdash}\ \varphi $ , and we are done. So we may assume that $\Gamma $ is consistent. Assume for contradiction that $\Gamma \not {\vdash }\ [x_1, \ldots , x_n]$ . Consider the canonical model ${\cal M}(\Gamma )$ . By Lemma 4.10, ${\cal M}(\Gamma ) \models \Gamma $ , so ${\cal M}(\Gamma )\models {\varphi} $ . By (str), the set $\{x_1,\ldots , x_n\}$ is $\Gamma $ -consistent. So we can extend it to a maximal $\Gamma $ -consistent set ${\mathcal{S}}$ . By the Truth Lemma, ${\mathcal{S}}\in \bigcap _{i=1}^n [\![ x_i ]\!]$ , so in ${\cal M}(\Gamma )\not \models [x_1,\dots , x_n]$ , which is a contradiction.

Now suppose $\varphi = {\langle {x_1,\dots ,x_n} \rangle }$ . If $\Gamma \models \varphi $ , then $\Gamma \cup \{[x_1,\dots ,x_n]\}$ has no models, so by Lemma 4.10, $\Gamma \cup \{[x_1,\dots ,x_n]\}$ is inconsistent. This means that $\Gamma \cup \{[x_1,\dots ,x_n]\}$ proves both ${\langle {y_1,\dots ,y_m} \rangle }$ and $[y_1,\dots ,y_m]$ , so by (raa), $\Gamma\ {\vdash}\ {\langle {x_1,\dots ,x_n} \rangle }$ , as was to be shown. □

We have just seen the completeness theorem for ${\cal L}_{4.5}^+$ . Let ${\cal L}_{4}^+$ be the generalization of ${\cal L}_{4}$ obtained by adding sentences of the form $[x_1,\dots ,x_n]$ (but not ${\langle {x_1,\dots ,x_n} \rangle }$ ). Then we can restrict our logical system for ${\cal L}_{4.5}^+$ to all the rules except for (efq) and (raa). These rules are still sound for ${\cal L}_{4}^+$ , and we will show that they are complete as well.

Corollary 4.12. Let ${\vdash }_0$ be the proof system consisting of the rules $({\scriptstyle \mathrm{AX}})$ , $({{\scriptstyle \mathrm{STR}}})$ , $({{\scriptstyle \mathrm{RES}}})$ , and $({{\scriptstyle \mathrm{REL}}})$ . Then ${\vdash }_0$ is sound and complete for ${\cal L}_{4}^+$ .

Proof. We have already observed the soundness. So suppose $\Gamma $ is a theory in ${\cal L}_{4}^+$ and $\varphi $ is a sentence in ${\cal L}_{4}^+$ such that $\Gamma \models \varphi $ . Moving up to the larger language ${\cal L}_{4.5}^+$ , we have $\Gamma\ {\vdash}\ \varphi $ by Theorem 4.11. Our goal is to show that $\Gamma\ {\vdash}_0\ {\varphi} $ .

We first claim that if $\Gamma $ is any theory in ${\cal L}_{4}^+$ and $\psi $ is any sentence in ${\cal L}_{4.5}^+$ such that $\Gamma\ {\vdash}\ \psi $ , then (raa) is not used in the proof. The argument is by induction on height of the proof tree. Suppose for contradiction that (raa) is used. We may assume that the root of the proof tree is an application of (raa):

The left subtree is a proof of ${\langle {y_1,\dots ,y_m} \rangle }$ from $\Gamma \cup \{[x_1,\dots ,x_n]\}$ . Since $\Gamma \cup \{[x_1,\dots ,x_n]\}$ is a theory in ${\cal L}_{4}^+$ , by induction (raa) is not used in this proof. But none of the other rules produce consequences of the form ${\langle {y_1,\dots ,y_m} \rangle }$ , so this is a contradiction.

Now it is easy to see that if $\Gamma $ is any theory in ${\cal L}_{4}^+$ , then no ${\vdash }$ proof from $\Gamma $ uses (efq), since none of our rules other than $({{\scriptstyle \mathrm{RAA}}})$ allow us to produce or introduce a premise of the form ${\langle {y_1,\dots ,y_m} \rangle }$ .

Therefore, if $\Gamma\ {\vdash}\ {\varphi} $ , then $\Gamma\ {\vdash}_0\ {\varphi} $ .□

4.3 Open problems concerning ${\cal L}_{4}$ and ${\cal L}_{4.5}$

We began this section with the result that the consequence relation for ${\cal L}_{4}$ is ${{\rm C}{\scriptstyle \mathrm{O-}}}{\rm NPT}{\scriptstyle \mathrm{IME}}$ hard. This implies that, assuming P $\neq $ NP, there is no boundedly complete syllogistic proof system for either ${\cal L}_{4}$ or ${\cal L}_{4.5}$ Instead, we added to the syntax and formulated a proof system which went beyond the “purely syllogistic”; in that it used schematic rules and also (raa). But we did not find proof systems of any kind for the original languages ${\cal L}_{4}$ and ${\cal L}_{4.5}$ . We leave this as an open problem. (There is a result of possible relevance in [Reference Moss5]: a syllogistic system for sentences of the form $(\mathsf{all} \ p \ x)$ , where p is a (complemented) noun, and x is either a (complemented) noun or a term $(r \ \mathsf{all} \ q)$ , where q is a (complemented) noun.) For that matter, we also leave open the question of determining the exact complexities of the consequence relations for ${\cal L}_{4}$ and ${\cal L}_{4.5}$ .

5 ${\cal L}_{5}$ and ${{\cal L}_{5.5}}$ : Putting it all together

The largest logics in this paper are ${\cal L}_{5}$ and ${{\cal L}_{5.5}}$ , as described in Figure 1. We have much less to say about them than about their sublanguages because a notational variant of ${{\cal L}_{5.5}}$ has already been studied. This is the language $\mathcal {R}^{*\dagger }$ in [Reference Pratt-Hartmann and Moss10]. Here is the syntax of this language. We begin with a set ${\mathbf {N}}$ of nouns a set ${\textbf {V}}$ of verb atoms. A verb literal is either a verb r or its complement ${\overline {r}}$ . We define terms and sentences via the syntax below:

$$ \begin{align*} \begin{aligned} \text{terms } x, y, \ldots &\qquad p\in {\mathbf{N}} \mid {\overline{p}} \mid r \ \mathsf{all} \ p \mid \overline{r}\ \mathsf{all} \ p \mid {r \ \mathsf{some} \ p} \mid \overline{r} \ \mathsf{some} \ p \\ \text{sentences }\ {\varphi}, \psi, \ldots &\qquad {\mathsf{all } \ x \ y} \mid {\mathsf{some} \ x \ y}. \end{aligned} \end{align*} $$

Note that we do not have recursion for terms. In the semantics, we interpret complemented verbs using relational complement:

$$ \begin{align*} {[\![ {{\overline{r}}} ]\!]} = (M\times M)\setminus[\![ r ]\!]. \end{align*} $$

The rest of the semantics is clear.

Proposition 5.1. There is a translation map $ {\varphi} \mapsto {\varphi} ^*$ of sentences in $\mathcal {R}^{*\dagger }$ to sentences in ${{\cal L}_{5.5}}$ with the following properties:

1. $\Gamma \models {\varphi} $ iff $\Gamma ^*\models {\varphi} ^*$ , where $\Gamma ^* = \{\psi ^* : \psi \in \Gamma \}$ .
2. $ {\varphi} \mapsto {\varphi} ^*$ is computable in ${{\rm PT}{\scriptstyle \mathrm{IME}}}$ .

Proof. It is sufficient to translate terms x of $\mathcal {R}^{*\dagger }$ . For this, we use

$$ \begin{align*} \begin{array}{lcl} (\overline{r} \ \mathsf{all} \ x)^* & =& \overline{\textit{r} \ \mathsf{some} \ x} \\ (\overline{r} \ \mathsf{some }\ x)^* & = & {\overline{r \ \mathsf{all} \ x.}} \\ \end{array} \end{align*} $$

The point is that the left sides of the equations above have the same interpretations as the right side in every model. □

The translation in the other direction is more complicated due to the complex terms in the languages of this paper. We use the standard technique of flattening.

Proposition 5.2. There is a translation map $(\Gamma , {\varphi} ) \mapsto (\Gamma ^*, {\varphi} ^*)$ from assertions in ${{\cal L}_{5.5}}$ to assertions in $\mathcal {R}^{*\dagger }$ with the following properties:

1. $\Gamma \models {\varphi} $ iff $\Gamma ^*\models {\varphi} ^*$ .
2. If $\Gamma $ is finite, so is $\Gamma ^*$ . In this case, $(\Gamma , {\varphi} ) \mapsto (\Gamma ^*, {\varphi} ^*)$ is computable in ${{\rm PT}{\scriptstyle \mathrm{IME}}}$ .

Proof. Fix $\Gamma $ and $ {\varphi} $ in ${{\cal L}_{5.5}}$ . Let ${\mathbb {T}}$ be the set of all terms in $\Gamma \cup \{ {\varphi} \}$ , including subterms. For each $t\in {\mathbb {T}}$ , let $x_t$ be a new noun. Let $\Delta _1$ be the set of all sentences of $\mathcal {R}^{*\dagger }$ below, for $x\in {\mathbb {T}}$ :

$$ \begin{align*} \begin{array}{l@{\qquad}l} \mathsf{all} \ {x_p} \ p & \mathsf{all} \ p \ x_p \\ \mathsf{all} \ x_{(r \ \mathsf{all} \ t)} \ (r \ \mathsf{all} \ {x_t}) & \mathsf{all} \ (r \ \mathsf{all} \ {x_t}) \ x_{(r \ \mathsf{all} \ t)}\\ \mathsf{all} \ x_{(r \ \mathsf{some }\ t)} \ (r\ \mathsf{some}\ x_t) & \mathsf{all} \ (r \ \mathsf{some} \ x_t) \ x_{(r \ \mathsf{some} \ t)} \\ \mathsf{all} \ x_{{\overline{p}}} \ {{\overline{p}}} & \mathsf{all} \ {{\overline{p}}} \ x_{{\overline{p}}} \\ \mathsf{all} \ x_{\overline{r \ \mathsf{all} \ t}} \ ({{\overline{r}}}\ \mathsf{some} \ x_t) & \mathsf{all} \ ({{\overline{r}}}\ \mathsf{some} \ x_t) \ x_{\overline{r \ \mathsf{all} \ {t}}} \\ \mathsf{all} \ x_{\overline{r \ \mathsf{some} \ t}} \ ({{\overline{r}}} \ \mathsf{all} \ {x_t}) & \mathsf{all} \ ({{\overline{r}}}\ \mathsf{all} \ {x_t}) \ {x_{\overline{r\ \mathsf{some} \ t}} } \\ \mathsf{all} \ x_{\overline{\overline{t}}} \ {x_t} & \mathsf{all}\ {x_t} x_{\overline{\overline{t}}} \end{array} \end{align*} $$

An easy induction shows that for all $t\in {\mathbb {T}}$ , and all models ${\cal M}\models \Delta _1$ , ${[\![ {x_t} ]\!]} = [\![ t ]\!]$ . We translate the sentences of ${{\cal L}_{5.5}}$ to those of $\mathcal {R}^{*\dagger }$ (with the new nouns) in the obvious way: $(\mathsf{all} \ {t} \ {u})^*= \mathsf{all} \ {x_t} \ {x_u}$ , and $({\mathsf{some} \ t \ u})^*= {\mathsf{some} \ {x_t} \ {x_u}}$ . Let $\Delta _2 = \{\psi ^* : \psi \in \Gamma \}$ . Finally, we take $\Gamma ^*$ to be $\Delta _1 \cup \Delta _2$ , and $ {\varphi} ^*$ to be the translation that we just saw.

We check point (1): $\Gamma \models {\varphi} $ iff $\Gamma ^*\models {\varphi} ^*$ . Assume that $\Gamma \models {\varphi} $ , and let ${\cal M}\models \Gamma ^*$ . Due to $\Delta _1$ , we have our key fact: for all relevant terms t, $[\![ t ]\!] = {[\![ {x_t} ]\!]}$ . Using this and the fact that ${\cal M}\models \Delta _2$ , it follows that ${\cal M}\models \Gamma $ . And so ${\cal M}\models {\varphi} $ . But then using our key fact again, ${\cal M}\models {\varphi} ^*$ . The converse is similar. □

As shown in [Reference Pratt-Hartmann and Moss10], the consequence relation for $\mathcal {R}^{*\dagger }$ is ${{\scriptstyle \mathrm{EXPTIME}}}$ complete. Moreover, there are no proof systems which are finite, sound, and complete for the logic, even allowing reductio ad absurdum. (Nevertheless, there are logical systems for $\mathcal {R}^{*\dagger }$ . For example, Fitch-style system may be found in [Reference Moss4, Reference Moss6]. That system uses individual variables, as in first-order logic, but in a controlled way.)

It follows from the translations in Propositions 5.1 and 5.2 that the consequence relation for ${{\cal L}_{5.5}}$ is ${{\scriptstyle \mathrm{EXPTIME}}}$ complete. Moreover, there can be no sound and complete syllogistic proof system for ${{\cal L}_{5.5}}$ , even allowing all of the ( cases ) rules in this paper. Indeed, any rule allowing proof by cases would correspond to a rule in the language of $\mathcal {R}^{*\dagger }$ which is derivable from reductio ad absurdum.

We would like to point out that the ${{\scriptstyle \mathrm{EXPTIME}}}$ hardness result for ${{\cal L}_{5.5}}$ extends to the weaker logic ${\cal L}_{5}$ . To see this, we must recall the outline of the argument in [Reference Pratt-Hartmann and Moss10]. The starting point is Spaan’s theorem [Reference Spaan12] that the satisfiability problem for ${\cal L}_U$ , modal logic with the universal modality, is ${{\scriptstyle \mathrm{EXPTIME}}}$ hard. One takes a sentence $ {\varphi} $ in ${\cal L}_U$ and translates it to a finite set $S_{ {\varphi} }$ of sentences of $\mathcal {R}^{*\dagger }$ with the property that $ {\varphi} $ and $S_{ {\varphi} }$ are equisatisfiable. By our translation, $S_{ {\varphi} }$ may be taken to be a set of sentences in ${{\cal L}_{5.5}}$ . By examining the details, $S_{ {\varphi} }$ is a set $S^*_{ {\varphi} }$ of sentences in ${\cal L}_{5}$ , together with one additional sentence of the form $({\mathsf{some} \ x \ x})$ . The upshot is that $ {\varphi} $ is unsatisfiable iff $S^*_{ {\varphi} }\models \mathsf{all} \ {x} \ {\overline {x}}$ . Note that $({\mathsf{all}\ {x} \ {\overline {x}}} )$ is a sentence in ${\cal L}_{5}$ . In this way, the consequence relation for ${\cal L}_{5}$ is at least as hard as the (un)satisfiability problem for ${\cal L}_U$ .

This paper also explored extensions of syllogistic logic using schemes like (chains). It is possible that there is a schematic extension of ${{\cal L}_{5.5}}$ , and it is also possible that extensions to the syntax will help. We have not explored this. The logical system for $\mathcal {R}^{*\dagger }$ which uses individual variables adapts to ${{\cal L}_{5.5}}$ in a straightforward way, and we expect that the completeness and finite model properties which were shown in [Reference Moss4] hold in the adapted system.

Footnotes

1 The reader might wonder why we are indicating withdrawal of premises using a large “X” rather than the standard notation of square brackets. The reason is that later in the paper we use square brackets in our syntax, and we will thus need a different notation later to indicate withdrawals.

2 Of course, we do not actually need the Axiom of Choice when the set of all sentences in the language is countable.

References

BIBLIOGRAPHY

McAllester, D. A. (1993). Automatic recognition of tractability in inference relations. Journal of the ACM (JACM), 40(2), 284–303.CrossRef Google Scholar

McAllester, D. A., & Givan, R. (1992). Natural language syntax and first-order inference. Artificial Intelligence, 56, 1–20.CrossRef Google Scholar

Moss, L. S. (2008). Completeness theorems for syllogistic fragments. In Hamm, F., & Kepser, S., editors. Logics for Linguistic Structures. Berlin: de Gruyter Mouton, pp. 143–173.Google Scholar

Moss, L. S. (2010). Logics for two fragments beyond the syllogistic boundary. Fields of Logic and Computation: Essays Dedicated to Yuri Gurevich on the Occasion of His 70th Birthday, volume 6300 of Lecture Notes in Computer Science. Berlin, Germany: Springer-Verlag, pp. 538–563.CrossRef Google Scholar

Moss, L. S. (2010). Syllogistic logics with verbs. Journal of Logic and Computation, 20(4), 947–967.CrossRef Google Scholar

Moss, L. S. (2015). Natural logic. In Handbook of Contemporary Semantic Theory (second edition, chapter 18). Hoboken, NJ: John Wiley & Sons.Google Scholar

Moss, L. S. (2020). Logic from Language. Bloomington, IN: Indiana University.Google Scholar

Moss, L. S., & Kruckman, A. (2017). All and only. In Partiality and Underspecification in Natural Language Processing. Cambridge: Cambridge Scholars Publishers.Google Scholar

Pratt-Hartmann, I. (2014). The relational syllogistic revisited. Linguistic Issues in Language Technology, 9, 195–227.CrossRef Google Scholar

Pratt-Hartmann, I., & Moss, L. S. (2009). Logics for the relational syllogistic. Review of Symbolic Logic, 2(4), 647–683.CrossRef Google Scholar

Thomas, J. S. (1978). The complexity of satisfiability problems. Proceedings of the Tenth Annual ACM Symposium on Theory of Computing. San Diego, CA: San Diego, pp. 216–226.Google Scholar

Spaan, E. (1993). Complexity of Modal Logics. Ph.D. Thesis, ILLC, University of Amsterdam.Google Scholar

Fig. 1 Languages in this paper, given by section.

Fig. 2 The proof rules of ${\vdash }_0$.

Fig. 3 Rules in §2.5. These rules are added on top of the rules in Figure 2.

Fig. 4 The logic of §3.2. We also use the rules in Figure 2.

Fig. 5 The logic of §3.3. We also use (ax), $({\scriptstyle \mathrm{BARBARA}})$, $({\scriptstyle \mathrm{ANTI}})$ from Figure 2.

Fig. 6 Rules of the logical system for ${\cal L}_{4.5}^+$. The side condition on the structural rule (str) is that each $x_i$ must be identical to some $y_j$.

Article contents

EXPLORING THE LANDSCAPE OF RELATIONAL SYLLOGISTIC LOGICS

Abstract

Keywords

MSC classification

1 Introduction

1.1 Syntax and semantics of the logics in this paper

1.2 Related work

1.3 Syllogistic proof systems and bounded completeness

1.4 Example: Completeness and ${\scriptstyle \mathrm{PTIME}}$ decidability for ${{\cal L}_{1}}$

Proposition 1.6 [Reference Moss7].

1.5 Results

2 ${\cal L}_{2}$ : Adding the sentence former $({\mathsf{some} \ x \ y})$ to ${{\cal L}_{1}}$

2.1 The base system ${\vdash }_0$

The first canonical model of a theory $\Gamma $

The second canonical model of a theory $\Gamma$

2.2 No sound and complete syllogistic proof system for ${\cal L}_{2}$

The sets $\Gamma _n$

The sets $\Delta _{n,i}$

2.3 Completeness using the ( cases ) rule

2.4 Completeness using the ( chains ) schema

2.5 Completeness and ${\scriptstyle \mathrm{PTIME}}$ decidability for the extended language ${\cal L}_{2}^+$

Completeness and ${{\rm PT}{\scriptstyle \mathrm{IME}}}$ -decidability

3 ${\cal L}_{3}$ and ${\cal L}_{3.5}$ : Adding the term former $({\textit{r} \ \mathsf{some} \ x})$ to ${{\cal L}_{1}}$ and ${\cal L}_{2}$

3.1 Co-NPTime hardness of the consequence relation of ${\cal L}_{3}$

3.2 Completeness and Co-NPTime decidability for ${\cal L}_{3.5}$

The canonical model

3.3 Completeness for ${\cal L}_{3}$

The canonical model

4 ${\cal L}_{4}$ and ${\cal L}_{4.5}$ : Adding term complementation to ${{\cal L}_{1}}$ and ${\cal L}_{2}$

4.1 Co-NPTime hardness of the consequence relation of ${\cal L}_{4}$

4.2 Completeness for the extended languages ${\cal L}_{4}^+$ and ${\cal L}_{4.5}^+$

The canonical model of $\Gamma $

4.3 Open problems concerning ${\cal L}_{4}$ and ${\cal L}_{4.5}$

5 ${\cal L}_{5}$ and ${{\cal L}_{5.5}}$ : Putting it all together

Footnotes

References

BIBLIOGRAPHY

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests