Hostname: page-component-745bb68f8f-f46jp Total loading time: 0 Render date: 2025-02-06T00:47:15.357Z Has data issue: false hasContentIssue false

APPROXIMATING TREES AS COLOURED LINEAR ORDERS AND COMPLETE AXIOMATISATIONS OF SOME CLASSES OF TREES

Published online by Cambridge University Press:  08 June 2021

RUAAN KELLERMAN
Affiliation:
DEPARTMENT OF MATHEMATICS AND APPLIED MATHEMATICS UNIVERSITY OF PRETORIAPRETORIA, SOUTH AFRICAE-mail:ruaan.kellerman@up.ac.za
VALENTIN GORANKO
Affiliation:
DEPARTMENT OF PHILOSOPHY STOCKHOLM UNIVERSITYSTOCKHOLM, SWEDEN and SCHOOL OF MATHEMATICS (VISITING PROFESSORSHIP) UNIVERSITY OF THE WITWATERSRAND JOHANNESBURG, SOUTH AFRICA E-mail:valentin.goranko@philosophy.su.se
Rights & Permissions [Opens in a new window]

Abstract

We study the first-order theories of some natural and important classes of coloured trees, including the four classes of trees whose paths have the order type respectively of the natural numbers, the integers, the rationals, and the reals. We develop a technique for approximating a tree as a suitably coloured linear order. We then present the first-order theories of certain classes of coloured linear orders and use them, along with the approximating technique, to establish complete axiomatisations of the four classes of trees mentioned above.

Type
Article
Copyright
© Association for Symbolic Logic 2021

1 Introduction

1.1 Background and motivation

The logic-based study of linear orders was comprehensively presented in the early 1980s in the still very relevant classic book [Reference Rosenstein16], and there have been several important further developments since then, mentioned below. In particular, the first-order (FO) theories of various naturally arising classes of linear orders are now well-known, for example, the FO theories of each ordinal $\alpha $ for $\alpha < \omega ^{\omega }$ (see [Reference Rosenstein16]), as well as the FO theory of the rational numbers (implicit in [Reference Rosenstein16]). Furthermore, in [Reference Doets1, Reference Doets2] Doets studies several natural classes of coloured linear orders (i.e., linear orders enriched with unary predicates) and obtains complete axiomatisations for the first-order theories of: the class of coloured scattered linear orders; the class of coloured expansions of the natural numbers, from which the case of the integers follows easily; the class of coloured finite linear orders; the class of coloured complete linear orders; the class of coloured well-orders; and the class of coloured expansions of the order of the real numbers. Also closely related are the papers [Reference Mwesigye and Truss13, Reference Mwesigye and Truss14], which investigate coloured finite linear orders and coloured ordinals respectively.

The study and axiomatisation of the first-order theories of naturally arising classes of trees is substantially more complicated, however, even when the first-order theory of the corresponding class of linear orders is known. A more systematic attempt was made in [Reference Goranko and Kellerman8] to explore the general problem of transferring the first-order theory of a class of linear orders to the class of trees whose paths are all contained in that class of linear orders. That work left many open questions and indicated some inherent difficulties. They are mainly due to the following facts:

  1. (i) Since paths (maximal linearly ordered chains) are special sets of nodes in a given tree, the first-order language for trees cannot, in general, impose first-order properties on all paths of the tree, but only on the first-order definable ones. A path $\mathsf {P}$ in a tree $\mathfrak {T} := (T;<)$ is called singular when it contains a node u such that the set $\{x \in T : u \leqslant x \}$ is a linear order within $\mathfrak {T}$ . All singular paths are parametrically definable. However, trees may also contain emerging paths, which are paths that are not singular. In such non-definable emerging paths, behaviour in the terminal part of the path cannot generally be controlled within the expressive means of first-order logic.

  2. (ii) The branching structure of a tree cannot be captured by the properties of its paths.

Consequently, there are very few known complete axiomatisations of first-order theories of classes of trees, in essence comprising the following classes: the class of finitely branching trees (implicit in [Reference Schmerl18]), the class of (ordinary or coloured) well-founded trees (see [Reference Doets2]), and the class of finite trees (see [Reference Rogers, Backofen and Vijay-Shankar15]). Also, [Reference Goranko7] contains some general results on axiomatising subclasses of the class of finitely branching trees relative to the respective classes of trees with no restriction on their branching. Further, the first-order theories of the class of trees, all of whose paths contain greatest elements (leaves), and the class of trees whose paths are all isomorphic to some given ordinal $\alpha $ with $\alpha < \omega ^{\omega }$ , are investigated in [Reference Kellerman11], but without deriving complete axiomatisations of these first-order theories. Lastly, even though not directly related to the present paper, we should mention the very important works by Gurevich and Shelah [Reference Gurevich and Shelah9, Reference Gurevich, Shelah, Dorn and Weingartner10] on decidability of first-order theories of coloured trees with additional quantification over branches.

1.2 Goal and main contributions of the present work

The goal of the present paper is to study and axiomatise in first-order logic the classes of trees naturally arising from some important linear orders. More precisely, we obtain axiomatisations of the first-order theories of these classes of trees, rather than axiomatisations of those classes themselves. This amounts to the following: given a class of trees $\mathcal {K}$ , we seek a recursive (i.e., decidable) set of first-order sentences $\Sigma $ such that $\Sigma \subseteq \mathrm {Th}(\mathcal {K})$ and $\Sigma \models \mathrm {Th}(\mathcal {K})$ . In turn, $\Sigma \models \mathrm {Th}(\mathcal {K})$ if and only if for each natural number n and each model $\mathfrak {T}$ of $\Sigma $ , there exists a tree $\mathfrak {S}$ in $\mathcal {K}$ such that $\mathfrak {T}$ and $\mathfrak {S}$ satisfy the same sentences of quantifier rank at most n.

Now for any order type $\alpha $ , a tree whose paths are all isomorphic to $\alpha $ is called an $\alpha $ -tree. This paper addresses and solves the problems of axiomatising the first-order theories of the classes of (coloured) $\omega $ -trees, $\zeta $ -trees, $\eta $ -trees, and $\lambda $ -trees, where $\omega $ , $\zeta $ , $\eta $ , and $\lambda $ are the order types of the sets of natural numbers, integers, rational numbers, and real numbers, respectively. While the case of $\eta $ -trees is easy and the case of $\omega $ -trees was essentially known from [Reference Doets2], the cases of $\zeta $ -trees and $\lambda $ -trees turned out to be quite non-trivial. The complete axiomatisations of their first-order theories are obtained here by using a new construction for approximating a given tree by a suitably coloured linear order and then using the axiomatisations of the first-order theories of the classes of coloured expansions of $\zeta $ and $\lambda $ respectively.

1.3 Structure of the paper

First, after briefly covering some basic definitions and notation in Section 2 : Preliminaries, we present in Section 3 : First-order theories of coloured linear orders the axiomatisations of several important classes of coloured linear orders. All these axiomatisations are either already known in the literature or are easily derived from such known axiomatisations. However we include them here as they are necessary for the subsequent main results on trees, for the sake of completeness of the presentation and for uniformity of the underlying axioms that are used. As is to be expected, the analysis of these various classes of coloured linear orders tends to be substantially more complex than for the corresponding classes of monochromatic linear orders. We note in passing that, apart from their use in axiomatising the first-order theories of classes of trees, coloured linear orders are also useful, inter alia, as models of (possibly infinite) words over an alphabet. Then, in Section 4 : From trees to coloured linear orders and back we introduce the construction used for approximating a tree as a coloured linear order, which we use in Section 5 : First-order theories of coloured trees to axiomatise the first-order theories of the classes of coloured $\zeta $ -trees and coloured $\lambda $ -trees. In addition, easier axiomatisations of the first-order theories of the classes of coloured $\omega $ -trees and coloured $\eta $ -trees are also given. In all of these cases, the axiomatisations of the first-order theories of the classes of trees make essential use of the axiomatisations of the first-order theories of their corresponding classes of coloured linear orders. We end with a summary and some suggestions for future work in Section 6 : Concluding remarks.

2 Preliminaries

2.1 Notation and logical preliminaries

We will be working with first-order (FO) languages with signatures that contain a binary relation $<$ (possibly, in addition to constants and unary relations). Expressions such as $x> y$ , $x \leqslant y$ , and $x \not < y$ , when used in first-order formulas, are to be interpreted in the usual way, as abbreviations for the formulas $y < x$ , $x < y \vee x = y$ , and $\neg \left (x<y\right )$ respectively; similarly for other variations of the relation $<$ .

Equality will always be assumed to be included in the language. The FO signature that consists only of equality plus the relation symbol $<$ will be denoted by $L_0$ , and the signature that extends $L_0$ with unary relation symbols $\mathbf {c}_{1},\mathbf {c}_{2},\ldots ,\mathbf {c}_{k}$ (regarded as colours) will be denoted by $L_k$ . For $k \geqslant 0$ , $L_k'$ will denote the signature $L_k$ expanded with a single constant symbol, i.e., the signature of structures of the form $\left (A;<,a\right )$ when $k=0$ or $\left (A;<,a,\mathbf {c}_{1},\ldots ,\mathbf {c}_{k}\right )$ when $k>0$ .

Consider an arbitrary fixed first-order language L. The quantifier rank of a formula $\varphi $ is defined as expected and will be denoted by $\mathrm {qr}\left (\varphi \right )$ . We denote by FO $_n$ the set of first-order formulas (of the given signature L) of quantifier rank at most n and for any L-structures $\mathfrak {A}$ and $\mathfrak {B}$ , by $\mathfrak {A}\equiv _n \mathfrak {B}$ we mean equivalence of $\mathfrak {A}$ and $\mathfrak {B}$ with respect to all sentences in FO $_n$ , whereas $\mathfrak {A}\equiv \mathfrak {B}$ means, as usual, their elementary equivalence.

The domain of a structure $\mathfrak {A}$ will be denoted by $\left |\mathfrak {A}\right |$ . The cardinality of a set X will also be denoted by $\left |X\right |$ . From the context, it will always be clear which one of the two meanings applies.

Given an L-structure $\mathfrak {A}$ with domain A, the parametrically definable subsets of $\mathfrak {A}$ are those sets of the form $\left \{x \in A : \left (\mathfrak {A};\bar {a}\right ) \models \varphi \left (x,\bar {a}\right ) \right \}$ , where $\varphi \left (x,\bar {y}\right )$ is a first-order formula with $\bar {y}$ possibly empty, and $\bar {a}$ is a tuple of elements from A of the same arity as $\bar {y}$ .

Given formulas $\varphi $ and $\theta = \theta \left (x,\bar {y}\right )$ that have no variables in common, the relativisation of $\varphi $ to $\theta $ , obtained from $\varphi $ by replacing each subformula of the form $\exists u \left (\psi \left (u,\bar {w}\right )\right )$ with $\exists u \left (\theta \left (u,\bar {y}\right ) \wedge \psi \left (u,\bar {w}\right )\right )$ , and each subformula of the form $\forall u \left (\psi \left (u,\bar {w}\right )\right )$ with the formula $\forall u \left (\theta \left (u,\bar {y}\right ) \rightarrow \psi \left (u,\bar {w}\right )\right )$ , will be denoted by $\varphi ^{\theta }$ . For a detailed account of relativisations, see, e.g., [Reference Rosenstein16, p. 259]. If $\theta = \theta \left (x,y\right )$ is the formula $y \leqslant x$ , with y here playing the role of a parameter, then $\varphi ^{\theta }$ will be written simply as $\varphi ^{\geqslant y}$ . The formulas $\varphi ^{> y}$ , $\varphi ^{\leqslant y}$ , and $\varphi ^{< y}$ are interpreted similarly. If $\theta \left (x,y,z\right )$ is the formula $y \leqslant x \land x < z$ , with y and z now treated as parameters, then $\varphi ^{\theta }$ will be written as $\varphi ^{\left [y,z\right )}$ , and similarly for the other bounded intervals. Given a structure $\mathfrak {A}$ with domain A, a (possibly empty) tuple of elements $\bar {a} \in A^k$ , and a formula $\theta \left (x,\bar {y}\right )$ with $\bar {y}$ being a (possibly empty) k-tuple of variables, $\left (\mathfrak {A};\bar {a}\right )^{\theta }$ will denote the set $\left \{v \in A : \left (\mathfrak {A};\bar {a}\right ) \models \theta \left (v,\bar {a}\right ) \right \}$ .

For any natural number m, the linear order $\left (\left \{0,1,\ldots ,m-1\right \};<\right )$ , ordered in the usual way, will be denoted by $\mathbf {m}$ .

We briefly recall the definitions and main result about characteristic first-order formulas (using the notation of [Reference Doets1, Section 1.6]) that will be needed.

Given a structure $\mathfrak {A}$ with domain A, natural numbers k and n, a k-tuple of elements $\bar {a} = \left (a_0,a_1,\ldots ,a_{k-1}\right )$ from $A^k$ , and a k-tuple of variables $\bar {x} = \left (x_0,x_1,\ldots ,x_{k-1}\right )$ (where $\bar {a}$ and $\bar {x}$ are empty when $k=0$ ), the n-characteristic formula of the structure $\mathfrak {A}$ over the tuple $\bar {a}$ is denoted by $[\kern -0.15em[ \left ( \mathfrak {A}; \bar {a} \right ) ]\kern -0.15em]^n \left ( \bar {x} \right )$ and is defined as follows:

  • $[\kern -0.15em[( \mathfrak {A}; \bar {a})]\kern -0.15em]^0 ( \bar {x} ) := \bigwedge \big \{ \varphi \left ( \bar {x} \right ) : \varphi \ \text {is an atomic or negated atomic formula with}$ $\mathfrak {A} \models \varphi \left ( \bar {a} \right ) \big \}$ ;

  • $[\kern -0.15em[ ( \mathfrak {A}; \bar {a} ) ]\kern -0.15em]^{m+1} ( \bar {x} ) := \bigwedge _{a_k \in A} ( \exists x_k ( [\kern -0.15em[ ( \mathfrak {A}; \bar {a}a_k ) ]\kern -0.15em]^m ( \bar {x}x_k))) \wedge \forall x_k ( \bigvee _{a_k \in A} [\kern -0.15em[ ( \mathfrak {A}; \bar {a}a_k ) ]\kern -0.15em]^m ( \bar {x}x_k))$ .

For languages with finite relational signatures it is well known (see, e.g., [Reference Ebbinghaus, Flum and Thomas5, Theorem 3.4 of Chapter 12]) that, for all natural numbers n and k there are, up to logical equivalence, only finitely many n-characteristic formulas over the class of all structures for that signature and all k-tuples in those structures. If $\bar {a}$ is the empty tuple then $[\kern -0.15em[( \mathfrak {A};\bar {a}) ]\kern -0.15em]^n ( \bar {x} )$ is written as $[\kern -0.15em[ \mathfrak {A} ]\kern -0.15em]^n$ and is called the n-characteristic sentence of $\mathfrak {A}$ . Here are some important facts about characteristic formulas:

  1. 1. The formula $[\kern -0.15em[( \mathfrak {A}; \bar {a} ) ]\kern -0.15em]^n ( \bar {x} )$ has quantifier rank n.

  2. 2. $\mathfrak {A} \models [\kern -0.15em[ ( \mathfrak {A}; \bar {a} ) ]\kern -0.15em]^n ( \bar {a} )$ .

  3. 3. If $\mathfrak {B}$ is a structure in the signature of $\mathfrak {A}$ and $\bar {b}$ is a k-tuple of elements from the domain of $\mathfrak {B}$ then the following three statements are equivalent for any natural number n.

    1. (a) $( \mathfrak {A}; \bar {a} ) \equiv _n ( \mathfrak {B}; \bar {b} )$ .

    2. (b) $\mathfrak {B} \models [\kern -0.15em[ ( \mathfrak {A}; \bar {a} ) ]\kern -0.15em]^n ( \bar {b} )$ .

    3. (c) $[\kern -0.15em[ ( \mathfrak {A}; \bar {a}) ]\kern -0.15em]^n ( \bar {x} ) \equiv [\kern -0.15em[ ( \mathfrak {B}; \bar {b} ) ]\kern -0.15em]^n ( \bar {x} )$ , where $\equiv $ denotes logical equivalence of formulas.

Given a finite relational FO signature L, the set

$$ \begin{align*}\sigma_n\left(L\right) := {\left\{ [\kern-0.15em[ \mathfrak{A} ]\kern-0.15em]^n : \mathfrak{A} \ \text{is any}\ L\text{-structure} \right\}} / \mathord{\equiv}\end{align*} $$

that consists of all equivalence classes of logically equivalent n-characteristic L-sentences will be called the n-spectrum of L, and its cardinality will be denoted by $f\left (L,n\right ) := \left |\sigma _n\left (L\right )\right |$ . For each finite relational signature L, natural number n, and integer i with $1 \leqslant i \leqslant f\left (L,n\right )$ , we fix a sentence $\tau _{L,n,i}$ such that

$$ \begin{align*}\tau_{L,n,1},\,\tau_{L,n,2},\,\ldots,\,\tau_{L,n,f\left(L,n\right)}\end{align*} $$

is an enumeration of all n-characteristic L-sentences, up to logical equivalence.

Given a structure $\mathfrak {A}$ , a subset $A'$ of A that has the property that, for each $a \in A$ , there exists $b \in A'$ such that $\left (\mathfrak {A};a\right ) \equiv _n \left (\mathfrak {A};b\right )$ , will be called an n-support of $\mathfrak {A}$ . It follows from the properties of characteristic formulas that if $\mathfrak {A}$ has a finite relational signature then it has a finite n-support, for each n.

The reader is referred to [Reference Doets3] for background on Ehrenfeucht–Fraïssé games; the players Di and Sy of [Reference Doets3] will be called respectively Spoiler and Duplicator here and the n-round game on structures $\mathfrak {A}$ and $\mathfrak {B}$ will be denoted by $\mathrm {EF}_{n}\!\left (\mathfrak {A},\mathfrak {B}\right )$ .

2.2 Linear orders

A k-coloured linear order, or simply a coloured linear order, is a structure of the type $\mathfrak {A} = \left (A;<,\mathbf {c}_{1},\mathbf {c}_{2},\ldots ,\mathbf {c}_{k}\right )$ where k is a positive integer, $\left (A;<\right )$ is a linear order, and each $\mathbf {c}_{i}$ is a unary predicate in A, called a colour. (Thus, an element in A may have more than one colour, or none whatsoever.) The class of all k-coloured linear orders will be denoted by $\mathcal {L}_k$ . Putting $\bar {\mathbf {c}} = \left (\mathbf {c}_{1},\mathbf {c}_{2},\ldots ,\mathbf {c}_{k}\right )$ , $\mathfrak {A}$ can also be written simply as $\mathfrak {A} = \left (A;<,\bar {\mathbf {c}}\right )$ . The k-tuple of colours $\bar {\mathbf {c}}$ will sometimes be written as $\bar {\mathbf {c}}_k$ , to emphasise that there are k colours. Given a subset B of A, we define ${\bar {\mathbf {c}}}\mathord {\upharpoonright }_{B} := \left ({\mathbf {c}_{1}}\mathord {\upharpoonright }_{B},{\mathbf {c}_{2}}\mathord {\upharpoonright }_{B},\ldots ,{\mathbf {c}_{k}}\mathord {\upharpoonright }_{B}\right )$ and $\mathfrak {A}^B := \left (B;{<}\mathord {\upharpoonright }_{B},{\bar {\mathbf {c}}}\mathord {\upharpoonright }_{B}\right )$ . To keep the notation simple, $\mathfrak {A}^B$ may sometimes also be written simply as $\mathfrak {A}^B = \left (B;<,\bar {\mathbf {c}}\right )$ , tacitly assuming that $<$ and $\bar {\mathbf {c}}$ are restricted in the obvious way. If $a \in A$ then $\mathfrak {A}^{\geqslant a}$ will mean $\mathfrak {A}^B$ where $B = \{ x \in A : a \leqslant x \}$ , and similarly for $\mathfrak {A}^{>a}$ , $\mathfrak {A}^{\leqslant a}$ , and $\mathfrak {A}^{<a}$ . A linear order $\left (A;<\right )$ that is not enriched with any colours will be called monochromatic. For technical convenience, monochromatic linear orders may also be thought of as $0$ -coloured and for which the tuple of colours $\bar {\mathbf {c}}$ is empty. Given a coloured linear order $\mathfrak {A} = \left (A;<,\bar {\mathbf {c}}\right )$ , the monochromatic reduct $\left (A;<\right )$ will be denoted by $\mathfrak {A}^-$ . Unless otherwise specified, the domains of structures $\mathfrak {A}$ , $\mathfrak {B}$ , $\mathfrak {C}_i$ , etc. will be denoted by A, B, and $C_i$ , respectively. To avoid ambiguity, when several structures are under consideration, their order relations will be denoted by $<_{\mathfrak {A}}$ , $<_{\mathfrak {B}}$ , and $<_{\mathfrak {C}_i}$ , respectively. In the interest of readability, $<_{\mathfrak {A}}$ , $<_{\mathfrak {B}}$ , and $<_{\mathfrak {C}_i}$ may sometimes all be denoted simply by $<$ , with the understanding that the relation $<$ is then to be understood as $<_{\mathfrak {A}}$ , $<_{\mathfrak {B}}$ , or $<_{\mathfrak {C}_i}$ , depending on the structure being worked in.

Given two coloured linear orders $\mathfrak {A} = \left (A;<_{\mathfrak {A}},\bar {\mathbf {c}}\right )$ and $\mathfrak {B} = \left (B;<_{\mathfrak {B}},\bar {\mathbf {d}}\right )$ , the sum $\mathfrak {A} + \mathfrak {B}$ of $\mathfrak {A}$ and $\mathfrak {B}$ is defined to be the coloured linear order $\mathfrak {A} + \mathfrak {B} := \left (A \sqcup B;<,\bar {\mathbf {e}}\right )$ , where $A \sqcup B := \left (A \times \left \{0\right \}\right ) \cup \left (B \times \left \{1\right \}\right )$ , the relation $<$ is the union of the three sets $\left \{\left (\left (x,0\right ),\left (y,0\right )\right ) : x <_{\mathfrak {A}} y \right \}$ , $\left \{\left (\left (x,1\right ),\left (y,1\right )\right ): x <_{\mathfrak {B}} y\right \}$ , and $\left \{\left (\left (x,0\right ),\left (y,1\right )\right ): x \in A \ \text {and} \ y \in B \right \}$ , and $\bar {\mathbf {e}} = \bar {\mathbf {c}}\bar {\mathbf {d}}'$ where $\bar {\mathbf {d}}'$ is the tuple that is obtained from $\bar {\mathbf {d}}$ by removing all colours that are already in $\bar {\mathbf {c}}$ , and for any colour $\mathbf {e}_{i}$ in $\bar {\mathbf {e}}$ and any $u \in A \sqcup B$ , $\mathfrak {A}+\mathfrak {B} \models \mathbf {e}_{i}\left (u\right )$ if and only if either u has the form $u = \left (a,0\right )$ and $\mathfrak {A} \models \mathbf {e}_{i}\left (a\right )$ , or u has the form $u = \left (b,1\right )$ and $\mathfrak {B} \models \mathbf {e}_{i}\left (b\right )$ . To keep the notation simple, the elements $(a,0)$ and $(b,1)$ may sometimes be identified simply with the elements $a \in A$ and $b \in B$ respectively. The following Feferman–Vaught style result (see [Reference Feferman and Vaught6]) is easily proved by a straightforward application of Ehrenfeucht–Fraïssé games.

Lemma 2.1. Let $\mathfrak {A}_1$ , $\mathfrak {A}_2$ , $\mathfrak {B}_1$ , and $\mathfrak {B}_2$ be coloured linear orders, where $\mathfrak {A}_1$ and $\mathfrak {B}_1$ have the same set of colours $\bar {\mathbf {c}}$ , and $\mathfrak {A}_2$ and $\mathfrak {B}_2$ have the same set of colours $\bar {\mathbf {d}}$ . If $\mathfrak {A}_1 \equiv _n \mathfrak {B}_1$ and $\mathfrak {A}_2 \equiv _n \mathfrak {B}_2$ then $\mathfrak {A}_1 + \mathfrak {A}_2 \equiv _n \mathfrak {B}_1 + \mathfrak {B}_2$ .

2.3 Trees

A forest is a structure of type $\mathfrak {T} = (T; <)$ in which the relation $<$ is irreflexive, transitive, and left-linear (for each $a \in T$ , the set $\left \{x : x < a\right \}$ is linear). A tree is a forest which is left-connected: for any $a,b \in T$ there exists $c \in T$ such that $c \leqslant a$ and $c \leqslant b$ . A subset of a forest T that is maximal with respect to being connected is called a (connected) component of $\mathfrak {T}$ . The elements of T are called nodes. The least node of $\mathfrak {T}$ , if it exists, is called the root of $\mathfrak {T}$ , and a node is called a leaf when it is maximal with respect to $<$ . For a maximal linearly ordered subset $A \subseteq T$ , the linear order $\mathsf {A} = \left (A;{<}\mathord {\upharpoonright }_{A}\right )$ is called a path in $\mathfrak {T}$ . For a left-closed subset B of a path A, i.e., such that for each $b \in B$ , if $c < b$ then $c \in B$ , too, the linear order $\mathsf {B} = \left (B;{<}\mathord {\upharpoonright }_{B}\right )$ is called a stem (of $\mathsf {A}$ ) in $\mathfrak {T}$ . For ease of notation, the path $\mathsf {A}$ and the stem $\mathsf {B}$ will often be identified with their domains A and B. The set of all paths in $\mathfrak {T}$ will be denoted by $\mathcal {H}\left (\mathfrak {T}\right )$ . A set $I \subseteq T$ is called an interval when, for $a,b,c \in T$ with $a < b < c$ , if $a,c \in I$ then $b \in I$ , too.

As with linear orders, unary predicates regarded as colours may be added to the tree to obtain a coloured tree $\left (\mathfrak {T};\bar {\mathbf {c}}\right )$ .

Given a (possibly monochromatic) tree $\mathfrak {T} = \left (T;<,\bar {\mathbf {c}}\right )$ and a subset $C \subseteq T$ , the structure $\mathfrak {T}^C := \left (C;{<}\mathord {\upharpoonright }_{C},{\bar {\mathbf {c}}}\mathord {\upharpoonright }_{C}\right )$ is defined as with linear orders. In particular, for a node $a \in T$ , the trees $\mathfrak {T}^{\geqslant a}$ , $\mathfrak {T}^{>a}$ , $\mathfrak {T}^{\leqslant a}$ , and $\mathfrak {T}^{< a}$ are defined as the trees $\mathfrak {T}^C$ with C taken to be the set $T^{\geqslant a} := \left \{x \in T : a \leqslant x \right \}$ , $T^{> a} := \left \{x \in T : a < x \right \}$ , $T^{\leqslant a} := \left \{x \in T : x \leqslant a \right \}$ , and $T^{< a} := \left \{x \in T : x < a \right \}$ respectively. The tree $\mathfrak {T}^{\geqslant a}$ will be called the principal subtree generated by a and the stem $T^{<a}$ will be called the principal stem generated by a.

Now, for $\mathfrak {T} = \left (T;<_{\mathfrak {T}},\bar {\mathbf {c}}\right )$ a tree and $\mathfrak {F} = \left (F;<_{\mathfrak {F}},\bar {\mathbf {d}}\right )$ a forest (where either of $\mathfrak {T}$ and $\mathfrak {F}$ may be monochromatic), and a stem B in $\mathfrak {T}$ , we denote by $\mathfrak {T} +_B \mathfrak {F} := \left (T \sqcup F;<_{\mathfrak {T} +_B \mathfrak {F}},\bar {\mathbf {e}}\right )$ the tree with domain

$$ \begin{align*}\left|\mathfrak{T} +_B \mathfrak{F}\right| := T \sqcup F = \left(T \times \left\{0\right\}\right) \cup \left(F \times \left\{1\right\}\right),\end{align*} $$

with order relation

$$ \begin{align*} {<_{\mathfrak{T} +_B \mathfrak{F}}} & := \Big\{ \big( \left(x,0\right), \left(y,0\right) \big) : x <_{\mathfrak{T}} y \Big\} \cup \Big\{ \big( \left(x,1\right), \left(y,1\right) \big) : x <_{\mathfrak{F}} y \Big\} \\ & \;\,\quad \cup \Big\{ \big( \left(x,0\right), \left(y,1\right) \big) : x \in B \ \text{and} \ y \in F \Big\}, \end{align*} $$

and with colours $\bar {\mathbf {e}} = \bar {\mathbf {c}}\bar {\mathbf {d}}'$ , where $\bar {\mathbf {d}}'$ is the tuple that is obtained from $\bar {\mathbf {d}}$ by removing all colours that are already in $\bar {\mathbf {c}}$ , and where, for any colour $\mathbf {e}_{i}$ in $\bar {\mathbf {e}}$ and any $u \in T \sqcup F$ , $\mathfrak {T} +_B \mathfrak {F} \models \mathbf {e}_{i}\left (u\right )$ if and only if either u has the form $u = \left (a,0\right )$ and $\mathfrak {T} \models \mathbf {e}_{i}\left (a\right )$ , or u has the form $u = \left (b,1\right )$ and $\mathfrak {F} \models \mathbf {e}_{i}\left (b\right )$ . In other words, $\mathfrak {T} +_B \mathfrak {F}$ is the tree obtained from $\mathfrak {T}$ by adding the forest $\mathfrak {F}$ to the end of the stem B; a depiction of this tree is given in Figure 1. As with sums of linear orders, for nodes $a \in T$ and $b \in F$ , the nodes $\left (a,0\right )$ and $\left (b,1\right )$ in the tree $\mathfrak {T} +_B \mathfrak {F}$ will often be written simply as a and b rather than as $\left (a,0\right )$ and $\left (b,1\right )$ , to keep the notation simpler, provided that there is no danger of confusion.

Figure 1 A depiction of the tree $\mathfrak {T} +_B \mathfrak {F}$ .

The following composition result generalises Lemma 2.1 and can also be proved by a straightforward Ehrenfeucht–Fraïssé game.

Lemma 2.2. Let $\mathfrak {T}$ and $\mathfrak {S}$ be (possibly monochromatic) trees with the same set of colours $\bar {\mathbf {c}}$ , A and B be stems in $\mathfrak {T}$ and $\mathfrak {S}$ respectively, and $\mathfrak {F}$ and $\mathfrak {G}$ be (possibly monochromatic) forests with the same set of colours $\bar {\mathbf {d}}$ . Let $\bar {a} := \left (a_1,a_2,\ldots ,a_k\right ) \in F^k$ and $\bar {b} := \left (b_1,b_2,\ldots ,b_k\right ) \in G^k$ (the tuples $\bar {a}$ and $\bar {b}$ may be empty) and for each i, let $a_i' := \left (a_i,1\right ) \in \left |\mathfrak {T} +_A \mathfrak {F}\right |$ and $b_i' := \left (b_i,1\right ) \in \left |\mathfrak {S} +_B \mathfrak {G}\right |$ and let $\bar {a}' := \left (a_1',a_2',\ldots ,a_k'\right )$ and $\bar {b}' := \left (b_1',b_2',\ldots ,b_k'\right )$ (the tuples $\bar {a}'$ and $\bar {b}'$ will be empty when $\bar {a}$ and $\bar {b}$ are). If $\left (\mathfrak {T};A\right ) \equiv _n \left (\mathfrak {S};B\right )$ and $\left (\mathfrak {F};\bar {a}\right ) \equiv _n (\mathfrak {G};\bar {b} )$ then $\left (\mathfrak {T} +_A \mathfrak {F}; \bar {a}'\right ) \equiv _n (\mathfrak {S} +_B \mathfrak {G}; \bar {b}' )$ .

Given a class of (monochromatic) linear orders $\mathcal {C}$ , a tree $\mathfrak {S}$ for which $\mathcal {H}\left (\mathfrak {S}\right ) \subseteq \mathcal {C}$ will be called a tree over $\mathcal {C}$ , or simply a $\mathcal {C}$ -tree, and the class of all k-coloured $\mathcal {C}$ -trees will be denoted by $\mathcal {T}_k\left (\mathcal {C}\right )$ (with $k=0$ in the case of monochromatic trees). If $\mathcal {C} = \left \{\alpha \right \}$ consists of just one linear order then $\mathcal {T}_k\left (\mathcal {C}\right )$ will also be denoted by $\mathcal {T}_k\left (\alpha \right )$ and the trees in $\mathcal {T}_k\left (\alpha \right )$ will be called $\alpha $ -trees.

3 First-order theories of coloured linear orders

3.1 A catalogue of properties and axioms

A (possibly coloured) tree $\mathfrak {T}$ , in particular, a linear order, is called:

  • forward discrete, if for any two elements a and b, such that $a < b$ , there exists an immediate successor c of a such that $a < c \leqslant b$ . In the case of linear orders, forward discreteness is equivalent to stating that each non-greatest element has an immediate successor.

  • backward discrete, when each element except for the least element, if there is one, has an immediate predecessor.

  • discrete, when it is both forward discrete and backward discrete.

  • definably forward well-founded, if every parametrically first-order definable non-empty set of elements contains a maximal element.Footnote 1

  • definably backward well-founded, if every parametrically first-order definable non-empty set of elements contains a minimal (which will also be a least) element.

  • definably bounded forward well-founded, if every parametrically first-order definable non-empty set of elements X that is bounded above (i.e., for which there exists b such that $x \leqslant b$ for each $x \in X$ ) contains a maximal element.

  • definably bounded backward well-founded, if every parametrically first-order definable non-empty set of elements X that is bounded below (i.e., there exists a such that $a \leqslant x$ for each $x \in X$ ) contains a minimal element.

We define the following sentences in the first-order language $L_k$ :

  1. 1. $\mathsf {LO}$ expresses that the structure is a linear order, as the conjunction of the sentences expressing irreflexivity, transitivity, and totality:

    $$ \begin{align*} \mathsf{LO} &:= \forall x \left(x \not< x\right) \wedge \forall x \forall y \forall z \big( \left(x < y \wedge y < z\right) \rightarrow x < z\big) \\ & \;\,\quad \wedge \forall x \forall y \left(x < y \vee y < x \vee x = y\right). \end{align*} $$
  2. 2. $\mathsf {Tree}$ expresses that the structure is a tree, as the conjunction of sentences that express irreflexivity, transitivity, left-linearity, and left-connectedness:

    $$ \begin{align*} \mathsf{Tree} &:= \forall x \left(x \not< x\right) \wedge \forall x \forall y \forall z \big( \left(x < y \wedge y < z\right) \rightarrow x < z\big) \\ & \;\,\quad \wedge \forall x \forall y \forall z \big( \left(y < x \wedge z < x\right) \rightarrow \left( y < z \vee z < y \vee y = z \right) \big) \\ & \;\,\quad\wedge \forall x \forall y \exists z \left(z \leqslant x \wedge z \leqslant y\right). \end{align*} $$
  3. 3. $\mathsf {Le}$ is a sentence that expresses the existence of a $<$ -least element:

    $$ \begin{align*}\mathsf{Le} := \exists x \forall y \left(x \leqslant y\right).\end{align*} $$
  4. 4. $\mathsf {Gr}$ is a sentence that expresses the existence of a $<$ -maximal element:

    $$ \begin{align*}\mathsf{Gr} := \exists x \big(\neg \exists y \left(x < y\right) \big).\end{align*} $$
  5. 5. $\mathsf {FD}$ is a sentence that expresses the property of forward discreteness:

    $$ \begin{align*}\mathsf{FD} := \forall x \forall y (x < y \rightarrow \exists z (x < z \leqslant y \wedge \neg \exists u (x < u < z))).\end{align*} $$
  6. 6. $\mathsf {BD}$ is a sentence that expresses the property of backward discreteness:Footnote 2

    $$ \begin{align*}\mathsf{BD} := \forall x (\exists y (y < x) \rightarrow \exists y (y < x \wedge \neg \exists z (y < z < x))).\end{align*} $$
  7. 7. $\mathsf {De}$ is a sentence that expresses the property of density:

    $$ \begin{align*}\mathsf{De} := \forall x \forall y \left(x < y \rightarrow \exists z \left(x < z < y\right)\right).\end{align*} $$
  8. 8. $\mathsf {FWF}_k$ is a scheme of (infinitely many) sentences that expresses the property of definable forward well-foundedness and consists of all sentences of the form

    $$ \begin{align*} \mathsf{FWF}_k(\varphi) := \forall \bar{z} ( \exists x (\varphi(x,\bar{z})) \rightarrow \exists x ( \varphi(x,\bar{z}) \wedge \neg \exists y (\varphi(y,\bar{z}) \wedge x < y))),\end{align*} $$
    where $\bar {z} = \left (z_1,z_2,\ldots ,z_n\right )$ is any (possibly empty) tuple of variables and $\varphi \left (x,\bar {z}\right )$ is any formula in the language $L_k$ .
  9. 9. $\mathsf {BWF}_k$ is an infinite scheme that expresses the property of definable backward well-foundedness and consists of all sentences of the form

    $$ \begin{align*} \mathsf{BWF}_k(\varphi) := \forall \bar{z}( \exists x(\varphi(x,\bar{z})) \rightarrow \exists x ( \varphi(x,\bar{z}) \wedge \neg \exists y (\varphi(y,\bar{z}) \wedge y < x )))\end{align*} $$
    for any formula $\varphi \left (x,\bar {z}\right )$ in the language $L_k$ .
  10. 10. $\mathsf {BFWF}_k$ is an infinite scheme that expresses the property of definable bounded forward well-foundedness and consists of all sentences of the form

    $$ \begin{align*} \mathsf{BFWF}_k(\varphi) & := \forall \bar{z} (( \exists x (\varphi(x,\bar{z})) \wedge \exists y \forall x( \varphi(x,\bar{z}) \rightarrow x \leqslant y)) \\ & \;\,\quad \rightarrow \exists x( \varphi(x,\bar{z}) \wedge \neg \exists y(\varphi(y,\bar{z}) \wedge x < y))) \end{align*} $$
    for any formula $\varphi \left (x,\bar {z}\right )$ in the language $L_k$ .
  11. 11. $\mathsf {BBWF}_k$ is an infinite scheme that expresses the property of definable bounded backward well-foundedness and consists of all sentences of the form

    $$ \begin{align*} \mathsf{BBWF}_k(\varphi) & := \forall \bar{z}(( \exists x(\varphi(x,\bar{z})) \wedge \exists y \forall x( \varphi(x,\bar{z}) \rightarrow y \leqslant x)) \\ & \;\,\quad \rightarrow \exists x( \varphi(x,\bar{z}) \wedge \neg \exists y(\varphi(y,\bar{z}) \wedge y < x))) \end{align*} $$
    for any formula $\varphi \left (x,\bar {z}\right )$ in the language $L_k$ .

The following facts about the first-order theory of $\omega $ will be needed later.

Fact 3.1. The first-order theory of $\omega $ can be axiomatised by the theory

$$ \begin{align*}\left\{\mathsf{LO},\mathsf{Le},\neg\mathsf{Gr},\mathsf{BD},\mathsf{FD}\right\}\end{align*} $$

(see, e.g., [Reference Rosenstein16, p. 254]). Moreover, the class of models of this first-order theory consists precisely of all linear orders of the form $\omega + \zeta \cdot \alpha $ where $\alpha $ is any linear order (see, e.g., [Reference Rosenstein16, Corollary 13.12 and Proposition 13.25]).

3.2 The first-order theory of coloured finite linear orders

We define the following first-order theory:

$$\begin{align*}\mathsf{CFLO}_k := \left\{\mathsf{LO},\mathsf{Le},\mathsf{FD}\right\} \cup \mathsf{FWF}_k.\end{align*}$$

The models of $\mathsf {CFLO}_k$ will be called coloured quasi-finite linear orders.

Lemma 3.2. Let $\mathfrak {A} = \left (A;<,\bar {\mathbf {c}}\right )$ be a coloured quasi-finite linear order. Then:

  1. 1. A has a greatest element.

  2. 2. $\mathfrak {A}$ is backward discrete, hence (due to $\mathsf {FD}$ ) also discrete.

  3. 3. $\mathfrak {A}$ is definably backward well-founded.

  4. 4. For each element $a \in A$ , the structures $\mathfrak {A}^{< a}$ , $\mathfrak {A}^{\leqslant a}$ , $\mathfrak {A}^{> a}$ , and $\mathfrak {A}^{\geqslant a}$ are coloured quasi-finite linear orders too.

Proof (1) The set A is itself definable, hence, by $\mathsf {FWF}_k$ , A contains a greatest element.

(2) Given a non-least element $a \in A$ , the set $A^{< a}$ is definable in $\left (\mathfrak {A};a\right )$ , hence, by $\mathsf {FWF}_k$ , $A^{< a}$ contains a greatest element b, which must be the immediate predecessor of a.

(3) Suppose that $\varphi \left (x,\bar {z}\right )$ is a formula such that $\varphi \left (x,\bar {a}\right )$ defines a non-empty set B in $\left (\mathfrak {A};\bar {a}\right )$ . If B contains the least element of A then the proof is completed. Suppose otherwise. Then the set $B^< := \left \{x \in A : x < y \ \text {for all} \ y \in B\right \}$ is non-empty. Let $\psi \left (x,\bar {z}\right ) := \forall y \left ( \varphi \left (y,\bar {z}\right ) \rightarrow x < y \right )$ . Then $\psi \left (x,\bar {a}\right )$ defines $B^<$ in $\left (\mathfrak {A};\bar {a}\right )$ , hence, by $\mathsf {FWF}_k$ , $B^<$ contains a greatest element b. Since $b < y$ for all $y \in B$ and $B \neq \emptyset $ , then b is not the greatest element of A, hence, by $\mathsf {FD}$ , b has an immediate successor $b'$ which will be the least element of B, as required.

(4) Straightforward.⊣

More generally, observe that the following implications hold in any linear order:

$$ \begin{align*} \mathsf {FWF}_k \Longrightarrow \mathsf {BFWF}_k \Longrightarrow \mathsf {BD}\ \mathrm{and}\ \mathsf {BWF}_k \Longrightarrow \mathsf {BBWF}_k \Longrightarrow \mathsf {FD}.\end{align*}$$

Proposition 3.3. If $\mathfrak {A} = \left (A;<,\bar {\mathbf {c}}\right )$ is a coloured quasi-finite linear order then A is either finite or $\mathfrak {A}^-$ has the form $\omega + \zeta \cdot \alpha + \omega ^{\star }$ for some linear order $\alpha $ .

Proof Straightforward.⊣

Proposition 3.4 below and its proof are similar to [Reference Doets2, Theorem 3.2] except that there, restricted definable induction instead of forward well-foundedness is used to approximate finiteness. Proposition 3.4 and its proof are also similar to the “vertical collapsing” construction used in [Reference Rogers, Backofen and Vijay-Shankar15] to obtain finite tree paths from infinite ones. The construction there, like Proposition 3.4, also approximates finiteness using definable forward well-foundedness. Here, however, we also give an upper bound for the size of the coloured finite linear order produced by the construction, that is n-equivalent to the given coloured quasi-finite linear order. Recall, from the discussion of characteristic sentences in Section 2.1, that $f\left (L_k,n\right )$ is the number of non-equivalent characteristic sentences of quantifier rank n in the language $L_k$ of linear orders expanded with k colours, and that $f\left (L_k,n\right )$ is necessarily finite.

Proposition 3.4. For every coloured quasi-finite linear order $\mathfrak {A} = \left (A;<,\bar {\mathbf {c}}_k\right )$ with k colours and for each natural number n there exists a k-coloured finite linear order $\mathfrak {A}_n$ such that $\mathfrak {A} \equiv _n \mathfrak {A}_n$ . Moreover, taking

$$ \begin{align*}S := {\{ [\kern-0.15em[ \mathfrak{A}^{\geqslant u} ]\kern-0.15em]^n : u \in A\}} / \mathord{\equiv}\end{align*} $$

to be the set of equivalence classes of all the n-characteristic sentences of the orders $\mathfrak {A}^{\geqslant u}$ for $u \in A$ , the cardinality of $\mathfrak {A}_n$ is given by $\left |A_n\right | = \left | S \right | \leqslant f\left (L_k,n\right )$ .

Proof Let $\mathfrak {A}$ be a coloured quasi-finite linear order and let $m = \left |S\right |$ . We start by defining three sequences of coloured linear orders: $\mathfrak {B}_1,\mathfrak {B}_2,\ldots ,\mathfrak {B}_{m-1}$ , $\mathfrak {C}_1,\mathfrak {C}_2,\ldots ,\mathfrak {C}_{m-1}$ , and $\mathfrak {D}_1,\mathfrak {D}_2,\ldots ,\mathfrak {D}_{m-1}$ , where $\mathfrak {B}_i = \mathfrak {C}_i + \mathfrak {D}_i$ and where $\mathfrak {C}_i$ and $\mathfrak {D}_i$ are substructuresFootnote 3 of $\mathfrak {A}$ for each i, as described below. Informally, each $\mathfrak {C}_i$ will consist of the first i elements x in $\mathfrak {A}$ that are maximal with respect to satisfying their n-characteristic formula $[\kern -0.15em[ \mathfrak {A}^{\geqslant x} ]\kern -0.15em]^n\left (x\right )$ , and $\mathfrak {D}_i$ then consists of the elements in $\mathfrak {A}$ that are greater than all of the elements in $\mathfrak {C}_i$ . Formally, $\mathfrak {B}_i$ , $\mathfrak {C}_i$ , and $\mathfrak {D}_i$ will have the following properties for each i:

  1. (i) $\mathfrak {B}_i \equiv _n \mathfrak {A}$ ;

  2. (ii) for each $u \in B_i$ , $\mathfrak {B}_i^{\geqslant u} \equiv _n \mathfrak {A}^{\geqslant u}$ ;

  3. (iii) $\mathfrak {C}_i$ is a coloured finite linear order with domain $C_i$ , such that $\left |C_i \right | = i$ ;

  4. (iv) $\mathfrak {D}_i$ is a coloured quasi-finite linear order of the form $\mathfrak {A}^{> u}$ for some $u \in A$ ;

  5. (v) for $u \in C_i$ and $v \in B_i$ with $u \neq v$ , $\mathfrak {B}_i^{\geqslant u} \not \equiv _n \mathfrak {B}_i^{\geqslant v}$ .

The exact construction of $\mathfrak {C}_i$ and $\mathfrak {D}_i$ will now be described. Let a be the least element of $\mathfrak {A}$ . We define the formula

The set $\mathfrak {A}^{\varphi _0}$ is non-empty because $\mathfrak {A} \models \varphi _0\left (a\right )$ , and it is definable. Hence, by the forward well-foundedness of $\mathfrak {A}$ , $\mathfrak {A}^{\varphi _0}$ contains a greatest element $a_0$ . Since $a,a_0 \in \mathfrak {A}^{\varphi _0}$ then $\mathfrak {A}^{\geqslant a} \equiv _n \mathfrak {A}^{\geqslant a_0}$ hence

$$ \begin{align*}\mathfrak{A} = \mathfrak{A}^{\geqslant a} \equiv_n \mathfrak{A}^{\geqslant a_0} \cong \mathfrak{A}^{\left[a_0,a_0\right]} + \mathfrak{A}^{> a_0}.\end{align*} $$

Let $\mathfrak {C}_1 := \mathfrak {A}^{\left [a_0,a_0\right ]}$ , $\mathfrak {D}_1 := \mathfrak {A}^{> a_0}$ , and $\mathfrak {B}_1 := \mathfrak {C}_1 + \mathfrak {D}_1$ . Observe that properties (i)–(v) are all satisfied by $\mathfrak {B}_1$ , $\mathfrak {C}_1$ , and $\mathfrak {D}_1$ ; property (iv) follows using part 4 of Lemma 3.2; property (v) follows using property (ii) along with the fact $a_0$ is the greatest element in $\mathfrak {A}^{\varphi _0}$ ; the other properties are straightforward.

Given $\mathfrak {B}_i$ , $\mathfrak {C}_i$ , and $\mathfrak {D}_i$ for some i with $1 \leqslant i < m-1$ , that satisfy properties (i)–(v), the coloured linear orders $\mathfrak {B}_{i+1}$ , $\mathfrak {C}_{i+1}$ , and $\mathfrak {D}_{i+1}$ are obtained as follows. Since $\mathfrak {D}_i$ is a coloured quasi-finite linear order, it contains a least element, say $d_i$ . Define the formula

$$\begin{align*}\varphi_i\left(x\right) := \left(\left[\kern-0.15em[ \mathfrak{D}_i \right]\kern-0.15em]^n\right)^{\geqslant x}.\end{align*}$$

Again, the set $\mathfrak {D}_i^{\varphi _i}$ is non-empty (because $\mathfrak {D}_i \models \varphi _i\left (d_i\right )$ ) and definable, hence, by the definable forward well-foundedness of $\mathfrak {D}_i$ , $\mathfrak {D}_i^{\varphi _i}$ contains a greatest element $e_i$ . Then

(1) $$ \begin{align} \mathfrak{D}_i = \mathfrak{D}_i^{\geqslant d_i} \equiv_n \mathfrak{D}_i^{\geqslant e_i} \cong \mathfrak{D}_i^{\left[e_i,e_i\right]} + \mathfrak{D}_i^{> e_i}. \end{align} $$

Now take $\mathfrak {C}_{i+1} := \mathfrak {C}_i + \mathfrak {D}_i^{\left [e_i,e_i\right ]}$ , $\mathfrak {D}_{i+1} := \mathfrak {D}_i^{> e_i}$ , and $\mathfrak {B}_{i+1} := \mathfrak {C}_{i+1} + \mathfrak {D}_{i+1}$ , and again observe that properties (i)–(v) are all satisfied by $\mathfrak {B}_{i+1}$ , $\mathfrak {C}_{i+1}$ , and $\mathfrak {D}_{i+1}$ : for property (i),

$$ \begin{align*} \mathfrak{B}_{i+1} & = \mathfrak{C}_{i+1} + \mathfrak{D}_{i+1} = (\mathfrak{C}_i + \mathfrak{D}_i^{[e_i,e_i]}) + \mathfrak{D}_i^{> e_i} \\ &\cong \mathfrak{C}_i + (\mathfrak{D}_i^{[e_i,e_i]} + \mathfrak{D}_i^{> e_i}) \equiv_n \mathfrak{C}_i + \mathfrak{D}_i = \mathfrak{B}_i \equiv_n \mathfrak{A} \end{align*} $$

with the first instance of $\equiv _n$ following from (1) and Lemma 2.1; properties (ii) and (iii) are straightforward; property (iv) again follows using part 4 of Lemma 3.2 using the fact that $\mathfrak {D}_i$ is a coloured quasi-finite linear order; property (v) follows using property (ii) for $\mathfrak {B}_{i+1}$ , along with how $\mathfrak {C}_{i+1}$ and $\mathfrak {D}_{i+1}$ were constructed.

Finally, since $\left |S\right | = m$ there are at most m pairwise non-equivalent n-characteristic sentences that are satisfied by the structures $\mathfrak {B}_{m-1}^{\geqslant u}$ for $u \in B_{m-1}$ . Using property (v), the $m-1$ structures of the type $\mathfrak {B}_{m-1}^{\geqslant u}$ , where u ranges over $C_{m-1}$ , must all have pairwise non-equivalent n-characteristic sentences. Hence all structures of the type $\mathfrak {D}_{m-1}^{\geqslant u}$ , where $u \in D_{m-1}$ , must have equivalent n-characteristic sentences. In particular, the n-characteristic sentences of each of these structures $\mathfrak {D}_{m-1}^{\geqslant u}$ must be equivalent to the n-characteristic sentence of $\mathfrak {D}_{m-1}^{\geqslant b}$ , where b is the greatest element of $\mathfrak {D}_{m-1}$ . It follows that $\left |D_{m-1}\right |=1$ and so $\left |B_{m-1}\right | = m$ , i.e., $\mathfrak {A}_n := \mathfrak {B}_{m-1}$ may be taken to be a coloured finite linear order such that $\mathfrak {A} \equiv _n \mathfrak {A}_n$ , as required.⊣

Lemma 3.5. If $\mathfrak {A}$ and $\mathfrak {B}$ are k-coloured quasi-finite linear orders then $\mathfrak {A} + \mathfrak {B}$ is a k-coloured quasi-finite linear order, too.

Proof Let n be any natural number. By Proposition 3.4 there exist k-coloured finite linear orders $\mathfrak {A}'$ and $\mathfrak {B}'$ such that $\mathfrak {A}' \equiv _n \mathfrak {A}$ and $\mathfrak {B}' \equiv _n \mathfrak {B}$ . By Lemma 2.1, $\mathfrak {A} + \mathfrak {B} \equiv _n \mathfrak {A}' + \mathfrak {B}'$ . Since $\mathfrak {A}' + \mathfrak {B}'$ is a k-coloured finite linear order then $\mathfrak {A}' + \mathfrak {B}' \models \mathsf {CFLO}_k$ and so $\mathfrak {A} + \mathfrak {B}$ satisfies all sentences of quantifier rank at most n in $\mathsf {CFLO}_k$ . Since n is arbitrary, it follows that $\mathfrak {A} + \mathfrak {B} \models \mathsf {CFLO}_k$ , hence $\mathfrak {A} + \mathfrak {B}$ is itself a k-coloured quasi-finite linear order, as required.⊣

3.3 The first-order theories of coloured well-orders and expansions of $\omega $

We now define the following theories, which will be shown further to axiomatise the first-order theories of the classes of all coloured well-orders, of all coloured expansions of $\omega $ , and of all coloured expansions of $\omega ^{\star }$ (the reverse order of $\omega $ ), respectively:

$$ \begin{align*} \mathsf{CWO}_k & := \left\{\mathsf{LO}\right\} \cup \mathsf{BWF}_k, \\ \mathsf{C}\omega_k & := \left\{\mathsf{LO},\neg\mathsf{Gr},\mathsf{BD}\right\} \cup \mathsf{BWF}_k, \\ \mathsf{C}\omega^{\star}_k & := \left\{\mathsf{LO},\neg\mathsf{Le},\mathsf{FD}\right\} \cup \mathsf{FWF}_k. \end{align*} $$

Fact 3.6 ([Reference Doets2, Corollary 4.4])

For every model $\mathfrak {A}$ of $\mathsf {CWO}_k$ and each $n \in \mathbb {N}$ , there exists a k-coloured well-order $\mathfrak {B}_n$ such that $\mathfrak {A} \equiv _n \mathfrak {B}_n$ .

Corollary 3.7 below is similar to [Reference Doets2, Theorem 3.1] but uses backward well-foundedness instead of definable induction for approximating the structure of coloured $\omega $ .

Corollary 3.7. For every model $\mathfrak {A}$ of $\mathsf {C}\omega _k$ and each $n \in \mathbb {N}$ , there exists a k-coloured linear order $\mathfrak {B}_n$ such that $\left (\mathfrak {B}_n\right )^- \cong \omega $ and $\mathfrak {A} \equiv _n \mathfrak {B}_n$ .

Proof It suffices to prove the result for $n \geqslant 3$ . By Fact 3.6, there exists a k-coloured well-order $\mathfrak {B}_n$ such that $\mathfrak {A} \equiv _n \mathfrak {B}_n$ . Since the quantifier ranks of the sentences $\neg \mathsf {Gr}$ and $\mathsf {BD}$ are at most $3$ , $\mathfrak {B}_n \models \left \{\neg \mathsf {Gr},\mathsf {BD}\right \}$ and, since $\mathfrak {B}_n$ is a well-order, $\mathfrak {B}_n \models \left \{\mathsf {Le},\mathsf {FD}\right \}$ . By Fact 3.1, $\left (\mathfrak {B}_n\right )^-$ is a model of the first-order theory of $\omega $ and also a well-order, from which it follows that $\left (\mathfrak {B}_n\right )^- \cong \omega $ .⊣

Using a similar argument, it follows that the theory $\mathsf {C}\omega ^{\star }_k$ axiomatises the first-order theory of the class of coloured expansions of $\omega ^*$ :

Corollary 3.8. For every model $\mathfrak {A}$ of $\mathsf {C}\omega ^{\star }_k$ and each $n \in \mathbb {N}$ , there exists a k-coloured linear order $\mathfrak {B}_n$ such that $\left (\mathfrak {B}_n\right )^- \cong \omega ^*$ and $\mathfrak {A} \equiv _n \mathfrak {B}_n$ .

3.4 The first-order theory of coloured expansions of $\zeta $

We now define the theory

$$ \begin{align*}\mathsf{C}\zeta_k := \left\{\mathsf{LO},\neg\mathsf{Le},\neg\mathsf{Gr}\right\} \cup \mathsf{BBWF}_k \cup \mathsf{BFWF}_k.\end{align*} $$

Proposition 3.9. For every model $\mathfrak {A}$ of $\mathsf {C}\zeta _k$ and each $n \in \mathbb {N}$ , there is a k-coloured linear order $\mathfrak {B}_n$ , with $\left (\mathfrak {B}_n\right )^- \cong \zeta $ , such that $\mathfrak {A} \equiv _n \mathfrak {B}_n$ .

Proof Let $\mathfrak {A} \models \mathsf {C}\zeta _k$ and observe that $\mathfrak {A} \models \left \{\mathsf {BD},\mathsf {FD}\right \}$ . Pick any element $a \in A$ and let b be its immediate predecessor. Then $\mathfrak {A}^{\geqslant a}$ satisfies the theory $\mathsf {C}\omega _k$ and $\mathfrak {A}^{\leqslant b}$ satisfies the theory $\mathsf {C}\omega ^{\star }_k$ , hence, by Corollaries 3.7 and 3.8, there exists a k-coloured linear order $\mathfrak {C}$ such that $\mathfrak {C}^- \cong \omega $ and $\mathfrak {A}^{\geqslant a} \equiv _n \mathfrak {C}$ , and there exists a k-coloured linear order $\mathfrak {D}$ such that $\mathfrak {D}^- \cong \omega ^*$ and $\mathfrak {A}^{\leqslant b} \equiv _n \mathfrak {D}$ . Then $\mathfrak {B}_n = \mathfrak {D} + \mathfrak {C}$ is the desired coloured linear order, since $\mathfrak {A} \equiv _n \mathfrak {B}_n$ follows from Lemma 2.1.⊣

3.5 The first-order theory of coloured ordinals

Let $\alpha $ be a monochromatic ordinal such that $\omega < \alpha < \omega ^{\omega }$ . It is known (see [Reference Rosenstein16, Section 13.2]) that the FO theory of $\alpha $ is finitely axiomatised, i.e., there exists a first-order sentence $\Phi \left (\alpha \right )$ in the language $L_0$ such that, for any monochromatic linear order $\mathfrak {A}$ ,

$$ \begin{align*}\mathfrak{A} \models \Phi\left(\alpha\right) \Longleftrightarrow \mathfrak{A} \equiv \alpha,\end{align*} $$

and $\alpha $ is the only ordinal that is a model of $\Phi \left (\alpha \right )$ . The formulation of $\Phi \left (\alpha \right )$ is somewhat involved and for details we refer the reader to [Reference Rosenstein16, Section 13.2]. We now define the theory

$$ \begin{align*}\mathsf{C}\alpha_k := \left\{\Phi\left(\alpha\right)\right\} \cup \mathsf{BWF}_k.\end{align*} $$

Proposition 3.10. For every model $\mathfrak {A}$ of $\mathsf {C}\alpha _k$ and each $n \in \mathbb {N}$ there is a k-coloured linear order $\mathfrak {B}_n$ such that $\left (\mathfrak {B}_n\right )^- \cong \alpha $ and $\mathfrak {A} \equiv _n \mathfrak {B}_n$ .

Proof Let $\mathfrak {A} \models \mathsf {C}\alpha _k$ and let $n \geqslant \mathrm {qr}\left (\Phi \left (\alpha \right )\right )$ . Since $\mathfrak {A} \models \Phi \left (\alpha \right )$ then $\mathfrak {A} \models \mathsf {LO}$ . By Fact 3.6, there exists a k-coloured well-order $\mathfrak {B}_n$ such that $\mathfrak {A} \equiv _n \mathfrak {B}_n$ , and since $\mathfrak {B}_n \models \Phi \left (\alpha \right )$ then $\left (\mathfrak {B}_n\right )^- \equiv \alpha $ . Since $\mathfrak {B}_n$ is a well-order and $\alpha $ is the only ordinal that is a model of $\Phi \left (\alpha \right )$ , it follows that $\left (\mathfrak {B}_n\right )^- \cong \alpha $ , as required.⊣

3.6 The first-order theory of coloured expansions of $\eta $

We now define the theory

$$ \begin{align*}\mathsf{C}\eta_k := \left\{\mathsf{LO},\neg\mathsf{Le},\neg\mathsf{Gr},\mathsf{De}\right\}.\end{align*} $$

Proposition 3.11. For every model $\mathfrak {A}$ of $\mathsf {C}\eta _k$ , there is a k-coloured linear order $\mathfrak {B}$ , with $\mathfrak {B}^- \cong \eta $ , such that $\mathfrak {A} \equiv \mathfrak {B}$ .

Proof By the Löwenheim–Skolem Theorem, there exists a countable structure $\mathfrak {B}$ such that $\mathfrak {B} \equiv \mathfrak {A}$ . Since $\mathfrak {B} \models \mathsf {C}\eta _k$ , $\mathfrak {B}$ is a countable k-coloured dense linear order without endpoints, hence $\mathfrak {B}^- \cong \eta $ .⊣

3.7 The first-order theory of coloured expansions of $\lambda $

An axiomatisation of the first-order theory of the class of coloured expansions of $\lambda $ is obtained in [Reference Doets2]. For the sake of completeness of this paper we will briefly describe that axiomatisation here, as it will be needed in Section 5 to axiomatise the first-order theory of the class of $\lambda $ -trees.

A coloured linear order is called definably bounded complete when each of its non-empty definable subsets that is bounded above has a least upper bound. For any formula $\varphi \left (x,\bar {z}\right )$ (where the tuple $\bar {z}$ may be empty) we define the formula

$$ \begin{align*}\mathsf{ub}_{\varphi}{\left(u,\bar{z}\right)} := \forall x \left(\varphi\left(x,\bar{z}\right) \rightarrow x \leqslant u \right),\end{align*} $$

which expresses that u is an upper bound of the set that is defined by $\varphi \left (x,\bar {z}\right )$ . The property of definable bounded completeness can now be expressed by the set $\mathsf {COM}_k$ that consists of all sentences of the form

$$ \begin{align*} \mathsf{COM}_k\left( \varphi\right) & := \forall \bar{z} (( \exists x( \varphi(x,\bar{z})) \wedge \exists u ( \mathsf{ub}_{\varphi}{(u,\bar{z})})) \\ & \;\,\quad \rightarrow \exists u( \mathsf{ub}_{\varphi}{(u,\bar{z})} \wedge \forall v( \mathsf{ub}_{\varphi}{(v,\bar{z})} \rightarrow u \leqslant v))) \end{align*} $$

for any formula $\varphi \left (x,\bar {z}\right )$ in the language $L_k$ .

Given a linear order $\mathfrak {A}$ and a partition P of A of which all members are intervals, we define the binary relation $<_{\mathfrak {P}}$ on P by specifying, for $X,Y \in P$ , that $X <_{\mathfrak {P}} Y$ iff $x <_{\mathfrak {A}} y$ for some $x \in X$ and $y \in Y$ . The linear order $\mathfrak {P} := \left (P;<_{\mathfrak {P}}\right )$ is called a condensation of $\mathfrak {A}$ . Given a transitive binary relation R on A, we define the relation $\sim _R$ on A by specifying, for $u,v \in A$ , that $u \sim _R v$ if and only if any of the following conditions hold:

  1. (i) $u = v$ ,

  2. (ii) $u <_{\mathfrak {A}} v$ and $sRt$ for all $s,t \in A$ such that $u \leqslant _{\mathfrak {A}} s <_{\mathfrak {A}} t \leqslant _{\mathfrak {A}} v$ , or

  3. (iii) $v <_{\mathfrak {A}} u$ and $sRt$ for all $s,t \in A$ such that $v \leqslant _{\mathfrak {A}} s <_{\mathfrak {A}} t \leqslant _{\mathfrak {A}} u$ .

It is straightforward to see that for any transitive binary relation R, the relation $\sim _R$ is an equivalence relation, hence the set is a partition of A into intervals, and so the structure $\mathfrak {Q} = \left (Q;<_{\mathfrak {Q}}\right )$ is a condensation of $\mathfrak {A}$ . The relation R is said to induce the condensation $\mathfrak {Q}$ . If the relation R is definable then $\mathfrak {Q}$ is called a definable condensation. For any formula $\varphi \left (x,y,\bar {z}\right )$ (with the tuple $\bar {z}$ possibly empty) we define the formulas

$$ \begin{align*} \mathsf{bin}_{\varphi}{\left(\bar{z}\right)} & := \forall x \forall y \forall v (( \varphi(x,y,\bar{z}) \wedge \varphi(y,v,\bar{z})) \rightarrow \varphi(x,v,\bar{z})), \\ \mathsf{part}_{\varphi}{\left(u,v,\bar{z}\right)} & := u < v \wedge \forall s \forall t ( u \leqslant s < t \leqslant v \rightarrow \varphi(s,t,\bar{z})), \\ \mathsf{eq}_{\varphi}{\left(u,v,\bar{z}\right)} & := u = v \vee \mathsf{part}_{\varphi}{\left(u,v,\bar{z}\right)} \vee \mathsf{part}_{\varphi}{\left(v,u,\bar{z}\right)}, \\ \mathsf{dense}_{\varphi}{\left(\bar{z}\right)} & := \forall u \forall v \big( \left( u < v \wedge \neg\mathsf{eq}_{\varphi}{\left(u,v,\bar{z}\right)} \right) \\ & \;\,\quad \rightarrow \exists w ( u < w < v \wedge \neg\mathsf{eq}_{\varphi}{(u,w,\bar{z})} \wedge \neg\mathsf{eq}_{\varphi}{(w,v,\bar{z})})), \ \text{and} \\ \mathsf{singleton}_{\varphi}{\left(\bar{z}\right)} & := \forall u \forall v (( u < v \wedge \neg\mathsf{eq}_{\varphi}{(u,v,\bar{z})}) \rightarrow \exists w ( u < w < v \, \\ & \;\,\quad \wedge \neg\mathsf{eq}_{\varphi}{(u,w,\bar{z})} \wedge \neg\mathsf{eq}_{\varphi}{(w,v,\bar{z})} \wedge \forall t ( \mathsf{eq}_{\varphi}{(w,t,\bar{z})} \rightarrow t = w))). \end{align*} $$

The formula $\mathsf {bin}_{\varphi }{\left (\bar {z}\right )}$ expresses that $\varphi \left (x,y,\bar {z}\right )$ defines a transitive binary relation R, the formula $\mathsf {part}_{\varphi }{\left (u,v,\bar {z}\right )}$ expresses property (ii) in the above definition of the relation $\sim _R$ , $\mathsf {eq}_{\varphi }{\left (u,v,\bar {z}\right )}$ expresses that $u \sim _R v$ , $\mathsf {dense}_{\varphi }{\left (\bar {z}\right )}$ expresses that the condensation that is induced by R is densely ordered, and $\mathsf {singleton}_{\varphi }{\left (\bar {z}\right )}$ expresses that the condensation that is induced by R contains a set of singletons that is dense in the condensation.

Following the terminology of [Reference Doets2], a linear order $\mathfrak {A}$ is called definably- I when each of its densely ordered definable condensations contains a set of singletons that is dense in the condensation. The property of a k-coloured linear order being definably-I approximates the countable chain condition of $\mathbb {R}$ , and can be expressed by the set $\mathsf {I}_k$ that consists of all sentences of the form

$$ \begin{align*}\mathsf{I}_k(\varphi):= \forall \bar{z} \left( \big( \mathsf{bin}_{\varphi}{\left(\bar{z}\right)} \wedge \mathsf{dense}_{\varphi}{\left(\bar{z}\right)} \big) \rightarrow \mathsf{singleton}_{\varphi}{\left(\bar{z}\right)} \right)\end{align*} $$

for any formula $\varphi \left (x,y,\bar {z}\right )$ in the language $L_k$ and with the tuple $\bar {z}$ possibly empty.

Finally, the first-order theory of the class of k-coloured expansions of $\lambda $ is axiomatised by the theory

$$ \begin{align*}C\lambda_k := \left\{\mathsf{LO},\neg\mathsf{Le},\neg\mathsf{Gr},\mathsf{De}\right\} \cup \mathsf{COM}_k \cup \mathsf{I}_k,\end{align*} $$

as is seen from the following fact, the proof of which can be found in [Reference Doets2]:

Fact 3.12 [Reference Doets2]

For every model $\mathfrak {A}$ of $C\lambda _k$ and each $n \in \mathbb {N}$ , there exists a k-coloured linear order $\mathfrak {B}_n$ such that $\left (\mathfrak {B}_n\right )^- \cong \lambda $ and $\mathfrak {A} \equiv _n \mathfrak {B}_n$ .

4 From trees to coloured linear orders and back

4.1 Trees as coloured linear orders

A tree $\mathfrak {T}$ is called branching complete when for any two distinct paths $\mathsf {A}$ and $\mathsf {B}$ in $\mathfrak {T}$ , each of the sets $\mathsf {A} \setminus \mathsf {B}$ and $\mathsf {B} \setminus \mathsf {A}$ has an infimum in $\mathfrak {T}$ . Branching completeness is equivalent to the following property: for any antichain $\left \{a,b\right \} \subseteq T$ , the set

(2) $$ \begin{align} T_{ab} := \{ x \in T^{\leqslant a}: x \geqslant y \ \text{for each} \ y \in T^{\leqslant a} \cap T^{\leqslant b}\} \end{align} $$

has a least element. Recall the formula $\mathsf {ub}_{\varphi }(u,\bar {z})$ that was defined in Section 3.7 and which expresses the fact that u is an upper bound of the set that is defined by $\varphi (x,\bar {z})$ . Letting $\varphi \left (x,z_1,z_2\right ) := x \leqslant z_1 \wedge x \leqslant z_2$ , the property of branching completeness can therefore be expressed by the sentence

$$ \begin{align*} \mathsf{bc} &:= \forall z_1 \forall z_2 ( \neg (z_1 < z_2 \vee z_2 < z_1 \vee z_1 = z_2) \\ & \;\,\quad \rightarrow \exists u ( u \leqslant z_1 \wedge \mathsf{ub}_{\varphi}(u,z_1,z_2) \wedge \forall v (( v \leqslant z_1 \wedge \mathsf{ub}_{\varphi}(v,z_1,z_2)) \rightarrow u \leqslant v))). \end{align*} $$

A sufficient condition for branching completeness is that every path in the tree is a complete linear order. Thus, in particular, every $\lambda $ -tree is branching complete.

Remark 4.1. There are other notions of completeness of trees known from the literature, such as Dedekind–MacNeille completeness and others, cf.Footnote 4 [Reference Droste4, Reference Kellerman, Goranko and Zanardo12, Reference Rubin17]. The notion of branching completeness introduced here is mostly ad hoc, as it is sufficient for our purposes, but it is also of some independent interest. As shown in [Reference Kellerman, Goranko and Zanardo12], branching completeness is generally weaker than Dedekind–MacNeille completeness of trees.

For $\mathfrak {T}$ any tree, $\mathsf {A}$ any stem in $\mathfrak {T}$ ( $\mathsf {A}$ will usually be a path), and $s \in \mathsf {A}$ , we define the following sets (to be explained shortly):

$$ \begin{align*} &\qquad F_l{\left(s\right)} := \left\{ \begin{array}{cl} \emptyset & \text{when}\ s\ \text{has an immediate predecessor,} \\ \displaystyle \left(\bigcap_{t<s}T^{>t}\right) \backslash T^{\geqslant s} & \text{otherwise,} \end{array} \right. \\ &\text{and} \ \ F_u{\left( s \backslash \mathsf{A} \right)} := T^{> s} \backslash \left(\,\bigcup_{t \in \mathsf{A} \cap T^{>s}} \!\!\!\! T^{\geqslant t} \right), \end{align*} $$

and, provided that these sets are non-empty, define the structures $\mathfrak {F}_l{\left (s\right )} := \mathfrak {T}^{F_l{\left (s\right )}}$ and $\mathfrak {F}_u{\left ( s \backslash \mathsf {A} \right )} := \mathfrak {T}^{F_u{\left ( s \backslash \mathsf {A} \right )}}$ . In general, either or both of the sets $F_l{\left (s\right )}$ and $F_u{\left ( s \backslash \mathsf {A} \right )}$ may be empty, in which case the corresponding structure is left undefined. The structures $\mathfrak {F}_l{\left (s\right )}$ and $\mathfrak {F}_u{\left ( s \backslash \mathsf {A} \right )}$ , if defined, are forests and will be called the lower side-forest of s, and the upper side-forest of s with respect to $\mathsf {A}$ , respectively. Finally, the side-forest of s with respect to $\mathsf {A}$ is the forest $\mathfrak {F}\left ( s \backslash \mathsf {A} \right ) := (\mathfrak {T}^{F( s \backslash \mathsf {A} )};s )$ where

$$ \begin{align*} F\left( {s}\backslash \mathsf{A} \right) := \{s\} \cup F_l{\left(s\right)} \cup F_u{\left( s \backslash \mathsf{A} \right)}. \end{align*} $$

The forests $\mathfrak {F}_l{\left (s\right )}$ and $\mathfrak {F}_u{\left ( s \backslash \mathsf {A} \right )}$ are depicted in Figure 2.

Figure 2 The side-forests $\mathfrak {F}_l{\left (s\right )}$ and $\mathfrak {F}_u{\left ( s \backslash \mathsf {A} \right )}$ .

Intuitively, the upper side-forest of s with respect to $\mathsf {A}$ consists of those nodes that sit above s but do not sit above any nodes on $\mathsf {A} \cap T^{>s}$ , and the lower side-forest of s consists of those nodes that sit above $T^{<s}$ but are incomparable with s, unless s has an immediate predecessor, in which case its lower side-forest is left empty so as not to coincide with the upper side-forest of that predecessor. Observe that if $\mathsf {A}$ is a stem with a greatest node b then $F_u{\left ( b \backslash \mathsf {A} \right )} = T^{>b}$ .

The property of branching completeness ensures that if $\mathsf {A}$ is either a path in $\mathfrak {T}$ , or a stem with a greatest node, then the set $\displaystyle \left \{ F\left ( s \backslash \mathsf {A} \right ) \right \}_{s \in \mathsf {A}}$ forms a partition of T. Indeed, each side-forest $F\left ( s \backslash \mathsf {A} \right )$ is non-empty since $s \in F\left ( s \backslash \mathsf {A} \right )$ , and different side-forests $F\left ( s \backslash \mathsf {A} \right )$ and $F\left ( t \backslash \mathsf {A} \right )$ are disjoint by the way that side-forests are defined. To see that $\displaystyle \left \{ F\left ( s \backslash \mathsf {A} \right ) \right \}_{s \in \mathsf {A}}$ covers T, pick any node $u \in T$ , and consider the following cases.

  • Case 1: $u \in \mathsf {A}$ . Then $u \in F\left ( u \backslash \mathsf {A} \right )$ .

  • Case 2: $u \not \in \mathsf {A}$ .

    • Case 2.1: $u \geqslant x$ for each $x \in \mathsf {A}$ . Then $\mathsf {A}$ must be a stem with a greatest node b, in which case $u \in F\left ( b \backslash \mathsf {A} \right )$ .

    • Case 2.2: There exists $v \in \mathsf {A}$ such that $\left \{u,v\right \}$ is an antichain. By the branching completeness of $\mathfrak {T}$ , the set $T_{vu}$ has a least element $w \in \mathsf {A}$ .

      • Case 2.2.1: $w < u$ . Then $u \in F_u{\left ( w \backslash \mathsf {A} \right )}$ .

      • Case 2.2.2: $w \not < u$ . Then $u \in F_l{\left (w\right )}$ .

This shows that $\bigcup _{s \in \mathsf {A}} F\left ( s \backslash \mathsf {A} \right ) = T$ .

Given $s,t \in \mathsf {A}$ such that $t> s$ , the sets $F_l{\left (s\right )}$ , $F_u{\left ( s \backslash \mathsf {A} \right )}$ , and $F\left ( s \backslash \mathsf {A} \right )$ can be defined in $\left (\mathfrak {T};s,t\right )$ (the expansion of $\mathfrak {T}$ obtained by adding the constants s and t) by the formulas

$$ \begin{align*} &\mathsf{lsf}\left(x,s\right) := \big(\exists u \big( u < s \wedge \neg \exists v \left( u < v < s \right) \big) \rightarrow \neg \left(x = x\right) \big) \\ & \quad \wedge \big( \forall u \big( u < s \rightarrow \exists v \left( u < v < s \right) \big) \rightarrow \big( \forall w \left(w < s \rightarrow w < x \right) \wedge \neg \left(s \leqslant x\right) \big) \big),\\ &\kern-12pt\mathsf{usf}\left(x,s,t\right) := s < x \wedge \neg \exists u \left( s < u \leqslant t \wedge u \leqslant x \right), \ \text{and} \\ &\kern-7pt\mathsf{sf}\left(x,s,t\right) := \left(x = s\right) \vee \mathsf{lsf}\left(x,s\right) \vee \mathsf{usf}\left(x,s,t\right). \end{align*} $$

(The first conjunct in $\mathsf {lsf}\left (x,s\right )$ covers the case where s has an immediate predecessor, and the second conjunct covers the case where s has no immediate predecessor.)

Now suppose that the tree $\mathfrak {T}$ has $k\geq 0$ colours, say $\mathfrak {T} = \left (T;<,\bar {\mathbf {c}}\right )$ where $\bar {\mathbf {c}} = \bar {\mathbf {c}}_k$ (or, simply $\mathfrak {T} = \left (T;<\right )$ if $k=0$ ). Recall that $L_k'$ denotes the signature $L_k$ of k-coloured trees, expanded with a single constant symbol, and $f\left (L_k',n\right )$ denotes the number of non-equivalent n-characteristic sentences of k-coloured trees that have an added constant symbol. Let $p = f\left (L_k',n\right )$ and define an additional set of colours, called extended colours, by $\bar {\mathbf {e}} := \bar {\mathbf {e}}_p = \left (\mathbf {e}_{1},\mathbf {e}_{2},\ldots ,\mathbf {e}_{p}\right )$ . Recall the definition of the sentence $\tau _{L,n,i}$ from the discussion on characteristic formulas in Section 2.1. Now, consider the coloured linear order

$$ \begin{align*} \mathsf{A}^{\mathfrak{T}}\left[\bar{\mathbf{e}}\right] := \left(\mathfrak{T}^{\mathsf{A}};\bar{\mathbf{e}}\right) = \left(\mathsf{A};<,\bar{\mathbf{c}},\bar{\mathbf{e}}\right) \ \ \ \ \text{(or, just} \left(\mathsf{A};<,\bar{\mathbf{e}}\right)\ \text{when}\ \mathfrak{T}\ \text{is monochromatic)} \end{align*} $$

and where the extended colours are defined by specifying, for each $s \in \mathsf {A}$ and each i such that $1 \leqslant i \leqslant p$ ,

$$ \begin{align*}\mathsf{A}^{\mathfrak{T}}\left[\bar{\mathbf{e}}\right] \models \mathbf{e}_{i}\left(s\right) \ \ \text{if and only if} \ \ \mathfrak{F}\left( s \backslash \mathsf{A} \right) \models \tau_{L_k',n,i},\end{align*} $$

i.e., the extended colour of s corresponds to the n-characteristic sentence that is satisfied in $\mathfrak {F}\left ( s \backslash \mathsf {A} \right )$ ).

The linear order $\mathsf {A}^{\mathfrak {T}}\left [\bar {\mathbf {e}}\right ]$ can be viewed as the k-coloured stem $\mathsf {A}$ of $\mathfrak {T}$ that is further enriched with p additional colours, the role of which is to capture, for each $s \in \mathsf {A}$ , the n-equivalence class of the side-forest $\mathfrak {F}\left ( s \backslash \mathsf {A} \right )$ when viewed within $\mathfrak {T}$ . This will allow us to approximate a tree as a coloured linear order, and thereby adapt results on coloured linear orders to trees.

The next theorem relates the first-order equivalence of two trees to the first-order equivalence of their corresponding coloured linear orders.

Theorem 4.2. Let $k\geq 0$ and $\mathfrak {T}_1$ and $\mathfrak {T}_2$ be k-coloured branching complete trees. Let $\mathsf {A}$ and $\mathsf {B}$ either both be paths, or both be stems with greatest elements in $\mathfrak {T}_1$ and $\mathfrak {T}_2$ respectively. Let $n \in \mathbb {N}$ and $\bar {\mathbf {e}} = \bar {\mathbf {e}}_{f\left (L_k',n\right )}$ . For any $m \in \mathbb {N}$ :

  1. 1. If $\left (\mathfrak {T}_1;\mathsf {A}\right ) \equiv _{m+n+1} \left (\mathfrak {T}_2;\mathsf {B}\right )$ then $\mathsf {A}^{\mathfrak {T}_1}\left [\bar {\mathbf {e}}\right ] \equiv _m \mathsf {B}^{\mathfrak {T}_2}\left [\bar {\mathbf {e}}\right ]$ .

  2. 2. If $\mathsf {A}^{\mathfrak {T}_1}\left [\bar {\mathbf {e}}\right ] \equiv _n \mathsf {B}^{\mathfrak {T}_2}\left [\bar {\mathbf {e}}\right ]$ then $\left (\mathfrak {T}_1;\mathsf {A}\right ) \equiv _n \left (\mathfrak {T}_2;\mathsf {B}\right )$ .

Proof The result will be proved assuming that $\mathsf {A}$ and $\mathsf {B}$ are paths; the proof for the case where they are instead stems with greatest elements, is identical.

(1) We will reason by contraposition. Suppose that Spoiler has a winning strategy $\Sigma $ in the game $\mathrm {EF}_{m}\!\left (\mathsf {A}^{\mathfrak {T}_1}\left [\bar {\mathbf {e}}\right ],\mathsf {B}^{\mathfrak {T}_2}\left [\bar {\mathbf {e}}\right ]\right )$ . Then Spoiler can win the game $\mathrm {EF}_{m+n+1}\!\left (\left (\mathfrak {T}_1;\mathsf {A}\right ),\left (\mathfrak {T}_2;\mathsf {B}\right )\right )$ as follows. For the first m rounds of the game, Spoiler plays only on the paths $\mathsf {A}$ and $\mathsf {B}$ (so, Duplicator will have no choice but to also play on these paths) by choosing nodes according to the strategy $\Sigma $ in the parallel game $\mathrm {EF}_{m}\!\left (\mathsf {A}^{\mathfrak {T}_1}\left [\bar {\mathbf {e}}\right ],\mathsf {B}^{\mathfrak {T}_2}\left [\bar {\mathbf {e}}\right ]\right )$ . If, after m rounds, the resulting nodes $a_1,a_2,\ldots ,a_m \in \mathsf {A}$ and $b_1,b_2,\ldots ,b_m \in \mathsf {B}$ (where $a_i$ and $b_i$ are the nodes chosen by the players in the i-th round) do not form a local isomorphism between the trees $\left (\mathfrak {T}_1;\mathsf {A}\right )$ and $\left (\mathfrak {T}_2;\mathsf {B}\right )$ then Spoiler wins the game $\mathrm {EF}_{m+n+1}\!\left (\left (\mathfrak {T}_1;\mathsf {A}\right ),\left (\mathfrak {T}_2;\mathsf {B}\right )\right )$ . Now, suppose the nodes do form a local isomorphism between $\left (\mathfrak {T}_1;\mathsf {A}\right )$ and $\left (\mathfrak {T}_2;\mathsf {B}\right )$ . Then there must exist nodes $a_q \in \mathsf {A}$ and $b_q \in \mathsf {B}$ such that $a_q$ and $b_q$ have different extended colours in $\mathsf {A}^{\mathfrak {T}_1}\left [\bar {\mathbf {e}}\right ]$ and $\mathsf {B}^{\mathfrak {T}_2}\left [\bar {\mathbf {e}}\right ]$ . Spoiler can then win the game $\mathrm {EF}_{m+n+1}\!\left (\left (\mathfrak {T}_1;\mathsf {A}\right ),\left (\mathfrak {T}_2;\mathsf {B}\right )\right )$ in the next $n+1$ rounds as follows.

First observe that, in rounds $m+1$ to $m+n$ of $\mathrm {EF}_{m+n+1}\!\left (\left (\mathfrak {T}_1;\mathsf {A}\right ),\left (\mathfrak {T}_2;\mathsf {B}\right )\right )$ , whenever Spoiler chooses a node $a_i$ from $F\left ( a_q \backslash \mathsf {A} \right )$ in round i of the game, Duplicator has to choose her response from $F\left ( b_q \backslash \mathsf {B} \right )$ , and vice versa. To see this, suppose that $a_i \in F\left ( a_q \backslash \mathsf {A} \right )$ but that $b_i \not \in F\left ( b_q \backslash \mathsf {B} \right )$ . Then $a_i \neq a_q$ for if $a_i = a_q$ then $b_i$ would have to be the node $b_q$ thus giving $b_i \in F\left ( b_q \backslash \mathsf {B} \right )$ , a contradiction. It follows that $a_i \not \in \mathsf {A}$ , hence $b_i$ cannot lie on $\mathsf {B}$ . So, there are two cases to consider: either $a_q < a_i$ or $a_i$ is incomparable with $a_q$ . If $a_q < a_i$ then it must be the case that $b_q < b_i$ too, from which, since $b_i \not \in F\left ( b_q \backslash \mathsf {B} \right )$ , there exists $b \in \mathsf {B}$ such that $b_q < b < b_i$ . If Spoiler then chooses the node $b \in T_2$ in round $i+1$ of the game, he will have won, since there is no node $a \in \mathsf {A}$ for Duplicator to choose in round $i+1$ that satisfies $a_q < a < a_i$ . The case where $a_i$ is incomparable with $a_q$ similarly leads to a win for Spoiler. Now since $a_q$ and $b_q$ have different extended colours in $\mathsf {A}^{\mathfrak {T}_1}\left [\bar {\mathbf {e}}\right ]$ and $\mathsf {B}^{\mathfrak {T}_2}\left [\bar {\mathbf {e}}\right ]$ then $\mathfrak {F}\left ( a_q \backslash \mathsf {\mathsf {A}} \right ) \not \equiv _n \mathfrak {F}\left ( b_q \backslash \mathsf {\mathsf {B}} \right )$ . By using a winning strategy for $\mathrm {EF}_{n}\!\left (\mathfrak {F}\left ( a_q \backslash \mathsf {\mathsf {A}} \right ),\mathfrak {F}\left ( b_q \backslash \mathsf {\mathsf {B}} \right )\right )$ , Spoiler can create a configuration of nodes in rounds $m+1$ through to n of the game $\mathrm {EF}_{m+n+1}\!\left (\left (\mathfrak {T}_1;\mathsf {A}\right ),\left (\mathfrak {T}_2;\mathsf {B}\right )\right )$ that Duplicator cannot match.

(2) Suppose that Duplicator has a winning strategy in the n-round game $\mathrm {EF}_{n}\!\left (\mathsf {A}^{\mathfrak {T}_1}\left [\bar {\mathbf {e}}\right ],\mathsf {B}^{\mathfrak {T}_2}\left [\bar {\mathbf {e}}\right ]\right )$ . By decomposing the tree $\left (\mathfrak {T}_1;\mathsf {A}\right )$ into the path $\mathsf {A}$ along with all the side-forests $\mathfrak {F}\left ( s \backslash \mathsf {\mathsf {A}} \right )$ , and similarly for the tree $\left (\mathfrak {T}_2;\mathsf {B}\right )$ , Duplicator can win the game $\mathrm {EF}_{n}\!\left (\left (\mathfrak {T}_1;\mathsf {A}\right ),\left (\mathfrak {T}_2;\mathsf {B}\right )\right )$ by combining her winning strategy for the game $\mathrm {EF}_{n}\!\left (\mathsf {A}^{\mathfrak {T}_1}\left [\bar {\mathbf {e}}\right ],\mathsf {B}^{\mathfrak {T}_2}\left [\bar {\mathbf {e}}\right ]\right )$ with her local strategies for the games $\mathrm {EF}_{n}\!\left (\mathfrak {F}\left ( s \backslash \mathsf {\mathsf {A}} \right ),\mathfrak {F}\left ( t \backslash \mathsf {\mathsf {B}} \right )\right )$ for nodes s from $\mathsf {A}^{\mathfrak {T}_1}\left [\bar {\mathbf {e}}\right ]$ and t from $\mathsf {B}^{\mathfrak {T}_2}\left [\bar {\mathbf {e}}\right ]$ that have the same extended colours in $\mathsf {A}^{\mathfrak {T}_1}\left [\bar {\mathbf {e}}\right ]$ and $\mathsf {B}^{\mathfrak {T}_2}\left [\bar {\mathbf {e}}\right ]$ respectively.⊣

Let $k,n \in \mathbb {N}$ and $p = f\left (L_k',n\right )$ . Given a formula $\varphi \left (\bar {x}\right )$ (where $\varphi $ may be a sentence) in the language $L_{k+p}$ with colours $\bar {\mathbf {c}}_k$ and $\bar {\mathbf {e}}_p$ , and a variable z that does not occur in $\varphi $ , let $\varphi ^n_z\left (\bar {x}\right )$ denote the formula that is obtained from $\varphi ^{<z}$ (the relativisation of $\varphi $ to the formula $\theta \left (u,z\right ) := u < z$ where z is treated as a parameter) by replacing, for each i, for $1 \leqslant i \leqslant p$ , every atomic formula of the form $\mathbf {e}_{i}\left (y\right )$ that occurs in $\varphi ^{< z}$ with the formula $\tau _{L_k',n,i}^{\mathsf {sf}\left (u,y,z\right )}$ (i.e., $\tau _{L_k',n,i}$ relativised to $\mathsf {sf}\left (u,y,z\right )$ where y and z are treated as parameters).

In other words, the formula $\varphi ^n_z$ is the same as the relativised formula $\varphi ^{<z}$ , except that instead of stating that a node y has colour $\mathbf {e}_{i}$ , $\varphi ^n_z$ states that the side-forest of y with respect to the stem $T^{<z}$ satisfies the characteristic sentence $\tau _{L_k',n,i}$ . If $\varphi $ does not contain any atomic formulas of the form $\mathbf {e}_{i}\left (y\right )$ then $\varphi ^n_z$ is simply the formula $\varphi ^{<z}$ .

The following result, which relates the truth of a first-order formula in a tree to the truth of that formula in the coloured linear order approximating that tree, now follows immediately from the definition of $\varphi ^n_z$ .

Corollary 4.3. Let $k\geq 0$ and $\mathfrak {T}$ be a k-coloured tree with colours $\bar {\mathbf {c}}_k$ , $n \in \mathbb {N}$ , $p = f\left (L_k',n\right )$ , and $\bar {\mathbf {e}} = \bar {\mathbf {e}}_p$ , and let $\varphi \left (\bar {x}\right )$ be a formula in the language $L_{k+p}$ with colours $\bar {\mathbf {c}}_k$ and $\bar {\mathbf {e}}_p$ . For each $a \in T$ and tuple $\bar {b}$ of nodes in $T^{<a}$ of the same arity as $\bar {x}$ ,

$$ \begin{align*} (\mathfrak{T};a,\bar{b}) \models \varphi_a^n (\bar{b}) \ \text{if and only if} \ ((T^{< a})^{\mathfrak{T}} [\bar{\mathbf{e}}];\bar{b}) \models \varphi(\bar{b}).\end{align*} $$

In particular, if $\varphi $ is a sentence then for each $a \in T$ , $\left (\mathfrak {T};a\right ) \models \varphi _a^n$ if and only if $\left (T^{< a}\right )^{\mathfrak {T}}\left [\bar {\mathbf {e}}\right ] \models \varphi $ .

4.2 Towards axiomatising the first-order theory of $\zeta $ -trees

We will now use Theorem 4.2 and Corollary 4.3 to show how the first-order theories of certain classes of coloured $\mathcal {C}$ -trees can be axiomatised. The method used in this section will have the set $\mathcal {C} = \{\zeta \}$ in mind, but can also be used on the sets $\mathcal {C} = \{\omega ^{\star }\}$ and $\mathcal {C} = \{\zeta ,\omega ^{\star }\}$ .

Say that a class of monochromatic linear orders $\mathcal {C}$ has the fusion closure property when the following condition holds:

For any two linear orders $\mathfrak {A}, \mathfrak {B} \in \mathcal {C}$ and for any $a \in A$ and $b \in B$ , $\mathfrak {A}^{< a} + \mathfrak {B}^{\geqslant b}$ is isomorphic to a linear order in $\mathcal {C}$ .

Consider any class $\mathcal {C}$ of monochromatic linear orders with the following properties:

  1. (C1) $\mathcal {C}$ does not contain the singleton linear order;

  2. (C2) $\mathcal {C}$ has the fusion closure property;

  3. (C3) all linear orders in $\mathcal {C}$ are discrete; and

  4. (C4) every $\mathcal {C}$ -tree is branching complete.

The property (C1) is needed to eliminate degenerate cases, (C2) is needed to ensure that paths belong to $\mathcal {C}$ when constructing models of the axiomatisation that will be given below, (C3) is to ensure that all components in the lower and upper side-forests of any $\mathcal {C}$ -tree have roots, and (C4) is needed to ensure that the set of side-forests along any path form a partition of the tree.

Now let k be any natural number and let $\mathcal {S} = \mathcal {T}_k(\mathcal {C})$ be the class of k-coloured $\mathcal {C}$ -trees. Suppose that the following are known:

  1. (i) an axiomatisation $\Sigma ^k_{\uparrow }$ of the first-order theory of the class of k-coloured trees

    $$ \begin{align*}\left\{\mathfrak{T}^{\geqslant a} : \mathfrak{T} \in \mathcal{S}, \, a \in T \right\};\end{align*} $$
  2. (ii) for each natural number n, an axiomatisation $\Sigma ^{k,n}_{\downarrow }$ of the first-order theory of the class of $\left (k+f\left (L_k',n\right )\right )$ -coloured linear orders

    $$ \begin{align*}\{ \left(T^{<a}\right)^{\mathfrak{T}}\left[\bar{\mathbf{e}}\right] : \mathfrak{T} \in \mathcal{S}, \, a \in T \}.\end{align*} $$

Let

$$ \begin{align*} (\Sigma^k_{\uparrow})' & := \{ \forall x \left( \sigma^{\geqslant x} \right) : \sigma \in \Sigma^k_{\uparrow} \} \ \text{and} \\ (\Sigma^k_{\downarrow})' & := \{ \forall z ( \exists y (y < z) \rightarrow \sigma^n_z ) : n \in \mathbb{N}, \, \sigma \in \Sigma^{k,n}_{\downarrow}\}. \end{align*} $$

In essence, the theory $ (\Sigma ^k_{\uparrow } )'$ expresses for a k-coloured tree $\mathfrak {T}$ that each principal subtree of $\mathfrak {T}$ satisfies $\Sigma ^k_{\uparrow }$ , while, using Corollary 4.3, the theory $ (\Sigma ^k_{\downarrow } )'$ expresses of $\mathfrak {T}$ that, for each natural number n and each principal stem $\mathsf {A}$ in $\mathfrak {T}$ , the linear order $\mathsf {A}^{\mathfrak {T}}\left [\bar {\mathbf {e}}\right ]$ satisfies $\Sigma ^{k,n}_{\downarrow }$ .

Then, the first-order theory of $\mathcal {S}$ can be axiomatised by the theoryFootnote 5

$$ \begin{align*}\mathsf{Ax}\left(\mathcal{S}\right) := \left\{\mathsf{Tree},\mathsf{bc}\right\} \cup (\Sigma^k_{\uparrow})' \cup (\Sigma^k_{\downarrow})'\end{align*} $$

as is shown in the next result.

Theorem 4.4. Let $\mathcal {C}$ a class of linear orders that satisfies (C1)–(C4), and let $\mathcal {S} = \mathcal {T}_k(\mathcal {C})$ be the class of k-coloured $\mathcal {C}$ -trees, for some $k \in \mathbb {N}$ . For each k-coloured model $\mathfrak {T}$ of $\mathsf {Ax}\left (\mathcal {S}\right )$ and each $n \in \mathbb {N}$ , there exists a k-coloured tree $\mathfrak {S}_n \in \mathcal {S}$ such that $\mathfrak {T} \equiv _n \mathfrak {S}_n$ .

Proof Let $\mathfrak {T} = \left (T;<,\bar {\mathbf {c}}\right )$ be a k-coloured model of $\mathsf {Ax}\left (\mathcal {S}\right )$ . Fix $n \in \mathbb {N}$ ; without loss of generality, we can assume that $n \geqslant \mathrm {qr}\left (\mathsf {bc}\right )$ . Now, let $p = f\left (L_k',n\right )$ and $\bar {\mathbf {e}} = \bar {\mathbf {e}}_p$ . Since $\mathfrak {T} \models \mathsf {Tree}$ then $\mathfrak {T}$ is a tree. Pick any node $a \in T$ other than the root of $\mathfrak {T}$ (if there is a root) and let $A = T^{<a}$ . Observe that $\mathfrak {T}$ is discrete (the property of discreteness is encoded in both of the theories $(\Sigma ^k_{\uparrow })'$ and $(\Sigma ^k_{\downarrow })'$ ), hence A has a greatest node, say b.

Since $\mathfrak {T} \models (\Sigma ^k_{\downarrow })'$ then $\left (\mathfrak {T};a\right ) \models \{ \sigma ^n_a : \sigma \in \Sigma ^{k,n}_{\downarrow }\}$ , hence, by Corollary 4.3, $A^{\mathfrak {T}}\left [\bar {\mathbf {e}}\right ] \models \Sigma ^{k,n}_{\downarrow }$ . Therefore, there exists a $\left (k+p\right )$ -coloured linear order $\mathfrak {B} = \left (B;<_{\mathfrak {B}},\bar {\mathbf {c}},\bar {\mathbf {e}}\right )$ such that $\mathfrak {B} \equiv _n A^{\mathfrak {T}}\left [\bar {\mathbf {e}}\right ]$ and for which the reduct $\mathfrak {B}' = \left (B;<_{\mathfrak {B}},\bar {\mathbf {c}}\right )$ is a coloured principal stem in some tree from $\mathcal {S}$ .

Since $\mathfrak {T}$ is discrete, there are no lower side-forests in $\mathfrak {T}$ . Now, for each $t \in A$ , let $ \{\mathfrak {F}_{t,i} \}_{i \in I_t}$ be the set of components in the upper side-forest $\mathfrak {F}_u\!\left (t \backslash A\right )$ of t with respect to A in the tree $\mathfrak {T}$ . Since $\mathfrak {T}$ is discrete, each component $\mathfrak {F}_{t,i}$ has a root, say $r_{t,i}$ . Then, since $\mathfrak {T}^{\geqslant r_{t,i}} \models \Sigma ^k_{\uparrow }$ , there exists a k-coloured tree $\mathfrak {G}_{t,i}$ such that $\mathfrak {G}_{t,i} \equiv _n \mathfrak {F}_{t,i}$ and for which $\mathfrak {G}_{t,i}$ is a principal subtree of some tree from $\mathcal {S}$ . It follows that, for each node $t \in A$ there exists a forest $\mathfrak {G}_t$ such that $\mathfrak {F}_u\!\left (t \backslash A\right ) \equiv _n \mathfrak {G}_t$ and for which each component of $\mathfrak {G}_t$ is a principal subtree of some tree from $\mathcal {S}$ .

For each $s \in B$ , let $t\left (s\right ) \in A$ be a node such that s and $t\left (s\right )$ have the same colour from amongst the colours in $\bar {\mathbf {e}}$ in the linear orders $\mathfrak {B}$ and $A^{\mathfrak {T}}\left [\bar {\mathbf {e}}\right ]$ respectively. (Such $t\left (s\right )$ exists since $\mathfrak {B} \equiv _n A^{\mathfrak {T}}\left [\bar {\mathbf {e}}\right ]$ ). Define $\mathfrak {S}_n$ to be the k-coloured tree that is obtained from $\mathfrak {B}'$ by adding, to each element $s \in B$ , the forest $\mathfrak {G}_{t\left (s\right )}$ as an upper side-forest of s with respect to B. It follows that for each $s \in B$ , s has the same colour from amongst the colours in $\bar {\mathbf {e}}$ in both the linear orders $B^{\mathfrak {S}_n}\!\left [\bar {\mathbf {e}}\right ]$ and $\mathfrak {B}$ , hence $B^{\mathfrak {S}_n}\!\left [\bar {\mathbf {e}}\right ] \cong \mathfrak {B}$ . Since also $\mathfrak {B} \equiv _n A^{\mathfrak {T}}\left [\bar {\mathbf {e}}\right ]$ then $B^{\mathfrak {S}_n}\!\left [\bar {\mathbf {e}}\right ] \equiv _n A^{\mathfrak {T}}\left [\bar {\mathbf {e}}\right ]$ .

Next, we note that both $\mathfrak {S}_n$ and $\mathfrak {T}$ are branching complete. That $\mathfrak {S}_n$ is branching complete follows from the fact that every component in each of the upper side-forests $\mathfrak {F}_u\!\left (s \backslash B\right )$ ( $\cong \mathfrak {G}_{t\left (s\right )}$ ) in $\mathfrak {S}_n$ , being a principal subtree of a tree from $\mathcal {S}$ , is branching complete. $\mathfrak {T}$ is branching complete since it is a model of the sentence $\mathsf {bc}$ . Now since $B^{\mathfrak {S}_n}\!\left [\bar {\mathbf {e}}\right ] \equiv _n A^{\mathfrak {T}}\left [\bar {\mathbf {e}}\right ]$ then, by Theorem 4.2, $\mathfrak {S}_n \equiv _n \mathfrak {T}$ .

Finally, to see that $\mathfrak {S}_n$ is a $\mathcal {C}$ -tree, observe that each path in $\mathfrak {S}_n$ can be written as a sum $\left (\mathfrak {B}^-\right )^{\leqslant s} + \mathsf {D}$ for some node $s \in B$ and where $\mathsf {D}$ is a path in the forest $\mathfrak {F}_u\!\left (s \backslash B\right ) \cong \mathfrak {G}_{s\left (t\right )}$ in $\mathfrak {S}_n$ . Since $\mathfrak {B}^-$ is a principal stem in a $\mathcal {C}$ -tree and the linear orders in $\mathcal {C}$ are discrete, it follows that $\left (\mathfrak {B}^-\right )^{\leqslant s} \cong \left (\mathfrak {C}_1\right )^{<c_1}$ for some linear order $\mathfrak {C}_1 \in \mathcal {C}$ of which $\mathfrak {B}^-$ is an initial suborder and some $c_1 \in C_1$ . Further, since each component of $\mathfrak {G}_{s\left (t\right )}$ is a principal subtree of some tree from $\mathcal {S}$ , it follows that $\mathsf {D} \cong \left (\mathfrak {C}_2\right )^{\geqslant c_2}$ for some linear order $\mathfrak {C}_2 \in \mathcal {C}$ of which $\mathsf {D}$ is a terminal suborder and some $c_2 \in C_2$ . Hence, by the fusion closure property of $\mathcal {C}$ , we obtain that $\left (\mathfrak {B}^-\right )^{\leqslant s} + \mathsf {D} \cong \left (\mathfrak {C}_1\right )^{<c_1} + \left (\mathfrak {C}_2\right )^{\geqslant c_2} \in \mathcal {C}$ , as required.⊣

4.3 Applications of the results so far

Theorem 4.4 can be used to axiomatise the first-order theories of classes of trees of the type $\mathcal {S} = \mathcal {T}_k(\mathcal {C})$ , where the requirements on $\mathcal {C}$ are firstly that it must satisfy the properties (C1)–(C4), and secondly, that the axiomatisations $(\Sigma ^k_{\uparrow })'$ and $(\Sigma ^k_{\downarrow })'$ are known. There are three classes $\mathcal {C}$ that fulfill these requirements, namely the sets $\left \{\zeta \right \}$ , $\left \{\omega ^{\star }\right \}$ , and $\left \{\zeta ,\omega ^{\star }\right \}$ .

Theorem 4.4 will hence be used in Section 5.2 to axiomatise the first-order theory of the class of k-coloured $\zeta $ -trees. The two simpler axiomatisations $(\Sigma ^k_{\uparrow })'$ and $(\Sigma ^k_{\downarrow })'$ that are needed for that purpose are those of the first-order theories of the class of k-coloured $\omega $ -trees (to be given in Section 5.1) and of the class of $\left (k+f\left (L_k',n\right )\right )$ -coloured expansions of the linear order $\omega ^{\star }$ (given in Section 3.3).

In the case of the sets $\mathcal {C} = \left \{\omega ^{\star }\right \}$ and $\mathcal {C} = \left \{\zeta ,\omega ^{\star }\right \}$ , an axiomatisation of the first-order theory of the class of $\left (k+f\left (L_k',n\right )\right )$ -coloured expansions of $\omega ^{\star }$ is also needed for $(\Sigma ^k_{\downarrow })'$ . For $(\Sigma ^k_{\uparrow })'$ , one uses in the case $\mathcal {C} = \left \{\omega ^{\star }\right \}$ an axiomatisation of the first-order theory of the class of k-coloured trees of which all paths are finite (which can be deduced from [Reference Rogers, Backofen and Vijay-Shankar15]), and in the case $\mathcal {C} = \left \{\zeta ,\omega ^{\star }\right \}$ one uses an axiomatisation of the first-order theory of the class of k-coloured well-founded trees of which all paths have height at most $\omega $ (a trivial exercise). Axiomatisations for the cases $\mathcal {C} = \left \{\omega ^{\star }\right \}$ and $\mathcal {C} = \left \{\zeta ,\omega ^{\star }\right \}$ will not be presented here however.

5 First-order theories of coloured trees

5.1 The first-order theory of coloured $\omega $ -trees

The following result will be needed to axiomatise the first-order theory of the class of k-coloured $\omega $ -trees.

Proposition 5.1 ([Reference Doets2, Theorem 5.1])

For every k-coloured model $\mathfrak {T}$ of $\left \{\mathsf {Tree}\right \} \cup \mathsf {BWF}_k$ and each $n \in \mathbb {N}$ there exists a well-founded tree $\mathfrak {S}_n$ such that $\mathfrak {S}_n \equiv _n \mathfrak {T}$ .

It now follows that the theory

$$ \begin{align*}T\omega_k := \left\{\mathsf{Tree}, \mathsf{Le}, \neg\mathsf{Gr}, \mathsf{BD}, \mathsf{FD} \right\} \cup \mathsf{BWF}_k\end{align*} $$

axiomatises the class of k-coloured $\omega $ -trees:

Corollary 5.2. Let $\mathfrak {T}$ be any k-coloured model of $T\omega _k$ , for some $k \in \mathbb {N}$ . Then, for each $n \in \mathbb {N}$ there exists a k-coloured $\omega $ -tree $\mathfrak {S}_n$ such that $\mathfrak {S}_n \equiv _n \mathfrak {T}$ .

Proof Without loss of generality we can assume that n is at least as large as the quantifier ranks of all the sentences $\mathsf {Tree}$ , $\mathsf {Le}$ , $\neg \mathsf {Gr}$ , $\mathsf {BD}$ , and $\mathsf {FD}$ . Since $\mathfrak {T} \models \left \{\mathsf {Tree}\right \} \cup \mathsf {BWF}_k$ then, by Proposition 5.1, there exists a well-founded tree $\mathfrak {S}_n$ such that $\mathfrak {S}_n \equiv _n \mathfrak {T}$ . The axioms $\mathsf {Le}$ , $\neg \mathsf {Gr}$ , $\mathsf {BD}$ , and $\mathsf {FD}$ ensure that each path in $\mathfrak {S}_n$ will have the properties of having a least element but no greatest element and of being discrete. Then, each path in $\mathfrak {S}_n$ will be isomorphic to $\omega $ , as required.⊣

5.2 The first-order theory of coloured $\zeta $ -trees

Using Theorem 4.4, we will now show that the theory

$$ \begin{align*}T\zeta_k := \left\{\mathsf{Tree}, \neg\mathsf{Le}, \neg\mathsf{Gr}, \mathsf{BD}, \mathsf{FD} \right\} \cup \mathsf{BBWF}_k \cup \mathsf{BFWF}_k\end{align*} $$

axiomatises the first-order theory of the class of k-coloured $\zeta $ -trees.

Theorem 5.3. Let $\mathfrak {T}$ be a k-coloured model of $T\zeta _k$ , for some $k \in \mathbb {N}$ . Then, for each $n \in \mathbb {N}$ there exists a k-coloured $\zeta $ -tree $\mathfrak {S}_n$ such that $\mathfrak {S}_n \equiv _n \mathfrak {T}$ .

Proof Let $\mathfrak {T}$ be a k-coloured model of $T\zeta _k$ . The result will be proved by showing that $\mathfrak {T} \models \mathsf {Ax}\left (\mathcal {T}_k\left (\zeta \right )\right )$ and employing Theorem 4.4. It is straightforward to check that the class $\left \{\zeta \right \}$ satisfies properties (C1)–(C4). Observe that the class of principal subtrees of trees in $\mathcal {T}_k\left (\zeta \right )$ coincides with the class of k-coloured $\omega $ -trees, hence we take $\Sigma ^k_{\uparrow } := T\omega _k$ . For $n \in \mathbb {N}$ and $p = f\left (L_k',n\right )$ , each linear order of the form $\left (S^{<a}\right )^{\mathfrak {S}}[\bar {\mathbf {e}}_p]$ , where $\mathfrak {S} \in \mathcal {T}_k\left (\zeta \right )$ and $a \in S$ , is a $\left (k+p\right )$ -coloured expansion of $\omega ^{\star }$ , from which it follows that we can take $\Sigma ^{k,n}_{\downarrow } := C\omega ^{\star }_{k+p}$ (but, see Remark 5.4 after this proof). We define $(\Sigma ^k_{\uparrow })'$ and $(\Sigma ^k_{\downarrow })'$ as in Section 4.2.

By assumption, $\mathfrak {T}$ is a model of $\mathsf {Tree}$ . $\mathfrak {T}$ is a model of $\mathsf {bc}$ since the property of branching completeness follows from the property of definable bounded backward well-foundedness. Therefore, to show that $\mathfrak {T} \models \mathsf {Ax}\left (\mathcal {T}_k\left (\zeta \right )\right )$ , it remains to show that $\mathfrak {T} \models (\Sigma ^k_{\uparrow })'$ and $\mathfrak {T} \models (\Sigma ^k_{\downarrow })'$ .

$\mathfrak {T} \models (\Sigma ^k_{\uparrow })'$ : Let $a \in T$ . We first show that $\mathfrak {T}^{\geqslant a} \models T\omega _k$ . It is straightforward to check that $\mathfrak {T}^{\geqslant a} \models \left \{\mathsf {Tree},\mathsf {Le},\neg \mathsf {Gr},\mathsf {BD},\mathsf {FD}\right \}$ . To see that $\mathfrak {T}^{\geqslant a} \models \mathsf {BWF}_k$ , let $\bar {b}$ be a tuple of nodes in $T^{\geqslant a}$ and suppose that the formula $\varphi (x,\bar {b} )$ defines a non-empty set B in the tree $(\mathfrak {T}^{\geqslant a};\bar {b} )$ . Then, $\varphi ^{\geqslant a}(x,\bar {b} )$ defines B in the tree $(\mathfrak {T};a,\bar {b} )$ and B is bounded below by the node a. Hence, since $\mathfrak {T}$ is a model of $ \mathsf {BBWF}_k$ , B must contain a least element. It follows that $\mathfrak {T}^{\geqslant a} \models \mathsf {BWF}_k$ , hence also $\mathfrak {T}^{\geqslant a} \models T\omega _k$ . Then $(\mathfrak {T};a ) \models \{\sigma ^{\geqslant a} : \sigma \in T\omega _k \}$ , and since a was arbitrary and $\Sigma ^k_{\uparrow } = T\omega _k$ , we get $\mathfrak {T} \models (\Sigma ^k_{\uparrow })'$ .

$\mathfrak {T} \models (\Sigma ^k_{\downarrow })'$ : Again, let $a \in T$ and let $\mathsf {A} = \left (T^{<a};<_{\mathfrak {T}}\right )$ , $n \in \mathbb {N}$ , and $p = f\left (L_k',n\right )$ . We first show that $\mathsf {A}^{\mathfrak {T}}[\bar {\mathbf {e}}_p] \models C\omega ^{\star }_{k+p}$ . Clearly, $\mathsf {A}^{\mathfrak {T}}[\bar {\mathbf {e}}_p] \models \left \{\mathsf {LO},\neg \mathsf {Le},\mathsf {FD}\right \}$ . As for $\mathsf {A}^{\mathfrak {T}}[\bar {\mathbf {e}}_p] \models \mathsf {FWF}_k$ , let $\bar {b}$ this time be a tuple of nodes in $T^{<a}$ and suppose that the formula $\varphi (x,\bar {b} )$ defines a non-empty set B in the tree $(\mathsf {A}^{\mathfrak {T}} [\bar {\mathbf {e}}_p ];\bar {b} )$ . By Corollary 4.3, $\varphi ^n_a(x,\bar {b} )$ defines B in $(\mathfrak {T};a,\bar {b} )$ and B is bounded above by a, hence, using the fact that $\mathfrak {T}$ is a model of $\mathsf {BFWF}_k$ , B must contain a greatest element. It follows that $\mathsf {A}^{\mathfrak {T}}[\bar {\mathbf {e}}_p] \models \mathsf {FWF}_k$ , hence also $\mathsf {A}^{\mathfrak {T}}[\bar {\mathbf {e}}_p] \models C\omega ^{\star }_{k+p}$ . Now, by Corollary 4.3, we have that $\left (\mathfrak {T};a\right ) \models \{\sigma ^n_a : \sigma \in C\omega ^{\star }_{k+p} \}$ . Since a and n were arbitrary and $\Sigma ^{k,n}_{\downarrow } = C\omega ^{\star }_{k+p}$ , it follows that $\mathfrak {T} \models (\Sigma ^k_{\downarrow })'$ .⊣

Remark 5.4. In the above proof, for $n \in \mathbb {N}$ and $p = f\left (L_k',n\right )$ , the class of linear orders of the form $\left (S^{<a}\right )^{\mathfrak {S}}[\bar {\mathbf {e}}_p]$ , where $\mathfrak {S} \in \mathcal {T}_k\left (\zeta \right )$ and $a \in S$ , does not, strictly speaking, coincide with the class of $\left (k+p\right )$ -coloured expansions of $\omega ^{\star }$ , because there are extended colours from $\bar {\mathbf {e}}_p$ that aren’t realised in any linear orders in the first class (not all characteristic sentences $\tau _{L_k',n,i}$ are realised in side-forests), while every colour in $\bar {\mathbf {e}}_p$ is realised in some $\left (k+p\right )$ -coloured expansion of $\omega ^{\star }$ . We can nonetheless take $\Sigma ^{k,n}_{\downarrow }$ to be the theory $C\omega ^{\star }_{k+p}$ since, for each model $\mathfrak {A}$ of $C\omega ^{\star }_{k+p}$ , every $\left (k+p\right )$ -coloured expansion of $\omega ^{\star }$ that is m-equivalent to $\mathfrak {A}$ realises exactly the same colours as $\mathfrak {A}$ .

5.3 The first-order theory of coloured $\eta $ -trees

We now define the theory

$$ \begin{align*}T\eta := \left\{\mathsf{Tree}, \neg\mathsf{Le}, \neg\mathsf{Gr}, \mathsf{De} \right\}.\end{align*} $$

The first-order theory of the class of k-coloured $\eta $ -trees is axiomatised by $T\eta $ , as seen from the next theorem.

Theorem 5.5. For each k-coloured model $\mathfrak {T}$ of $T\eta $ , there exists a k-coloured $\eta $ -tree $\mathfrak {S}$ such that $\mathfrak {S} \equiv \mathfrak {T}$ .

Proof Given a k-coloured model $\mathfrak {T}$ of $T\eta $ , by the Downward Löwenheim–Skolem Theorem there exists a countable k-coloured tree $\mathfrak {S}$ such that $\mathfrak {S} \equiv \mathfrak {T}$ . Hence each path $\mathsf {A}$ in the tree $\mathfrak {S}$ will be a countable dense linear order without endpoints, so $\mathsf {A} \cong \eta $ , as required.⊣

5.4 The first-order theory of coloured $\lambda $ -trees

Here we will present an axiomatisation of the first-order theory of the class of k-coloured $\lambda $ -trees. We will make use of Fact 3.12, along with the method of partitioning into side-forests from Section 4, to prove that our axiomatisation is complete. In essence, the axiomatisation will express the fact that the linear orders $\mathsf {A}^{\mathfrak {T}}\left [\bar {\mathbf {e}}\right ]$ obtained from a coloured $\lambda $ -tree by partitioning it into side-forests are themselves coloured expansions of $\lambda $ . The proof of completeness of the axiomatisation that we give here follows a similar idea as in the proof of completeness of the axiomatisation of the class of well-founded trees, given in [Reference Doets2], but with several essential differences.

Recall the definition of the formula $\varphi ^n_z$ that was given before Corollary 4.3. By applying it to each $\sigma \in C\lambda _{k+f\left (L_k',n\right )}$ , for any $k \in \mathbb {N}$ we define

$$ \begin{align*}\left(C\lambda_k\right)' := \{ \forall z \left(\sigma_z^n\right) : n \in \mathbb{N}, \, \sigma \in C\lambda_{k+f\left(L_k',n\right)}\}.\end{align*} $$

We will show (Theorem 5.9) that the first-order theory of the class of k-coloured $\lambda $ -trees is axiomatised by the set of sentences

$$ \begin{align*}T\lambda_k := \left\{ \mathsf{Tree}, \neg\mathsf{Gr}, \mathsf{bc} \right\} \cup \left(C\lambda_k\right)'.\end{align*} $$

First, note that every k-coloured $\lambda $ -tree, being branching complete, is a model of $T\lambda _k$ . Next, a few technical lemmas and propositions are needed.

Lemma 5.6. Let $\mathfrak {T}$ be a k-coloured model of $T\lambda _k$ . Then, for each $a \in T$ , each component in $\mathfrak {T}^{>a}$ is a model of $T\lambda _k$ , too.

Proof Let $\mathfrak {F}$ be a component in $\mathfrak {T}^{>a}$ . It is immediate that $\mathfrak {F}$ satisfies the sentences $\mathsf {Tree}$ , $\neg \mathsf {Gr}$ , and $\mathsf {bc}$ . To see that $\mathfrak {F}$ satisfies $\left (C\lambda _k\right )'$ , let $n \in \mathbb {N}$ , $p = f\left (L_k',n\right )$ , and $b \in F$ . It then suffices to show that $\left (\mathfrak {F};b\right ) \models \left \{\sigma ^n_b : \sigma \in C\lambda _{k+p}\right \}$ . Clearly $\left (\mathfrak {F};b\right ) \models \left (\mathsf {LO}\right )^n_b$ . It follows from the fact that $\mathfrak {T} \models \left (C\lambda _k\right )'$ that $\mathfrak {T}$ is dense, hence $\left (\mathfrak {F};b\right ) \models \left \{\left (\neg \mathsf {Le}\right )^n_b,\left (\neg \mathsf {Gr}\right )^n_b,\left (\mathsf {De}\right )^n_b\right \}$ .

Since $\mathfrak {T} \models \left (C\lambda _k\right )'$ then in particular $\left (\mathfrak {T};b\right ) \models \left \{ \sigma ^n_b : \sigma \in \mathsf {COM}_{k+p} \right \}$ and $\left (\mathfrak {T};b\right ) \models \left \{ \sigma ^n_b : \sigma \in \mathsf {I}_{k+p} \right \}$ . Hence, by Corollary 4.3, we obtain $\left (T^{<b}\right )^{\mathfrak {T}}[\bar {\mathbf {e}}_p] \models \mathsf {COM}_{k+p}$ and $\left (T^{<b}\right )^{\mathfrak {T}}[\bar {\mathbf {e}}_p] \models \mathsf {I}_{k+p}$ .

To show that $\left (\mathfrak {F};b\right ) \models \left \{ \sigma ^n_b : \sigma \in \mathsf {COM}_{k+p} \right \}$ it suffices, again by Corollary 4.3, to show that $\left (F^{<b}\right )^{\mathfrak {F}}[\bar {\mathbf {e}}_p] \models \mathsf {COM}_{k+p}$ . Let $\bar {d}$ be a tuple of nodes in $F^{<b}$ and let $\varphi (x,\bar {d})$ define the non-empty set $B \subseteq F^{<b}$ in the linear order $ (F^{< b} )^{\mathfrak {F}}[\bar {\mathbf {e}}_p]$ . Suppose that B has an upper bound $u \in F^{<b}$ . By properties of relativised formulas, $\varphi ^{>a}(x,\bar {d})$ defines B in $((T^{< b})^{\mathfrak {T}}[\bar {\mathbf {e}}_p];a )$ . Since $(T^{< b})^{\mathfrak {T}}[\bar {\mathbf {e}}_p] \models \mathsf {COM}_{k+p}$ , we obtain that B has a least upper bound in $T^{<b}$ , hence also in $F^{<b}$ , as required.

Lastly, it will be shown that $\left (\mathfrak {F};b\right ) \models \left \{ \sigma ^n_b : \sigma \in \mathsf {I}_{k+p} \right \}$ by showing, instead, that $(F^{<b} )^{\mathfrak {F}}[\bar {\mathbf {e}}_p] \models \mathsf {I}_{k+p}$ . To this end, let $\bar {d}$ be a tuple of nodes in $F^{<b}$ and let $\varphi (x,y,\bar {d})$ define in the linear order $((F^{< b})^{\mathfrak {F}}[\bar {\mathbf {e}}_p];\bar {d} )$ a transitive binary relation on $F^{<b}$ that induces a densely ordered condensation $\mathfrak {P} = (P;<_{\mathfrak {P}} )$ of $F^{<b}$ . Let

$$ \begin{align*}\psi(x,y,a,\bar{d}) := \left( x \leqslant a \wedge x=y \right) \vee ( a < x \wedge a < y \wedge \varphi^{>a}(x,y,\bar{d})).\end{align*} $$

The formula $\psi (x,y,a,\bar {d})$ defines in $\left (T^{<b}\right )^{\mathfrak {T}}[\bar {\mathbf {e}}_p]$ a transitive binary relation on $T^{<b}$ that induces a densely ordered condensation $\mathfrak {Q} = \left (Q;<_{\mathfrak {Q}}\right )$ of $T^{<b}$ and $\mathfrak {P}$ is a tail of $\mathfrak {Q}$ . Since $\left (T^{<b}\right )^{\mathfrak {T}}[\bar {\mathbf {e}}_p] \models \mathsf {I}_{k+p}$ then it follows that $\mathfrak {Q}$ must contain a set S of singletons that is dense in $\mathfrak {Q}$ , hence ${S}\mathord {\upharpoonright }_{P}$ is a set of singletons in P that is dense in $\mathfrak {P}$ , as required.⊣

Given a k-coloured forest $\mathfrak {F} := \left (F;<,\bar {\mathbf {c}}\right )$ , let $F_r$ denote the (possibly empty) set of roots of the components of $\mathfrak {F}$ and let $F' := F \backslash F_r$ . Consider the following property:

$\mathsf {Comp}_{\lambda }\left (\mathfrak {F}\right )$ : For each component G of $\mathfrak {F}^{F'}$ , $\mathfrak {F}^G$ is a model of $T\lambda _k$ .

In other words, property $\mathsf {Comp}_{\lambda }\left (\mathfrak {F}\right )$ states that for each component X in $\mathfrak {F}$ , if $\mathfrak {F}^X$ has no root then $\mathfrak {S}^X$ is a model of $T\lambda _k$ , while if $\mathfrak {F}^X$ has a root r then $\mathfrak {F}^Y$ is a model of $T\lambda _k$ for each component Y in $\left (\mathfrak {F}^X\right )^{>r}$ .

Lemma 5.7. Let $\mathfrak {T} := \left (T;<_{\mathfrak {T}},\bar {\mathbf {c}}\right )$ be a k-coloured model of $T\lambda _k$ . Then, for each $a \in T$ and $n \in \mathbb {N}$ there exists a k-coloured tree $\mathfrak {S} := \left (S;<_{\mathfrak {S}},\bar {\mathbf {c}}\right )$ and a node $b \in S$ such that

  1. (i) $(\mathfrak {T};a) \equiv _n \left (\mathfrak {S};b\right )$ ,

  2. (ii) $(\mathfrak {S}^{< b})^- \cong \lambda $ , and

  3. (iii) $\mathsf {Comp}_{\lambda }\left (\mathfrak {S}^J\right )$ holds, where $J := S \backslash S^{\leqslant b}$ .

Proof Let $\mathsf {A} := T^{\leqslant a}$ , $n'=n+1$ , $p = f\left (L_k',n'\right )$ , and $\bar {\mathbf {e}} = \left (\mathbf {e}_{1},\mathbf {e}_{2},\ldots ,\mathbf {e}_{p}\right )$ . Let $\mathfrak {F}_1,\mathfrak {F}_2,\ldots ,\mathfrak {F}_q$ be all, up to $n'$ -equivalence, side-forests of the form $\mathfrak {F}\left ( t \backslash \mathsf {A} \right )$ with $t \in \mathsf {A}$ . Without loss of generality, we can assume that $\mathfrak {F}_i \models \tau _{L_k',n',i}$ for each i. Each $\mathfrak {F}_i$ has the form $\mathfrak {F}_i := \left (F_i;<_{\mathfrak {F}_i},\bar {\mathbf {c}},d_i\right )$ where $d_i \in \mathsf {A}$ .

Since $(\mathfrak {T};a) \models \{ \sigma _a^{n'}: \sigma \in C\lambda _{k+p} \}$ then by Corollary 4.3, $\left (T^{<a}\right )^{\mathfrak {T}}\left [\bar {\mathbf {e}}\right ] \models C\lambda _{k+p}$ , hence, by Fact 3.12, there exists a $\left (k+p\right )$ -coloured linear order $\mathfrak {B}_1$ such that $\left (T^{<a}\right )^{\mathfrak {T}}\left [\bar {\mathbf {e}}\right ] \equiv _{n'} \mathfrak {B}_1$ and $\mathfrak {B}_1^- \cong \lambda $ . Let $\mathfrak {B}_2 := \left (\mathsf {A}^{\mathfrak {T}}\left [\bar {\mathbf {e}}\right ]\right )^{\left \{a\right \}}$ be the restriction of $\mathsf {A}^{\mathfrak {T}}\left [\bar {\mathbf {e}}\right ]$ to the set $\left \{a\right \}$ and define the $\left (k+p\right )$ -coloured linear order $\mathfrak {B} := \mathfrak {B}_1 + \mathfrak {B}_2 = \left (B;<_{\mathfrak {B}},\bar {\mathbf {c}},\bar {\mathbf {e}}\right )$ . By Lemma 2.1, $\mathfrak {B} \equiv _{n'} \mathsf {A}^{\mathfrak {T}}\left [\bar {\mathbf {e}}\right ]$ .

For each $x \in B$ , let $\chi \left (x\right )$ denote the value of i for which $\mathfrak {B} \models \mathbf {e}_{i}\left (x\right )$ . We define

$$ \begin{align*}S := \bigcup_{x \in B} \left(F_{\chi\left(x\right)} \times \left\{x\right\}\right),\end{align*} $$

and we also define the binary relation $<_{\mathfrak {S}}$ on S by specifying, for all pairs $\left (u,x\right ), \left (v,y\right ) \in S$ , that

$$ \begin{align*}\left(u,x\right) <_{\mathfrak{S}} \left(v,y\right) \Longleftrightarrow [(x <_{\mathfrak{B}} y \ \text{and} \ u = d_{\chi(x)}) \ \text{or} \ (x=y \ \text{and} \ u <_{\mathfrak{F}_{\chi(x)}} v)].\end{align*} $$

Now, we define $\mathfrak {S} = \left (S;<_{\mathfrak {S}},\bar {\mathbf {c}}\right )$ by specifying that, for each $s = \left (u,x\right )$ in S, $\mathfrak {S} \models \mathbf {c}_{i}\left (s\right )$ if and only if $\mathfrak {F}_{\chi \left (x\right )} \models \mathbf {c}_{i}\left (u\right )$ . Let $D := \left \{\left (d_{\chi \left (x\right )},x\right ) : x \in B\right \}$ and $\mathsf {D} := \left (D;<_{\mathfrak {S}}\right )$ . $\mathfrak {S}$ is simply the tree that is obtained from $\mathfrak {B}$ by replacing each $x \in B$ with the side-forest $\mathfrak {F}_{\chi \left (x\right )}$ , and $\mathsf {D}$ is the path in $\mathfrak {S}$ that consists of all the $d_i$ ’s in these side-forests.

We are going to apply Theorem 4.2. For that, we will need to know that $\mathfrak {T}$ and $\mathfrak {S}$ are branching complete. $\mathfrak {T}$ is branching complete since it satisfies $\mathsf {bc}$ . It follows that every lower side-forest in $\mathfrak {T}$ —in particular, each of the lower side-forests in $\mathfrak {F}_1,\mathfrak {F}_2,\ldots ,\mathfrak {F}_q$ —must contain a root. For $\mathfrak {S}$ , let $r,w \in S$ be incomparable and let $S_{rw}$ be defined as in (2) at the beginning of Section 4.1. We consider the following two cases, each with three subcases, for the location of r and w:

Case 1: r and w belong to the same set $F_{\chi \left (x\right )} \times \left \{x\right \}$ . We consider three sub-cases:

  1. 1.1 If r and w belong to the same component from $F_{\chi \left (x\right )} \times \left \{x\right \}$ then $S_{rw}$ contains a least element, by the fact that $\mathfrak {F}_{\chi \left (x\right )}$ is branching complete.

  2. 1.2 If r and w belong to different components from $F_{\chi \left (x\right )} \times \left \{x\right \}$ , but with r lying in the upper side-forest from $F_{\chi \left (x\right )} \times \left \{x\right \}$ , then $(d_{\chi (x)},x )$ is the least element of $S_{rw}$ .

  3. 1.3 If r and w belong to different components from $F_{\chi \left (x\right )} \times \left \{x\right \}$ , but with r lying in the lower side-forest from $F_{\chi \left (x\right )} \times \left \{x\right \}$ , then the root of that lower side-forest that contains r is the least element of $S_{rw}$ .

Case 2: r belongs to the set $F_{\chi \left (x\right )} \times \left \{x\right \}$ and w belongs to a different set $F_{\chi \left (y\right )} \times \left \{y\right \}$ . Then either $x <_{\mathfrak {B}} y$ or $y <_{\mathfrak {B}} x$ .

  1. 2.1 If $x <_{\mathfrak {B}} y$ and r lies in the upper side-forest from $F_{\chi \left (x\right )} \times \left \{x\right \}$ then $(d_{\chi (x)},x )$ is the least element of $S_{rw}$ .

  2. 2.2 If $x <_{\mathfrak {B}} y$ and r lies in the lower side-forest from $F_{\chi \left (x\right )} \times \left \{x\right \}$ then the root of that lower side-forest that contains r is the least element of $S_{rw}$ .

  3. 2.3 If $y <_{\mathfrak {B}} x$ then $(d_{\chi (y)},y)$ is the least element of $S_{rw}$ .

Hence, $\mathfrak {S}$ is branching complete. Since $\mathsf {D}^{\mathfrak {S}}\left [\bar {\mathbf {e}}\right ] \cong \mathfrak {B} \equiv _{n'} \mathsf {A}^{\mathfrak {T}}\left [\bar {\mathbf {e}}\right ]$ it follows, using Theorem 4.2, that $\left (\mathfrak {T};\mathsf {A}\right ) \equiv _{n'} \left (\mathfrak {S};\mathsf {D}\right )$ . Letting b be the greatest element of $\mathsf {D}$ , it then follows that $(\mathfrak {T};a) \equiv _n \left (\mathfrak {S};b\right )$ and clearly $(\mathfrak {S}^{<b})^- \cong \mathfrak {B}_1^- \cong \lambda $ . That the property $\mathsf {Comp}_{\lambda }\left (\mathfrak {S}^J\right )$ holds follows from the fact that, for each component F in $\mathfrak {S}^J$ , the tree $\mathfrak {S}^F$ is either a copy of some lower side-forest of $\mathfrak {T}$ (which will necessarily have a root), or a copy of a component in some upper side-forest of $\mathfrak {T}$ (which will not have a root since $\mathfrak {T}$ satisfies the density property that is encoded in $\left (C\lambda _k\right )'$ ); the claim then follows by Lemma 5.6.⊣

Lemma 5.8. Let $\mathfrak {T} := \left (T;<_{\mathfrak {T}},\bar {\mathbf {c}}\right )$ be a k-coloured model of the theory $T\lambda _k$ , let $\left \{a_1,a_2,\ldots ,a_m\right \}$ be an antichain in $\mathfrak {T}$ , and let $\bar {a} := \left (a_1,a_2,\ldots ,a_m\right )$ . Then, for each $n \in \mathbb {N}$ there exists a k-coloured tree $\mathfrak {S} := \left (S;<_{\mathfrak {S}},\bar {\mathbf {c}}\right )$ and a tuple of nodes $\bar {b} := \left (b_1,b_2,\ldots ,b_m\right ) \in S^m$ such that

  1. (i) $\left (\mathfrak {T};\bar {a}\right ) \equiv _n (\mathfrak {S};\bar {b})$ ,

  2. (ii) $(\mathfrak {S}^{< b_i})^- \cong \lambda $ for each i, and

  3. (iii) $\mathsf {Comp}_{\lambda }\left (\mathfrak {S}^J\right )$ holds, where $J := S \backslash \bigcup _{i=1}^m S^{\leqslant b_i}$ .

Proof The result will be proved by induction on m. For $m=1$ , the result is precisely Lemma 5.7. Now let $m \geqslant 2$ be an integer such that the claim holds for each positive integer less than m and let $C := \left \{a_1,a_2,\ldots ,a_m\right \}$ be an antichain in $\mathfrak {T}$ . Consider the stem $H := \bigcap _{i=1}^m T^{< a_i}$ and let $H' := \left \{ x \in T : x \geqslant y \ \text {for each} \ y \in H \right \}$ . Now, let $\left \{F_i\right \}_{i \in I}$ be the set of all components in $\mathfrak {T}^{H'}$ and define the tree $\mathfrak {F}_i := \mathfrak {T}^{F_i}$ for each $i \in I$ . Each component $F_i$ must contain fewer than m (possibly none) elements from C, for, if $C \subseteq F_i$ then $H \cap F_i \neq \emptyset $ —a contradiction. Assume, without loss of generality, that the components $F_i$ that contain nodes from C are precisely $F_1,F_2,\ldots ,F_r$ . Let $C \cap F_i = \left \{a_{i,1},a_{i,2}\ldots ,a_{i,s_i}\right \}$ and $\bar {a}_{i} := \left (a_{i,1},a_{i,2}\ldots ,a_{i,s_i}\right )$ for $1 \leqslant i \leqslant r$ . Two cases will be considered, depending on whether H contains a greatest element or not.

Case 1: H contains a greatest element a. By Lemma 5.7, there exists a k-coloured tree $\mathfrak {S}_0 = \left (S_0;<_{\mathfrak {S}_0},\bar {\mathbf {c}}\right )$ and a node $b \in S_0$ such that $(\mathfrak {T};a) \equiv _n \left (\mathfrak {S}_0;b\right )$ and $(\mathfrak {S}_0^{< b})^- \cong \lambda $ and $\mathsf {Comp}_{\lambda }(\mathfrak {S}_0^{J_0})$ holds for $J_0 := S_0 \backslash S_0^{\leqslant b}$ .

By Lemma 5.6, each of the trees $\mathfrak {F}_i$ is a model of $T\lambda _k$ . By the induction hypothesis, there exists, for each i with $1 \leqslant i \leqslant r$ , a k-coloured tree $\mathfrak {G}_i = \left (G_i;<_{\mathfrak {G}_i},\bar {\mathbf {c}}\right )$ and a tuple $\bar {b}_i := \left (b_{i,1},b_{i,2},\ldots ,b_{i,s_i}\right )$ of nodes in $G_i$ such that

  1. (i) $\left (\mathfrak {F}_i;\bar {a}_i\right ) \equiv _n (\mathfrak {G}_i;\bar {b}_i)$ ,

  2. (ii) $(\mathfrak {G}_i^{< b_{i,j}})^- \cong \lambda $ for each j with $1 \leqslant j \leqslant s_i$ , and

  3. (iii) $\mathsf {Comp}_{\lambda }(\mathfrak {G}_i^{J_i})$ holds, where $J_i := G_i \backslash \bigcup _{j=1}^{s_i} G_i^{\leqslant b_{i,j}}$ .

Now, we define the tree $\mathfrak {S}$ as follows. Let $\mathfrak {G}$ be the disjoint union of all the trees $(\mathfrak {G}_i;\bar {b}_i)$ for $1 \leqslant i \leqslant r$ , along with all the trees $\mathfrak {F}_i$ for $i \in I \backslash \left \{1,2,\ldots ,r\right \}$ . Let $S_0' := S_0 \backslash \left (S_0\right )^{> b}$ , $\mathfrak {S}_0' := \mathfrak {S}_0^{S_0'}$ be the restriction of $\mathfrak {S}_0$ to $S_0'$ , $\mathsf {B} := \left (S_0'\right )^{\leqslant b}$ , and $\mathfrak {S} := \mathfrak {S}_0' +_{\mathsf {B}} \mathfrak {G}$ .

To see that $\left (\mathfrak {T};\bar {a}\right ) \equiv _n (\mathfrak {S};\bar {b})$ , let $T' := T \backslash T^{> a}$ , $\mathfrak {T}' := \mathfrak {T}^{T'}$ , $\mathsf {A} := T^{\leqslant a}$ , and $\mathfrak {F} := \mathfrak {T}^{> a}$ . Since $(\mathfrak {T};a) \equiv _n \left (\mathfrak {S}_0;b\right )$ then $\left (\mathfrak {T}';\mathsf {A}\right ) \equiv _n \left (\mathfrak {S}_0';\mathsf {B}\right )$ , and it is clear that $(\mathfrak {F};\bar {a}_1\bar {a}_2\cdots \bar {a}_r ) \equiv _n (\mathfrak {G};\bar {b}_1\bar {b}_2\cdots \bar {b}_r)$ . By Lemma 2.2, $\left (\mathfrak {T}' +_{\mathsf {A}} \mathfrak {F}; \bar {a}_1\bar {a}_2\cdots \bar {a}_r \right ) \equiv _n (\mathfrak {S}_0' +_{\mathsf {B}} \mathfrak {G}; \bar {b}_1\bar {b}_2\cdots \bar {b}_r )$ , from which it follows that $\left (\mathfrak {T};\bar {a}\right ) \equiv _n (\mathfrak {S};\bar {b})$ .

Next, we will show that $(\mathfrak {S}^{< b_i})^- \cong \lambda $ for each i. Suppose that $b_i = b_{j,l} \in G_j$ . Since $(\mathfrak {S}^{<b})^- \cong (\mathfrak {S}_0^{<b})^- \cong \lambda $ and $(\mathfrak {S}^{(b,b_i)})^- \cong (\mathfrak {G}_j^{<b_{j,l}})^- \cong \lambda $ (where $\mathfrak {S}^{\left (b,b_i\right )}$ denotes the restriction of $\mathfrak {S}$ to the interval $\left (b,b_i\right )$ ), it follows that $(\mathfrak {S}^{<b_i})^- \cong (\mathfrak {S}^{< b})^- + 1 + (\mathfrak {S}^{(b,b_i)})^- \cong \lambda + 1 + \lambda \cong \lambda $ .

Finally, the claim that $\mathsf {Comp}_{\lambda }\left (\mathfrak {S}^J\right )$ holds for $J = S \backslash \bigcup _{i=1}^m S^{\leqslant b_i}$ follows by the corresponding properties that hold in $\mathfrak {S}_0$ and in each of the trees $\mathfrak {G}_i$ , along with using Lemma 5.6 when working in the trees $\mathfrak {F}_i$ for $i \in I \backslash \left \{1,2,\ldots ,r\right \}$ .

Case 2: H does not contain a greatest element. It follows by the branching completeness of $\mathfrak {T}$ that each $\mathfrak {F}_i$ must contain a least element. Fix $i_0 \in I$ and let a be the root of $\mathfrak {F}_{i_0}$ and $\mathsf {A} := T^{<a}$ . Let $T' = T \backslash \bigcup _{i \in I} F_i$ and $\mathfrak {T}' = \mathfrak {T}^{T'}$ . Observe that, as $\mathsf {A}$ does not have a greatest element, $\mathsf {A}^{\mathfrak {T}}\left [\bar {\mathbf {e}}\right ] = \mathsf {A}^{\mathfrak {T}'}\left [\bar {\mathbf {e}}\right ]$ .

Let $p := f\left (L_k',n\right )$ and $\bar {\mathbf {e}} := \bar {\mathbf {e}}_p$ . Since $(\mathfrak {T};a) \models \left \{ \sigma ^n_a : \sigma \in C\lambda _{n+p} \right \}$ then $\mathsf {A}^{\mathfrak {T}}\left [\bar {\mathbf {e}}\right ] \models C\lambda _{n+p}$ . Hence, by Fact 3.12, there exists an $\left (n+p\right )$ -coloured linear order $\mathfrak {B} = \left (B;<_{\mathfrak {B}},\bar {\mathbf {c}},\bar {\mathbf {e}}\right )$ such that $\mathfrak {B} \equiv _n \mathsf {A}^{\mathfrak {T}}\left [\bar {\mathbf {e}}\right ]$ and $\mathfrak {B}^- \cong \lambda $ . As with the construction of the tree $\mathfrak {S}$ in the proof of Lemma 5.7, we can add side-forests to the elements of B to obtain a k-coloured tree $\mathfrak {S}_0 := \left (S_0;<_{\mathfrak {S}_0},\bar {\mathbf {c}}\right )$ , and a path $\mathsf {D}$ in $\mathfrak {S}_0$ , such that $\mathsf {D}^{\mathfrak {S}_0}\left [\bar {\mathbf {e}}\right ] \cong \mathfrak {B}$ , from which

(3) $$ \begin{align} \mathsf{A}^{\mathfrak{T}'}\left[\bar{\mathbf{e}}\right] = \mathsf{A}^{\mathfrak{T}}\left[\bar{\mathbf{e}}\right] \equiv_n \mathfrak{B} \cong \mathsf{D}^{\mathfrak{S}_0}\left[\bar{\mathbf{e}}\right]. \end{align} $$

$\mathfrak {T}'$ is branching complete, due to the fact that $\mathfrak {T}$ satisfies $\mathsf {bc}$ , and it can be shown that $\mathfrak {S}_0$ is branching complete, similarly to how it was shown that the tree $\mathfrak {S}$ in the proof of Lemma 5.7 was branching complete. It then follows, using (3) and Theorem 4.2, that $\left (\mathfrak {T}';\mathsf {A}\right ) \equiv _n \left (\mathfrak {S}_0;\mathsf {D}\right )$ .

Similarly to Case 1, for each i, where $1 \leqslant i \leqslant r$ , there exists a k-coloured tree $\mathfrak {G}_i$ and a tuple $\bar {b}_i := \left (b_{i,1},b_{i,2},\ldots ,b_{i,s_i}\right )$ of nodes in $G_i$ , such that properties (i $'$ ) and (iii $'$ ) hold, in addition to property (ii $'$ ) replaced by:

  1. (iv) $(\mathfrak {G}_i^{< b_{i,j}})^- \cong 1+\lambda $ for each j with $1 \leqslant j \leqslant s_i$ .

Now, we define $\mathfrak {G}$ as in Case 1 and let $\mathfrak {S} := \mathfrak {S}_0 +_{\mathsf {D}} \mathfrak {G}$ . By similar arguments to those used in Case 1 and Lemma 5.7, it can again be seen that properties (i)–(iii) hold.⊣

Theorem 5.9. Let $\mathfrak {T}$ be a k-coloured model of $T\lambda _k$ . Then, for each $n \in \mathbb {N}$ there exists a k-coloured $\lambda $ -tree $\mathfrak {S}$ such that $\mathfrak {T} \equiv _n \mathfrak {S}$ .

Proof Without loss of generality, we can assume that $n \geqslant 5$ . We define the k-coloured trees $\mathfrak {T}_0,\mathfrak {T}_1,\mathfrak {T}_2,\ldots $ and sets $H_0,H_1,H_2,\ldots $ with each $H_i \subseteq T_i$ , as follows. First, take $\mathfrak {T}_0 := \mathfrak {T}$ and $H_0 = \emptyset $ . Now, given $\mathfrak {T}_i$ and $H_i$ , let $T_i' := T_i \backslash H_i$ and $\mathfrak {T}_i' := \mathfrak {T}_i^{T_i'}$ , and suppose that these have the following properties:

  1. (a) $H_i$ is a left-closed subset of $T_i$ and each path in $\mathfrak {H}_i := \mathfrak {T}_i^{H_i}$ is isomorphic to $\lambda +1$ ;

  2. (b) $\mathfrak {T}_i$ is dense and branching complete;

  3. (c) $\mathsf {Comp}_{\lambda }\left (\mathfrak {T}_i'\right )$ holds.

Note that property (a) is satisfied vacuously by $H_0$ , property (b) is satisfied by $\mathfrak {T}_0$ (with the density property being encoded in the theory $\left (C\lambda _k\right )'$ that is satisfied by $\mathfrak {T}$ ), and property (c) is trivially satisfied by $\mathfrak {T}_0'$ . The construction of $\mathfrak {T}_{i+1}$ and $H_{i+1}$ is described below.

Let F be any component in $\mathfrak {T}_i'$ and put $\mathfrak {F} := \mathfrak {T}_i^{F}$ . If $i = 0$ then $\mathfrak {F} = \mathfrak {T}$ while if $i> 0$ then $\mathfrak {F}$ can be represented in one of the forms (i) and (ii) below as follows. For each path $\mathsf {X}$ in $\mathfrak {H}_i$ (which can be viewed as a stem in $\mathfrak {T}_i$ ), since $\mathsf {X}$ has a greatest node and $\mathfrak {T}_i$ is branching complete, the set of side-forests $\left \{F\left ( x \backslash \mathsf {X} \right )\right \}_{x \in \mathsf {X}}$ form a partition of $T_i$ . There must hence exist a path $\mathsf {A}$ in $\mathfrak {H}_i$ and a node $d \in \mathsf {A}$ such that F is contained in the side-forest of d with respect to $\mathsf {A}$ . In particular, $(\mathfrak {T}_i^{< d})^- \cong \lambda $ and either $F \subseteq F_u{\left ( d \backslash \mathsf {A} \right )}$ or $F \subseteq F_l{\left (d\right )}$ . Now, if $F \subseteq F_u{\left ( d \backslash \mathsf {A} \right )}$ then, using the fact that $\mathfrak {T}_i$ is dense, $\mathfrak {F}$ has the following form:

  1. (i) $\mathfrak {F}$ has no root and $F \subseteq T_i^{>d}$ with $(\mathfrak {T}_i^{\leqslant d})^- \cong \lambda +1$ ,

while, if $F \subseteq F_l{\left (d\right )}$ then, using the fact that $\mathfrak {T}_i$ is branching complete and dense, $\mathfrak {F}$ has the following form:

  1. (ii) $\mathfrak {F}$ has a root r and $F = T_i^{\geqslant r}$ with $\left (\mathfrak {T}_i^{< r}\right )^- \cong \lambda $ .

In the construction that follows, we hence take $\mathfrak {F} = \mathfrak {T}$ when $i = 0$ and if $i> 0$ then we can restrict our attention to components that have the form of case (i), because in case (ii) each component $F'$ in $\mathfrak {T}_i^{> r}$ will have the form of case (i) (but, with r fulfilling the role of d) and the subsequent construction is then performed using this $F'$ rather than F.

Let $\left \{a_1,a_2,\ldots ,a_p\right \}$ be an antichain in $\mathfrak {F}$ such that $\bigcup _{i = 1}^p F^{\leqslant a_i}$ contains an n-support for $\mathfrak {F}$ (recall the definition of ‘n-support’ at the end of Section 2.1) and let $\bar {a} := \left (a_1,a_2,\ldots ,a_p\right )$ . By property (c) and Lemma 5.8, there exists a k-coloured tree $\mathfrak {G}$ and a tuple of nodes $\bar {b} := \left (b_1,b_2,\ldots ,b_p\right ) \in G^p$ such that: $\left (\mathfrak {F};\bar {a}\right ) \equiv _n (\mathfrak {G};\bar {b})$ , $(\mathfrak {G}^{< b_j})^- \cong \lambda $ for each j, and $\mathsf {Comp}_{\lambda }\left (\mathfrak {G}^J\right )$ holds for $J := G \backslash \left (\bigcup _{j=1}^p G^{\leqslant b_j}\right )$ . It then follows that

( $\dagger $ ): for each $a \in F$ there exists $b \in \bigcup _{i=1}^p G^{\leqslant b_i}$ such that $\left (\mathfrak {F};a\right ) \equiv _{n-1} \left (\mathfrak {G};b\right )$ .

The tree $\mathfrak {T}_{i+1}$ is now obtained from $\mathfrak {T}_i$ by replacing $\mathfrak {F}$ with $\mathfrak {G}$ and by performing a similar replacement for every other component of $\mathfrak {T}_i'$ , while $H_{i+1}$ is the union over all these components $\mathfrak {G}$ of the sets $\bigcup _{j=1}^p T_{i+1}^{\leqslant b_j}$ .

Note that since $\mathfrak {H}_{i+1}$ is obtained from $\mathfrak {H}_i$ by adding only finitely many new paths corresponding to each component F in $\mathfrak {T}_i'$ , none of the paths in $\mathfrak {H}_{i+1}$ are emerging paths, and hence there is no possibility of producing ‘unwanted’ paths that are isomorphic to $\lambda $ by diagonalisation.

Each path in $\mathfrak {H}_{i+1}$ will be of the form

$$ \begin{align*}(\mathfrak{T}_i^{\leqslant d})^- + (\mathfrak{S}^{\leqslant b_j})^- \cong (\lambda+1) + (\lambda+1) \cong \lambda+1\end{align*} $$

(or simply of the form $(\mathfrak {S}^{\leqslant b_j})^- \cong \lambda +1$ when $i=0$ ) so property (a) holds for $H_{i+1}$ . To see that property (b) holds for $\mathfrak {T}_{i+1}$ , first note that each of the trees $\mathfrak {G}$ that was used to replace the tree $\mathfrak {F}$ in the above construction is itself dense and branching complete, since $\mathfrak {F} \equiv _n \mathfrak {G}$ and the properties of denseness and branching completeness can be expressed by sentences of quantifier rank at most $5$ . We can now use an argument similar to the one used in Lemma 5.6 to prove that the new tree that was constructed there was branching complete. The claim that property (c) holds for $\mathfrak {T}_{i+1}'$ follows from the fact that each of the trees $\mathfrak {G}$ that was used to replace the tree $\mathfrak {F}$ in the above construction satisfies $\mathsf {Comp}_{\lambda }\left (\mathfrak {G}^J\right )$ for $J = G \backslash \left (\bigcup _{j=1}^p G^{\leqslant b_j}\right )$ .

Observe, moreover, that $\mathfrak {T}_{i+1}$ and $H_{i+1}$ also have the following properties:

  1. (d) $\mathfrak {H}_{i+1}$ is an end-extension of $\mathfrak {H}_i$ ,

  2. (e) $\displaystyle \left (\mathfrak {T}_i,t\right )_{t \in H_i} \equiv _n \left (\mathfrak {T}_{i+1},t\right )_{t \in H_i}$ ,

  3. (f) for each $a \in T_i'$ , there exists $b \in H_{i+1}$ such that

    $$ \begin{align*} \left (\mathfrak {T}_i;t,a\right )_{t \in H_i} \equiv _{n-1} \left (\mathfrak {T}_{i+1};t,b\right )_{t \in H_i}.\end{align*} $$

Property (e) follows from the fact that in the construction of $\mathfrak {T}_{i+1}$ , $\mathfrak {H}_i$ was left intact and the components in $\mathfrak {T}_i'$ were all replaced by n-equivalent trees. Property (f) follows from ( $\dagger $ ).

Finally, let $ \mathfrak {S} := \bigcup _{i=0}^{\infty } \mathfrak {H}_i$ .

(It follows from property (d) that this union is defined.)

Note that even though every path in each $\mathfrak {H}_i$ is singular, the (infinite) union $\mathfrak {S}$ of all these $\mathfrak {H}_i$ ’s may contain emerging paths.

$\mathfrak {S}$ is a $\lambda $ -tree since, clearly, the monochromatic reduct of each of its paths is isomorphic to $\left (\lambda +1\right ) \cdot \omega \cong \lambda $ . To show that $\mathfrak {T} \equiv _n \mathfrak {S}$ , we will describe a winning strategy for Player II for the game $\mathrm {EF}_{n}\!\left (\mathfrak {T},\mathfrak {S}\right )$ . Let $\bar {t} := \left (t_1,t_2,\ldots ,t_m\right ) \in T^m$ and $\bar {s} :=\left (s_1,s_2,\ldots ,s_m\right ) \in S^m$ represent the nodes that have been chosen by the two players after m moves (with $\bar {t} = \varepsilon = \bar {s}$ when $m=0$ ), and such that the following property holds:

$[\star]_{i,\bar {t},\bar {s}}$ : $\bar {s} \in \left (H_i\right )^m$ and $\left (\mathfrak {T};\bar {t}\right ) \equiv _{n-m} \left (\mathfrak {T}_i;\bar {s}\right )$ .

Observe that, by property (e), $\left [\star \right ]_{i,\bar {t},\bar {s}}$ holds trivially when $m=0$ , and if $i<j$ and $\left [\star \right ]_{i,\bar {t},\bar {s}}$ holds then $\left [\star \right ]_{j,\bar {t},\bar {s}}$ holds, too. We will consider two cases for the choice of Spoiler in the ( $m+1$ )-th round of the game:

  • Suppose that Spoiler chooses the node $t_{m+1} \in T$ . By $\left [\star \right ]_{i,\bar {t},\bar {s}}$ there exists $u_{m+1} \in T_i$ such that $\left (\mathfrak {T};\bar {t}t_{m+1}\right ) \equiv _{n-m-1} \left (\mathfrak {T}_i;\bar {s}u\right )$ . If $u \in H_i$ , then take $s_{m+1} := u$ as the response of Duplicator for round $m+1$ of the game and observe that $\left [\star \right ]_{i,\bar {t}t_{m+1},\bar {s}s_{m+1}}$ holds. Otherwise, by property (f) there exists $v \in H_{i+1}$ such that $\displaystyle \left (\mathfrak {T}_i;\bar {s}u\right ) \equiv _{n-1} \left (\mathfrak {T}_{i+1};\bar {s}v\right )$ , hence also $\left (\mathfrak {T};\bar {t}t_{m+1}\right ) \equiv _{n-m-1} \left (\mathfrak {T}_{i+1};\bar {s}v\right )$ . Now, take $s_{m+1} := v$ as the response of Duplicator for round $m+1$ of the game and observe that $\left [\star \right ]_{i+1,\bar {t}t_{m+1},\bar {s}s_{m+1}}$ holds.

  • Suppose that Spoiler chooses $s_{m+1} \in S$ , say with $s_{m+1} \in H_j$ . If $j \leqslant i$ then $s_{m+1} \in H_i$ . Hence, by $\left [\star \right ]_{i,\bar {t},\bar {s}}$ , there exists $u \in T$ such that $\left (\mathfrak {T};\bar {t}u\right ) \equiv _{n-m-1} \left (\mathfrak {T}_i;\bar {s}s_{m+1}\right )$ . Take $t_{m+1} := u$ as the response of Duplicator and observe that $\left [\star \right ]_{i,\bar {t}t_{m+1},\bar {s}s_{m+1}}$ holds. On the other hand, if $i<j$ , then $\left [\star \right ]_{j,\bar {t},\bar {s}}$ holds, too. Duplicator will again have a response $t_{m+1} \in T$ such that $\left [\star \right ]_{j,\bar {t}t_{m+1},\bar {s}s_{m+1}}$ holds.

Clearly, the tuples $\left (t_1,t_2,\ldots ,t_n\right ) \in T^n$ and $\left (s_1,s_2,\ldots ,s_n\right ) \in S^n$ thus defined form a local isomorphism between $\mathfrak {T}$ and $\mathfrak {S}$ , as required.⊣

6 Concluding remarks

The study of classes of trees and their first-order theories is still in an early stage, and still much less explored than the study of first-order theories of classes of linear orders. In particular, the complete first-order axiomatisations of several important classes of trees have so far been open. The work presented in this paper was intended towards advancing that study and, in particular, answering some of these open questions. Our main contributions can be summarised as follows.

  1. 1. The main new technique developed here is a method for approximating any given (possibly, coloured) tree as a coloured linear order, so that the first-order theory of the resulting coloured linear order can be used to determine the first-order theory of the given tree. Using this technique, we have obtained a complete axiomatisation of the first-order theory of the class of coloured $\zeta $ -trees.

  2. 2. Another major new result in this work is the complete axiomatisation of the first-order theory of the class of coloured $\lambda $ -trees. Our completeness proof of that axiomatisation mimics, modulo several essential complications, the completeness proof of the axiomatisation of the class of well-founded trees given in [Reference Doets2]. In fact, in the introduction of Section 5 of [Reference Doets2], Doets says that the completeness proof that is presented there “can be considered as a paradigm for a method applicable in a variety of situations, where the models considered belong to certain types of partial orderings (trees being the simplest example) and the $\Pi ^1_1$ -property involved can be either well-foundedness, converse well-foundedness, or, more generally, some kind of completeness.”

  3. 3. We have also obtained and presented a few easier cases of complete axiomatisations of the first-order theories of other tree classes of natural interest, including $\omega $ -trees and $\eta $ -trees. In addition, the paper summarised the axiomatisations of the first-order theories of important classes of linear orders needed for axiomatising the first-order theories of the classes of trees mentioned above, though most of the former axiomatisations were already known from the literature.

Some directions for future work include the following.

  • Use the methods developed here to axiomatise the first-order theories of other classes of trees of natural interest, such as the coloured trees in which all paths are complete linear orders.

  • Generalise the method of [Reference Doets2] to a result of the kind of Theorem 4.4, presented here. Moreover, the only class to which Theorem 4.4 was applied here was $\left \{\zeta \right \}$ , but it can also be applied to the classes $\{\omega ^{\star }\}$ and $\{\zeta ,\omega ^{\star }\}$ .

  • The methods developed here can possibly be modified to apply to other, more general ordered structures, such as partially ordered sets and lattices.

Acknowledgments

We thank the referee for the careful reading and helpful comments and suggestions on the paper.

Footnotes

1 Note that, in the case of linear orders, maximal elements and greatest elements coincide, but for proper trees they need not.

2 The reason for the different forms of $\mathsf {FD}$ and $\mathsf {BD}$ is that trees are left-linear but not right-linear.

3 Elements $(x,0)$ and $(y,1)$ in $\mathfrak {B}_i+\mathfrak {C}_i$ will be identified with the elements $x \in B_i$ and $y \in C_i$ respectively, so that the elements of $\mathfrak {B}_i$ and $\mathfrak {C}_i$ may be treated as elements of $\mathfrak {A}$ too.

4 We thank the reviewer for the first two references.

5 In fact, only the property of left-connectedness is needed from the axiom $\mathsf {Tree}$ ; the properties of irreflexivity, transitivity, and left-linearity are all encoded in the axioms from $(\Sigma ^k_{\uparrow })'$ and $(\Sigma ^k_{\downarrow })'$ .

References

REFERENCES

Doets, K., Completeness and definability: Applications of the Ehrenfeucht game in second-order and intensional logic, Ph.D. thesis, University of Amsterdam, 1987.Google Scholar
Doets, K., Monadic ${\Pi}_1^1$ -theories of ${\Pi}_1^1$ -properties. Notre Dame Journal of Formal Logic, vol. 30 (1989), no. 2, pp. 224240.CrossRefGoogle Scholar
Doets, K., Basic Model Theory , CSLI Publications & The European Association for Logic Language and Information, Stanford, 1996.Google Scholar
Droste, M., Structure of Partially Ordered Sets with Transitive Automorphism Groups,, Memoirs of the American Mathematical Society, vol. 57, no. 334, American Mathematical Society, Providence, 1985.Google Scholar
Ebbinghaus, H.D., Flum, J., and Thomas, W., Mathematical Logic, Springer, New York, 1996.Google Scholar
Feferman, S. and Vaught, R.L., The first order properties of products of algebraic systems. Fundamenta Mathematicae, vol. 47 (1959), pp. 57103.10.4064/fm-47-1-57-103CrossRefGoogle Scholar
Goranko, V., Trees and finite branching. Proceedings of the 2nd Panhellenic Logic Symposium, 1999, pp. 91–101.Google Scholar
Goranko, V. and Kellerman, R., Classes and theories of trees associated with a class of linear orders. Logic Journal of the IGPL, vol. 19 (2010), pp. 217232.CrossRefGoogle Scholar
Gurevich, Y. and Shelah, S., The decision problem for branching time logic, this Journal, vol. 50 (1985), pp. 668681.Google Scholar
Gurevich, Y. and Shelah, S., To the decision problem for branching time logic, Foundations of Logic and Linguistics: Problems and Their Solutions (Dorn, G. and Weingartner, P., editors), Plenum, New York, 1985, pp. 181198.CrossRefGoogle Scholar
Kellerman, R., First-order theories of bounded trees, 2019, under submission.Google Scholar
Kellerman, R., Goranko, V., and Zanardo, A., Structural theory of trees. II. Completeness and completions of trees, 2021, under submission.Google Scholar
Mwesigye, F. and Truss, J.K., Classification of finite coloured linear orderings. Order, vol. 28 (2011), pp. 387397.CrossRefGoogle Scholar
Mwesigye, F. and Truss, J.K., Ehrenfeucht–Fraïssé games on ordinals. Annals of Pure and Applied Logic, vol. 169 (2018), no. 7, pp. 616636.CrossRefGoogle Scholar
Rogers, J., Backofen, R., and Vijay-Shankar, K., A first-order axiomatization of the theory of finite trees. Journal of Logic, Language and Information, vol. 4 (1995), pp. 539.Google Scholar
Rosenstein, J.G., Linear Orderings , Academic Press, New York, 1982.Google Scholar
Rubin, M., The Reconstruction of Trees from Their Automorphism Groups , Contemporary Mathematics, vol. 151, American Mathematical Society, Providence, 1993.CrossRefGoogle Scholar
Schmerl, J., On ${\mathrm{\aleph}}_0$ -categoricity and the theory of trees. Fundamenta Mathematicae, vol. 94 (1977), pp. 121128.CrossRefGoogle Scholar
Figure 0

Figure 1 A depiction of the tree $\mathfrak {T} +_B \mathfrak {F}$.

Figure 1

Figure 2 The side-forests $\mathfrak {F}_l{\left (s\right )}$ and $\mathfrak {F}_u{\left ( s \backslash \mathsf {A} \right )}$.