
Characterization of the conditional stationary distribution in Markov chains via systems of linear inequalities

Published online by Cambridge University Press:  03 December 2020

Masatoshi Kimura*
Affiliation:
Osaka University
Tetsuya Takine*
Affiliation:
Osaka University
*Postal address: Department of Information and Communications Technology, Graduate School of Engineering, Osaka University, Suita 565-0871, Japan.

Abstract

This paper considers ergodic, continuous-time Markov chains $\{X(t)\}_{t \in (\!-\infty,\infty)}$ on $\mathbb{Z}^+=\{0,1,\ldots\}$ . For an arbitrarily fixed $N \in \mathbb{Z}^+$ , we study the conditional stationary distribution $\boldsymbol{\pi}(N)$ given that the Markov chain is in $\{0,1,\ldots,N\}$ . We first characterize $\boldsymbol{\pi}(N)$ via systems of linear inequalities and identify simplices that contain $\boldsymbol{\pi}(N)$ , by examining the $(N+1) \times (N+1)$ northwest corner block of the infinitesimal generator $\textbf{\textit{Q}}$ and the subset of the first $N+1$ states whose members are directly reachable from at least one state in $\{N+1,N+2,\ldots\}$ . These results are closely related to the augmented truncation approximation (ATA), and we provide some practical implications for the ATA. Next we consider an extension of the above results, using the $(K+1) \times (K+1)$ ( $K > N$ ) northwest corner block of $\textbf{\textit{Q}}$ and the subset of the first $K+1$ states whose members are directly reachable from at least one state in $\{K+1,K+2,\ldots\}$ . Furthermore, we introduce new state transition structures called (K, N)-skip-free sets, using which we obtain the minimum convex polytope that contains $\boldsymbol{\pi}(N)$ .

Type
Original Article
Copyright
© Applied Probability Trust 2020

1. Introduction

We consider a time-homogeneous, continuous-time Markov chain $\{X(t)\}_{t \in (\!-\infty,\infty)}$ on a countably infinite state space $\mathbb{Z}^+ = \{0,1,\dots\}$ . Let $q_{i,j}$ ( $i,j \in \mathbb{Z}^+, i \neq j$ ) denote the transition rate from state i to state j. The infinitesimal generator $\textbf{\textit{Q}}$ of the Markov chain $\{X(t)\}_{t \in (\!-\infty,\infty)}$ is then given by

\begin{equation*}\textbf{\textit{Q}}=\left(\begin{array}{c@{\quad}c@{\quad}c@{\quad}c@{\quad}c}-q_0 & q_{0,1} & q_{0,2} & q_{0,3} & \dots\\ \\[-5pt] q_{1, 0} & -q_1 & q_{1,2} & q_{1,3} & \dots\\ \\[-5pt] q_{2, 0} & q_{2,1} & -q_2 & q_{2,3} & \dots\\ \\[-5pt] q_{3, 0} & q_{3,1} & q_{3,2} & -q_3 & \dots\\ \\[-5pt] \vdots &\vdots &\vdots & \vdots & \ddots\end{array}\right),\end{equation*}

where

\begin{equation*}q_i = \sum_{\substack{j \in \mathbb{Z}^+\\ j \neq i}} q_{i,j},\quad i \in \mathbb{Z}^+.\end{equation*}

We assume that $q_i < \infty$ ( $i \in \mathbb{Z}^+$ ) and $\{X(t)\}_{t \in (\!-\infty,\infty)}$ is stationary and ergodic. We then define $\boldsymbol{\pi}$ as the stationary distribution of $\{X(t)\}_{t \in (\!-\infty,\infty)}$ :

\begin{equation*}\boldsymbol{\pi} = (\pi_0\ \pi_1\ \pi_2\ \dots),\end{equation*}

where $\pi_i = \mathbb{P}(X(0)=i)$ ( $i \in \mathbb{Z}^+$ ). Note that $\boldsymbol{\pi}$ is determined uniquely by

(1.1) \begin{equation}\boldsymbol{\pi}\textbf{\textit{Q}} = \textbf{0},\qquad\boldsymbol{\pi}\textbf{\textit{e}} = 1,\end{equation}

where $\textbf{\textit{e}}$ denotes a column vector of the appropriate dimension whose elements are all equal to one.
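
For a finite state space, (1.1) is a routine linear solve; the difficulty discussed next is precisely that the state space here is infinite. As a reminder of the finite case, here is a sketch in NumPy with an arbitrary illustrative 3-state generator (the rates are hypothetical, not from the paper):

```python
import numpy as np

# Arbitrary illustrative 3-state generator (off-diagonal rates q_{i,j},
# diagonal -q_i so that each row sums to zero)
Q = np.array([[-3.0,  2.0,  1.0],
              [ 1.0, -4.0,  3.0],
              [ 2.0,  2.0, -4.0]])

# Solve pi Q = 0, pi e = 1: one balance equation is redundant,
# so replace it by the normalization condition.
A = np.vstack([Q.T[:-1], np.ones(3)])
b = np.array([0.0, 0.0, 1.0])
pi = np.linalg.solve(A, b)

print(np.allclose(pi @ Q, 0.0), np.isclose(pi.sum(), 1.0))  # True True
```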

In general, it is hard to obtain the stationary distribution of the Markov chain $\{X(t)\}_{t \in (\!-\infty, \infty)}$ because there are infinitely many unknowns $\pi_i$ ( $i \in \mathbb{Z}^+$ ) in (1.1). In the past, this problem has been tackled mainly by two approaches: the augmented truncation approximation [Reference Hart and Tweedie4, Reference Liu, Li and Masuyama7, Reference Seneta12, Reference Zhao and Liu16] and the matrix-analytic method [Reference Latouche and Ramaswami6, Reference Neuts10, Reference Neuts11]. In the augmented truncation approximation, we choose a sufficiently large $N \in \mathbb{Z}^+$ and partition $\mathbb{Z}^+$ into a finite subset $\mathbb{Z}_0^N$ and its complement $\mathbb{Z}_{N+1}^{\infty}$ , where $\mathbb{Z}_m^{\ell}$ ( $m,\ell \in\mathbb{Z}^+$ ) and $\mathbb{Z}_m^{\infty}$ ( $m \in \mathbb{Z}^+$ ) are defined as

\begin{equation*}\mathbb{Z}_m^{\ell}=\begin{cases}\{m, m+1, \dots, \ell\}, & m \leq \ell,\\[4pt] \emptyset, &m > \ell,\end{cases}\qquad\mathbb{Z}_m^{\infty}=\{m, m+1,\dots\},\quad m \in \mathbb{Z}^+.\end{equation*}

We also partition the stationary distribution $\boldsymbol{\pi}$ in conformance with the partition of the state space:

(1.2) \begin{equation}\boldsymbol{\pi} =\kbordermatrix{&\mathbb{Z}_0^N &\quad \mathbb{Z}_{N+1}^{\infty}\\ \\[-9pt]& \boldsymbol{\pi}^{(1)}(N) &\quad \boldsymbol{\pi}^{(2)}(N)}.\end{equation}

We then construct a finite-state Markov chain on $\mathbb{Z}_0^N$ , using the $(N+1) \times (N+1)$ northwest corner block $\textbf{\textit{Q}}^{(1,1)}(N)$ of the infinitesimal generator $\textbf{\textit{Q}}$ . This finite-state Markov chain on $\mathbb{Z}_0^N$ can be regarded as an approximation to the original Markov chain on $\mathbb{Z}^+$ . Specifically, we construct an infinitesimal generator $\textbf{\textit{Q}}^{(1,1)}(N) +\textbf{\textit{Q}}_{\textrm{A}}(N)$ of a Markov chain on $\mathbb{Z}_0^N$ , where $\textbf{\textit{Q}}_{\textrm{A}}(N)$ is an augmentation matrix such that $\textbf{\textit{Q}}_{\textrm{A}}(N)\geq \textbf{\textit{O}}$ and $\big[\textbf{\textit{Q}}^{(1,1)}(N) + \textbf{\textit{Q}}_{\textrm{A}}(N)\big] \textbf{\textit{e}}= \textbf{0}$ . Usually, $[\textbf{\textit{Q}}^{(1,1)}(N)+\textbf{\textit{Q}}_{\textrm{A}}(N)]$ is assumed to be irreducible. Let $\boldsymbol{\pi}(N)$ ( $N \in \mathbb{Z}^+$ ) denote the conditional stationary distribution given $X(0) \in \mathbb{Z}_0^N$ :

\begin{equation*}\boldsymbol{\pi}(N)=(\pi_0(N)\ \pi_1(N)\ \cdots \ \pi_N(N)),\end{equation*}

where $\pi_i(N)=\mathbb{P}(X(0)=i \mid X(0) \in \mathbb{Z}_0^N)$ ( $i \in \mathbb{Z}_0^N$ ). By definition, we have

(1.3) \begin{equation}\boldsymbol{\pi}(N) = \frac{\boldsymbol{\pi}^{(1)}(N)}{\boldsymbol{\pi}^{(1)}(N)\textbf{\textit{e}}},\end{equation}

where $\boldsymbol{\pi}^{(1)}(N)$ is given in (1.2). In the augmented truncation approximation, an approximation $\boldsymbol{\pi}^{\textrm{trunc}}(N)$ to $\boldsymbol{\pi}(N)$ is determined by $\boldsymbol{\pi}^{\textrm{trunc}}(N) [\textbf{\textit{Q}}^{(1,1)}(N) + \textbf{\textit{Q}}_{\textrm{A}}(N)] = \textbf{0}$ and $\boldsymbol{\pi}^{\textrm{trunc}}(N) \textbf{\textit{e}} = 1$ . We then adopt $\boldsymbol{\pi}^{\textrm{approx},N} = (\boldsymbol{\pi}^{\textrm{trunc}}(N)\ \ \textbf{0})$ as an approximation to the stationary distribution $\boldsymbol{\pi}$ of the original Markov chain. Under some technical conditions, $\boldsymbol{\pi}^{\textrm{approx},N}$ is known to converge to $\boldsymbol{\pi}$ as N goes to infinity [Reference Hart and Tweedie4, Reference Seneta12]. Therefore, $\boldsymbol{\pi}^{\textrm{approx},N}$ will be a good approximation if N is sufficiently large.
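
As a concrete (hypothetical) illustration of the ATA, consider an M/M/1 queue with arrival rate $\lambda$ and service rate $\mu > \lambda$; the rates $\lambda=1$, $\mu=2$ below are illustrative choices, not from the paper. Routing the deficiency of the truncated row back onto state N yields the M/M/1/N generator, whose stationary distribution happens to coincide with the conditional distribution $\boldsymbol{\pi}(N)$ of the infinite chain, so this particular augmentation is exact here:

```python
import numpy as np

lam, mu, N = 1.0, 2.0, 4

# (N+1) x (N+1) northwest corner block Q^(1,1)(N) of the M/M/1 generator
Q11 = np.diag([-lam] + [-(lam + mu)] * N)
Q11 += np.diag([lam] * N, k=1) + np.diag([mu] * N, k=-1)

# Augmentation matrix Q_A(N) >= O: return each row's deficiency to state N,
# so that [Q^(1,1)(N) + Q_A(N)] e = 0 (here only row N is deficient, by lam).
QA = np.zeros_like(Q11)
QA[:, N] -= Q11.sum(axis=1)
G = Q11 + QA

# Solve pi_trunc G = 0, pi_trunc e = 1 (replace one redundant balance
# equation by the normalization condition).
A = np.vstack([G.T[:-1], np.ones(N + 1)])
b = np.zeros(N + 1); b[-1] = 1.0
pi_trunc = np.linalg.solve(A, b)

# For M/M/1 the conditional stationary distribution is truncated geometric.
rho = lam / mu
pi_N = rho ** np.arange(N + 1)
pi_N /= pi_N.sum()
print(np.allclose(pi_trunc, pi_N))  # True
```

In general, of course, the quality of the ATA depends on the choice of $\textbf{\textit{Q}}_{\textrm{A}}(N)$; this skip-free example is a special case where one choice is exact.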

On the other hand, the matrix-analytic method utilizes structures of Markov chains which appear often in applications. Specifically, the standard matrix-analytic method is applicable to block-structured Markov chains with the following features: the state space $\mathbb{Z}^+$ can be partitioned into finite and disjoint subsets $\mathcal{L}_m$ ( $m \in \mathbb{Z}^+$ ), called levels, in such a way that direct transitions between levels are skip-free in one direction (or both). In particular, Markov chains are called M/G/1-type if transitions between levels are skip-free to the left [Reference Neuts11], and they are called G/M/1-type if transitions between levels are skip-free to the right [Reference Neuts10]. If transitions between levels are skip-free in both directions, they are called quasi-birth-and-death processes [Reference Bright and Taylor1, Reference Neuts10]. The matrix-analytic method targets the exact computation of the stationary distribution.

In both of the above-mentioned approaches, the (conditional) stationary distribution is characterized by systems of linear equations. By contrast, this paper studies the conditional stationary distribution $\boldsymbol{\pi}(N)$ ( $N \in \mathbb{Z}^+$ ) via systems of linear inequalities. It is known that the boundary vector in block-structured Markov chains of level-dependent M/G/1 type can be obtained as the solution of a system of infinitely many linear inequalities; based on this, computational algorithms for the conditional stationary distribution $\boldsymbol{\pi}(N)$ have been developed in [Reference Kimura and Takine5, Reference Takine15].

The purpose of this paper is to characterize the conditional stationary distribution $\boldsymbol{\pi}(N)$ via systems of linear inequalities, without assuming any regular structures. In other words, we will attempt to find minimum regions on the first orthant $\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}} \geq \textbf{0}\}$ of $\mathbb{R}^{N+1}$ which contain $\boldsymbol{\pi}(N)$ . Specifically, we first consider $\boldsymbol{\pi}(N)$ based only on the $(N+1) \times (N+1)$ northwest corner block $\textbf{\textit{Q}}^{(1,1)}(N)$ of the infinitesimal generator $\textbf{\textit{Q}}$ , and we obtain a system of $N+1$ linear inequalities that $\boldsymbol{\pi}(N)$ satisfies. It immediately follows that $\boldsymbol{\pi}(N)$ lies in an N-simplex $\mathcal{P}(N)$ on the first orthant of $\mathbb{R}^{N+1}$ , where vertices of $\mathcal{P}(N)$ are determined only by $\textbf{\textit{Q}}^{(1,1)}(N)$ . As we will see, the derivation of these results is remarkably simple. Nonetheless, the $N+1$ vertices of the N-simplex $\mathcal{P}(N)$ (i.e., $N+1$ linearly independent probability vectors that span $\mathcal{P}(N)$ ) are essential for $\boldsymbol{\pi}(N)$ ; in any ergodic, continuous-time Markov chain with $\textbf{\textit{Q}}^{(1,1)}(N)$ , $\boldsymbol{\pi}(N)$ can be expressed as a convex combination of those vertices. We also provide a probabilistic interpretation of this result.

Next we consider the case that a subset

(1.4) \begin{equation}\mathcal{J}(N)=\{j \in \mathbb{Z}_0^N;\ \mbox{$q_{i,j} > 0$ for some $i \in \mathbb{Z}_{N+1}^{\infty}$}\}\end{equation}

of $\mathbb{Z}_0^N$ is available, as well as $\textbf{\textit{Q}}^{(1,1)}(N)$ . Note that $\mathcal{J}(N)$ can be regarded as a kind of structural information, because it is the subset of states in $\mathbb{Z}_0^N$ which are directly reachable from at least one state in $\mathbb{Z}_{N+1}^{\infty}$ . We show that the role of $\mathcal{J}(N)$ is to eliminate redundant vertices of $\mathcal{P}(N)$ , and we obtain a $(|\mathcal{J}(N)|-1)$ -simplex $\mathcal{P}^+(N)$ whose relative interior contains $\boldsymbol{\pi}(N)$ , where for any set $\mathcal{X}$ , $|\mathcal{X}|$ stands for the cardinality of $\mathcal{X}$ . In other words, when $\mathcal{J}(N)$ is available, $\boldsymbol{\pi}(N)$ is given by a convex combination of $|\mathcal{J}(N)|$ vertices of $\mathcal{P}^+(N)$ with positive weights. These results are closely related to the augmented truncation approximation (ATA), and we provide some practical implications for it. In particular, the linear ATA has the same degree of freedom as the general ATA does, and $\mathcal{J}(N)$ is useful in choosing an augmentation matrix in the linear ATA.

Next we consider an extension of the above results, using state transitions in peripheral states of $\mathbb{Z}_0^N$ . Specifically, for a fixed K ( $K > N$ ), we obtain an N-simplex $\widetilde{\mathcal{P}}(K,N)$ ( $\subseteq \mathcal{P}(N)$ ) that contains $\boldsymbol{\pi}(N)$ , which is determined only by the $(K+1) \times (K+1)$ northwest corner block $\textbf{\textit{Q}}^{(1,1)}(K)$ of $\textbf{\textit{Q}}$ . Furthermore, when both $\textbf{\textit{Q}}^{(1,1)}(K)$ and $\mathcal{J}(K)$ are available, we show that redundant vertices of $\widetilde{\mathcal{P}}(K,N)$ can be eliminated.

The common feature of all of the above results is that we characterize $\boldsymbol{\pi}(N)$ by specifying simplices on the first orthant of $\mathbb{R}^{N+1}$ which contain $\boldsymbol{\pi}(N)$ . In order to refine these results further, we consider general convex polytopes, using $\textbf{\textit{Q}}^{(1,1)}(K)$ and $\mathcal{J}(K)$ . For this purpose, we introduce a new structure called (K, N)-skip-free sets, and using them, we obtain the minimum convex polytope that contains $\boldsymbol{\pi}(N)$ , under the condition that only $\textbf{\textit{Q}}^{(1,1)}(K)$ and $\mathcal{J}(K)$ are available.

The rest of this paper is organized as follows. In Section 2, we obtain simplices that contain $\boldsymbol{\pi}(N)$ , based on $\textbf{\textit{Q}}^{(1,1)}(N)$ (and $\mathcal{J}(N)$ ). We also provide some practical implications for the augmented truncation approximation. In Section 3, we extend the result in the preceding section, using state transitions in peripheral states. In Section 4, we introduce skip-free sets and obtain the minimum convex polytope that contains $\boldsymbol{\pi}(N)$ . Finally, some concluding remarks are provided in Section 5.

2. The characterization of $\boldsymbol{\pi}(N)$ by linear inequalities: fundamental results

In this section, we characterize the conditional stationary distribution $\boldsymbol{\pi}(N)$ via systems of linear inequalities, based on the $(N+1) \times (N+1)$ northwest corner block of the infinitesimal generator $\textbf{\textit{Q}}$ .

2.1. Preliminaries

We will construct a convex polytope $\mathcal{P}$ , by considering the intersection of a polyhedral convex cone $\mathcal{C}$ on the first orthant $\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}} \geq \textbf{0}\}$ of $\mathbb{R}^{N+1}$ and a hyperplane containing all probability vectors in $\mathbb{R}^{N+1}$ . As we will see, $\mathcal{C}$ and $\mathcal{P}$ take the following forms:

(2.1) \begin{align}\mathcal{C}&=\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}}\textbf{\textit{A}} \geq \textbf{0},\ \textbf{\textit{x}}\textbf{\textit{B}} = \textbf{0}\},\nonumber\\[3pt] \mathcal{P}&=\mathcal{C} \cap \{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}}\textbf{\textit{e}} =1 \}=\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}}\textbf{\textit{A}} \geq \textbf{0},\ \textbf{\textit{x}}\textbf{\textit{B}} = \textbf{0},\ \textbf{\textit{x}}\textbf{\textit{e}} =1\},\end{align}

where $\textbf{\textit{A}}$ and $\textbf{\textit{B}}$ denote appropriate matrices with $N+1$ rows. Note that we allow $\textbf{\textit{B}} = \textbf{\textit{O}}$ , and in that case, we can ignore the constraint $\textbf{\textit{x}}\textbf{\textit{B}} = \textbf{0}$ in (2.1). In general, a convex polytope $\mathcal{P}$ can also be represented by a set of convex combinations of vertices of $\mathcal{P}$ . Specifically, by using an appropriate nonnegative matrix $\overline{\textbf{\textit{C}}}$ with $N+1$ columns such that $\overline{\textbf{\textit{C}}}\textbf{\textit{e}}=\textbf{\textit{e}}$ , $\mathcal{P}$ in (2.1) can also be represented as

\begin{align*}\mathcal{P}&=\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}}=\boldsymbol{\alpha}\overline{\textbf{\textit{C}}},\boldsymbol{\alpha} \geq \textbf{0},\boldsymbol{\alpha}\textbf{\textit{e}} =1\}.\end{align*}

Note that $\overline{\textbf{\textit{C}}}$ is composed of $1 \times (N+1)$ probability vectors $\overline{\textbf{\textit{c}}}_i$ ( $i=0,1,\ldots,M$ ) that span $\mathcal{P}$ . If the M vectors $\overline{\textbf{\textit{c}}}_1 - \overline{\textbf{\textit{c}}}_0$ , $\overline{\textbf{\textit{c}}}_2-\overline{\textbf{\textit{c}}}_0$ , $\ldots$ , $\overline{\textbf{\textit{c}}}_M-\overline{\textbf{\textit{c}}}_0$ are linearly independent, $\mathcal{P}$ is called an M-simplex. Let $\textrm{ri}\, \mathcal{P}$ denote the relative interior of the convex polytope $\mathcal{P}$ :

(2.2) \begin{align}\textrm{ri}\, \mathcal{P}&=\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}} \textbf{\textit{A}} > \textbf{0},\ \textbf{\textit{x}}\textbf{\textit{B}}=\textbf{0},\ \textbf{\textit{x}}\textbf{\textit{e}}=1\}\\[3pt] &= \{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}} = \boldsymbol{\alpha}\overline{\textbf{\textit{C}}},\boldsymbol{\alpha} > \textbf{0},\ \boldsymbol{\alpha}\textbf{\textit{e}}=1\}.\nonumber\end{align}

2.2. Characterization of $\boldsymbol{\pi}(N)$ in terms of $\textbf{\textit{Q}}^{(1,1)}(N)$

In this subsection, we assume that for an arbitrarily fixed $N \in\mathbb{Z}^+$ , the stationary distribution $\boldsymbol{\pi}$ of an ergodic, continuous-time Markov chain $\{X(t)\}_{t \in (\!-\infty,\infty)}$ is partitioned as in (1.2) and the infinitesimal generator $\textbf{\textit{Q}}$ is partitioned as follows:

(2.3) \begin{equation}\textbf{\textit{Q}} = \kbordermatrix{&\mathbb{Z}_0^N &\quad \mathbb{Z}_{N+1}^{\infty}\\ \\[-5pt] \mathbb{Z}_0^N & \textbf{\textit{Q}}^{(1,1)}(N) &\quad \textbf{\textit{Q}}^{(1,2)}(N) \\ \\[-5pt] \mathbb{Z}_{N+1}^{\infty} & \textbf{\textit{Q}}^{(2,1)}(N) &\quad \textbf{\textit{Q}}^{(2,2)}(N)}.\end{equation}

We define $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N))$ as the collection of ergodic, continuous-time Markov chains on $\mathbb{Z}^+$ whose infinitesimal generators have the specific $(N+1) \times (N+1)$ northwest corner block $\textbf{\textit{Q}}^{(1,1)}(N)$ .

We first characterize the conditional stationary distribution $\boldsymbol{\pi}(N)$ of Markov chains in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N))$ . Owing to the ergodicity, $\textbf{\textit{Q}}^{(1,1)}(N)$ is nonsingular whether it is irreducible or not. We then define $\textbf{\textit{H}}(N)$ as

(2.4) \begin{align}\textbf{\textit{H}}(N)&=(\!-\textbf{\textit{Q}}^{(1,1)}(N))^{-1}.\end{align}

By definition, $\textbf{\textit{H}}(N) \geq \textbf{\textit{O}}$ and $\textbf{\textit{H}}(N) \textbf{\textit{e}} >\textbf{0}$ . Let $\overline{\textbf{\textit{H}}}(N)$ denote an $(N+1) \times (N+1)$ matrix obtained by normalizing each row of $\textbf{\textit{H}}(N)$ in such a way that $\overline{\textbf{\textit{H}}}(N) \textbf{\textit{e}} = \textbf{\textit{e}}$ :

\begin{equation*}\overline{\textbf{\textit{H}}}(N)=\textrm{diag}^{-1}(\textbf{\textit{H}}(N)\textbf{\textit{e}})\textbf{\textit{H}}(N),\end{equation*}

where for any $(n+1)$ -dimensional vector $\textbf{\textit{x}}$ , $\textrm{diag}(\textbf{\textit{x}})$ denotes an $(n+1)$ -dimensional diagonal matrix whose ith diagonal element is given by the ith element $[\textbf{\textit{x}}]_i$ of $\textbf{\textit{x}}$ . We also define $\Gamma(n)$ ( $n \in \mathbb{Z}^+$ ) as the set of all $(n+1)$ -dimensional probability vectors:

(2.5) \begin{align}\Gamma(n)&=\{\boldsymbol{\alpha} \in \mathbb{R}^{n+1};\quad \boldsymbol{\alpha}\geq\textbf{0},\boldsymbol{\alpha}\textbf{\textit{e}}=1\},\quad n \in \mathbb{Z}^+.\end{align}
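
To make (2.4) and the row normalization concrete, one can compute $\textbf{\textit{H}}(N)$ and $\overline{\textbf{\textit{H}}}(N)$ numerically for a small northwest corner block. Here is a sketch in NumPy using an illustrative birth-and-death block (the rates are hypothetical choices, not from the paper):

```python
import numpy as np

lam, mu, N = 1.0, 2.0, 3

# Northwest corner block Q^(1,1)(N) of a birth-and-death generator
# (up-rate lam, down-rate mu); row N leaks rate lam out of the block.
Q11 = np.diag([-lam] + [-(lam + mu)] * N)
Q11 += np.diag([lam] * N, k=1) + np.diag([mu] * N, k=-1)

H = np.linalg.inv(-Q11)                   # H(N) = (-Q^(1,1)(N))^{-1}
assert (H >= 0).all()                     # H(N) >= O
assert (H @ np.ones(N + 1) > 0).all()     # H(N) e > 0

# Row-normalize: Hbar(N) = diag^{-1}(H(N) e) H(N), so that Hbar(N) e = e
Hbar = H / (H @ np.ones(N + 1))[:, None]
print(np.allclose(Hbar.sum(axis=1), 1.0))  # True
```

The rows of `Hbar` are the probability vectors that span the simplex $\mathcal{P}(N)$ in Theorem 2.1 below.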

Theorem 2.1. For any Markov chain in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N))$ , we have

(2.6) \begin{align}\boldsymbol{\pi}(N) \in \mathcal{P}(N),\quad N \in \mathbb{Z}^+,\end{align}

where $\mathcal{P}(N)$ $(N \in \mathbb{Z}^+)$ denotes an N-simplex on the first orthant of $\mathbb{R}^{N+1}$ which is given by

(2.7) \begin{align}\mathcal{P}(N)&=\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}}(\!-\textbf{\textit{Q}}^{(1,1)}(N)) \geq \textbf{0},\ \textbf{\textit{x}}\textbf{\textit{e}} = 1\}\end{align}
(2.8) \begin{align}&=\big\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}}=\boldsymbol{\alpha} \overline{\textbf{\textit{H}}}(N),\boldsymbol{\alpha} \in \Gamma(N)\big\}.\end{align}

Proof. The starting point of the proof is the global balance equations (1.1) for $\boldsymbol{\pi}^{(1)}(N)$ :

(2.9) \begin{equation}\boldsymbol{\pi}^{(1)}(N) \textbf{\textit{Q}}^{(1,1)}(N)+\boldsymbol{\pi}^{(2)}(N) \textbf{\textit{Q}}^{(2,1)}(N)=\textbf{0}.\end{equation}

It then follows from (1.3) and (2.9) that

(2.10) \begin{align}\boldsymbol{\pi}(N) (\!-\textbf{\textit{Q}}^{(1,1)}(N))=\frac{\boldsymbol{\pi}^{(2)}(N)}{\boldsymbol{\pi}^{(1)}(N)\textbf{\textit{e}}}\cdot\textbf{\textit{Q}}^{(2,1)}(N).\end{align}

Because $\boldsymbol{\pi}^{(2)}(N)/\boldsymbol{\pi}^{(1)}(N)\textbf{\textit{e}}>\textbf{0}$ and $\textbf{\textit{Q}}^{(2,1)}(N) \geq \textbf{\textit{O}}$ , $\boldsymbol{\pi}(N)$ satisfies

\begin{equation*}\boldsymbol{\pi}(N)(\!-\textbf{\textit{Q}}^{(1,1)}(N)) \geq \textbf{0},\qquad\boldsymbol{\pi}(N)\textbf{\textit{e}} = 1,\end{equation*}

from which (2.6) with (2.7) follows.

Next, we show the equivalence between (2.7) and (2.8). We rewrite (2.7) as

\begin{align*}&\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}}(\!-\textbf{\textit{Q}}^{(1,1)}(N)) \geq \textbf{0},\ \textbf{\textit{x}}\textbf{\textit{e}} = 1\}\\&\quad =\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}}(\!-\textbf{\textit{Q}}^{(1,1)}(N)) = \textbf{\textit{y}},\ \textbf{\textit{y}} \geq \textbf{0},\ \textbf{\textit{x}}\textbf{\textit{e}} = 1\}\\&\quad =\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}} = \textbf{\textit{y}} \textbf{\textit{H}}(N),\ \textbf{\textit{y}} \geq \textbf{0},\ \textbf{\textit{x}}\textbf{\textit{e}} = 1\}\\&\quad =\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}} = \textbf{\textit{y}} \textrm{diag}(\textbf{\textit{H}}(N) \textbf{\textit{e}}) \overline{\textbf{\textit{H}}}(N),\ \textbf{\textit{y}} \geq \textbf{0},\ \textbf{\textit{x}}\textbf{\textit{e}} = 1\}.\end{align*}

Let $\boldsymbol{\alpha} = \textbf{\textit{y}} \textrm{diag}(\textbf{\textit{H}}(N)\textbf{\textit{e}})$ . Since $\textbf{\textit{x}} =\boldsymbol{\alpha} \overline{\textbf{\textit{H}}}(N)$ , we have $\textbf{\textit{x}}\textbf{\textit{e}} = \boldsymbol{\alpha}\textbf{\textit{e}}$ . Furthermore, if $\textbf{\textit{y}} \geq \textbf{0}$ , we have $\boldsymbol{\alpha} =\textbf{\textit{y}} \textrm{diag}(\textbf{\textit{H}}(N) \textbf{\textit{e}}) \geq \textbf{0}$ and vice versa, since $\textbf{\textit{H}}(N)\textbf{\textit{e}} > \textbf{0}$ . We thus have

(2.11) \begin{align}&\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}} = \textbf{\textit{y}} \textrm{diag}(\textbf{\textit{H}}(N) \textbf{\textit{e}}) \overline{\textbf{\textit{H}}}(N),\ \textbf{\textit{y}} \geq \textbf{0},\ \textbf{\textit{x}}\textbf{\textit{e}} = 1\}\nonumber\\&\quad =\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}} = \boldsymbol{\alpha} \overline{\textbf{\textit{H}}}(N),\boldsymbol{\alpha} \geq \textbf{0},\ \boldsymbol{\alpha}\textbf{\textit{e}} = 1\}.\end{align}

Because $\overline{\textbf{\textit{H}}}(N)$ is composed of $N+1$ linearly independent probability vectors, $\mathcal{P}(N)$ is an N-simplex on the first orthant of $\mathbb{R}^{N+1}$ . □

Theorem 2.1 implies that the conditional stationary distribution $\boldsymbol{\pi}(N)$ of any Markov chain in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N))$ is given by a convex combination of row vectors of $\overline{\textbf{\textit{H}}}(N)$ ; i.e., there exists $\boldsymbol{\alpha}^*(N)$ such that

(2.12) \begin{equation}\boldsymbol{\pi}(N) = \boldsymbol{\alpha}^*(N) \overline{\textbf{\textit{H}}}(N),\quad\boldsymbol{\alpha}^*(N) \in \Gamma(N).\end{equation}

Remark 2.1. Since $\overline{\textbf{\textit{H}}}(N)$ is nonsingular, there is a one-to-one correspondence between $\boldsymbol{\pi}(N)$ and $\boldsymbol{\alpha}^*(N)$ , i.e., $\boldsymbol{\alpha}^*(N) = \boldsymbol{\pi}(N) (\!-\textbf{\textit{Q}}^{(1,1)}(N))\textrm{diag}(\textbf{\textit{H}}(N)\textbf{\textit{e}})$ . Consequently, obtaining the conditional stationary distribution $\boldsymbol{\pi}(N)$ in an ergodic, continuous-time Markov chain is equivalent to obtaining the nonnegative weight vector $\boldsymbol{\alpha}^*(N)$ .
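
This correspondence can be checked numerically. The sketch below uses a hypothetical M/M/1 block (rates $\lambda=1$, $\mu=2$ are illustrative), recovering the weight vector as $\boldsymbol{\alpha}^*(N) = \boldsymbol{\pi}(N)(\!-\textbf{\textit{Q}}^{(1,1)}(N))\,\textrm{diag}(\textbf{\textit{H}}(N)\textbf{\textit{e}})$, the normalization used in the proof of Theorem 2.1; for this skip-free chain all the weight falls on state N, since the block is entered from above only through state N:

```python
import numpy as np

lam, mu, N = 1.0, 2.0, 4
rho = lam / mu

# Hypothetical M/M/1 northwest corner block and its normalized inverse
Q11 = np.diag([-lam] + [-(lam + mu)] * N)
Q11 += np.diag([lam] * N, k=1) + np.diag([mu] * N, k=-1)
H = np.linalg.inv(-Q11)
Hbar = H / (H @ np.ones(N + 1))[:, None]

# Exact conditional stationary distribution of M/M/1 (truncated geometric)
pi_N = rho ** np.arange(N + 1)
pi_N /= pi_N.sum()

# Recover the weight vector: alpha* = pi(N) (-Q^(1,1)(N)) diag(H(N) e)
alpha = (pi_N @ (-Q11)) * (H @ np.ones(N + 1))

print(np.allclose(alpha @ Hbar, pi_N))  # True: alpha* Hbar(N) = pi(N)
print(np.isclose(alpha.sum(), 1.0))     # True: alpha* is a probability vector
```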

The following corollary can be used to evaluate the accuracy of an approximation to $\boldsymbol{\pi}(N)$ .

Corollary 2.1. For an arbitrary probability vector $\textbf{\textit{x}} \in \Gamma(N)$ , we have

(2.13) \begin{equation}\| \textbf{\textit{x}} - \boldsymbol{\pi}(N) \|_1\leq\max_{i \in \mathbb{Z}_0^N}\| \textbf{\textit{x}} - \overline{\textbf{\textit{h}}}_i(N) \|_1,\end{equation}

where $\| \cdot \|_1$ stands for the $\ell_1$ norm and $\overline{\textbf{\textit{h}}}_i(N)$ ( $i \in \mathbb{Z}_0^N$ ) denotes the ith row vector of $\overline{\textbf{\textit{H}}}(N)$ . Furthermore, if $\textbf{\textit{x}} \in \mathcal{P}(N)$ , we have

(2.14) \begin{equation}\max_{i \in \mathbb{Z}_0^N}\| \textbf{\textit{x}} - \overline{\textbf{\textit{h}}}_i(N) \|_1\leq\max_{i,j \in \mathbb{Z}_0^N}\| \overline{\textbf{\textit{h}}}_j(N) - \overline{\textbf{\textit{h}}}_i(N) \|_1,\quad\textbf{\textit{x}} \in \mathcal{P}(N).\end{equation}

Proof. (2.13) immediately follows from Theorem 2.1 and the convexity of $\mathcal{P}(N)$ . Furthermore, if $\textbf{\textit{x}} \in \mathcal{P}(N)$ , there exists a probability vector $\boldsymbol{\beta}=(\beta_0\ \beta_1\ \cdots\ \beta_N) \in \Gamma(N)$ such that

\begin{equation*}\|\textbf{\textit{x}} - \overline{\textbf{\textit{h}}}_i(N) \|_1=\|\sum_{j \in \mathbb{Z}_0^N} \beta_j\bigl( \overline{\textbf{\textit{h}}}_j(N) - \overline{\textbf{\textit{h}}}_i(N) \bigr) \|_1\leq\sum_{j \in \mathbb{Z}_0^N} \beta_j\| \overline{\textbf{\textit{h}}}_j(N) - \overline{\textbf{\textit{h}}}_i(N) \|_1,\end{equation*}

from which (2.14) follows. □
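
The bound (2.13) is directly computable from $\overline{\textbf{\textit{H}}}(N)$ alone. A sketch, again with a hypothetical M/M/1 block (illustrative rates), comparing the true $\ell_1$ error of a crude candidate distribution against the bound:

```python
import numpy as np

lam, mu, N = 1.0, 2.0, 4
rho = lam / mu

# Hypothetical M/M/1 block, H(N), and row-normalized Hbar(N)
Q11 = np.diag([-lam] + [-(lam + mu)] * N)
Q11 += np.diag([lam] * N, k=1) + np.diag([mu] * N, k=-1)
H = np.linalg.inv(-Q11)
Hbar = H / (H @ np.ones(N + 1))[:, None]

# Exact conditional distribution, and a crude candidate x (uniform)
pi_N = rho ** np.arange(N + 1); pi_N /= pi_N.sum()
x = np.full(N + 1, 1.0 / (N + 1))

err = np.abs(x - pi_N).sum()                         # ||x - pi(N)||_1
bound = np.abs(x[None, :] - Hbar).sum(axis=1).max()  # right side of (2.13)
print(err <= bound + 1e-12)  # True
```

Note that the bound requires no knowledge of $\boldsymbol{\pi}(N)$ itself, only the rows of $\overline{\textbf{\textit{H}}}(N)$.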

We now provide a probabilistic interpretation of (2.12). We partition the state space $\mathbb{Z}^+$ into $\mathbb{Z}_0^N$ and $\mathbb{Z}_{N+1}^{\infty}$ and regard $\{X(t)\}_{t \in (\!-\infty,\infty)}$ as an alternating Markov renewal process. For an integer n, let $T_n^{(1)}$ (resp. $T_n^{(2)}$ ) denote the nth time instant at which $\{X(t)\}_{t \in (\!-\infty,\infty)}$ enters into $\mathbb{Z}_0^N$ from $\mathbb{Z}_{N+1}^{\infty}$ (resp. into $\mathbb{Z}_{N+1}^{\infty}$ from $\mathbb{Z}_0^N$ ), where we assume $T_n^{(1)} < T_n^{(2)} < T_{n+1}^{(1)}$ without loss of generality. We define I(t) ( $t \in (\!-\infty, \infty)$ ) as the state at the last renewal epoch before time t, i.e.,

\begin{equation*}I(t)=\begin{cases}X(T_n^{(1)}),&T_n^{(1)} \leq t < T_n^{(2)},\\ X(T_n^{(2)}),&T_n^{(2)} \leq t < T_{n+1}^{(1)}.\end{cases}\end{equation*}

We then consider the joint process $\{(X(t),I(t))\}_{t \in (\!-\infty,\infty)}$ , which is assumed to be stationary.

Corollary 2.2. For any Markov chain $\{X(t)\}_{t \in (\!-\infty,\infty)}$ in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N))$ , the (i,j)th $(i,j \in \mathbb{Z}_0^N)$ element of $\overline{\textbf{\textit{H}}}(N)$ is given by

(2.15) \begin{equation}[\overline{\textbf{\textit{H}}}(N)]_{i,j}=\mathbb{P}(X(0) = j\mid I(0) =i),\quad i,j \in \mathbb{Z}_0^N,\end{equation}

and the ith $(i \in \mathbb{Z}_0^N)$ element of $\boldsymbol{\alpha}^*(N)$ is given by

(2.16) \begin{equation}[\boldsymbol{\alpha}^*(N)]_i=\mathbb{P}(I(0) = i \mid X(0) \in \mathbb{Z}_0^N).\end{equation}

We thus interpret (2.12) as

(2.17) \begin{equation}[\boldsymbol{\pi}(N)]_j=\sum_{i \in \mathbb{Z}_0^N}\mathbb{P}(I(0) = i \mid X(0) \in \mathbb{Z}_0^N)\cdot\mathbb{P}(X(0) = j \mid I(0) =i),\quad j \in \mathbb{Z}_0^N.\end{equation}

The proof of Corollary 2.2 is given in Appendix A. Note that for a given $\textbf{\textit{Q}}^{(1,1)}(N)$ , $\mathcal{P}(N)$ in Theorem 2.1 may not be tight in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N))$ . For example, $\mathcal{P}(N)$ may include $\textbf{\textit{x}} \in\mathbb{R}^{N+1}$ such that $[\textbf{\textit{x}}]_i = 0$ for some $i \in \mathbb{Z}_0^N$ , whereas $\boldsymbol{\pi}(N) > \textbf{0}$ since Markov chains in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N))$ are ergodic. In the next subsection, we eliminate redundancy in $\mathcal{P}(N)$ using the structural information $\mathcal{J}(N)$ in (1.4).

2.3. Characterization of $\boldsymbol{\pi}(\textbf{\textit{N}})$ in terms of $\textbf{\textit{Q}}^{(1,1)}(\textbf{\textit{N}})$ and $\mathcal{J}(\textbf{\textit{N}})$

For specific $\textbf{\textit{Q}}^{(1,1)}(N)$ and $\mathcal{J}(N)$ , we define $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N), \mathcal{J}(N))$ as the collection of ergodic, continuous-time Markov chains on $\mathbb{Z}^+$ whose infinitesimal generators have $\textbf{\textit{Q}}^{(1,1)}(N)$ and $\mathcal{J}(N)$ . By definition,

\begin{equation*}\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N))=\bigcup_{\mathcal{J}(N) \in 2^{\mathbb{Z}_0^N} \setminus \{\emptyset\}}\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N), \mathcal{J}(N)).\end{equation*}

Note here that $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N), \mathcal{J}(N))$ may be empty for some pairs of $\textbf{\textit{Q}}^{(1,1)}(N)$ and $\mathcal{J}(N)$ . For example, if $\textbf{\textit{Q}}^{(1,1)}(N)$ is a diagonal matrix, $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N),\mathcal{J}(N)) = \emptyset$ unless $\mathcal{J}(N) = \mathbb{Z}_0^N$ , owing to ergodicity. In the rest of this subsection, we assume that if $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N),\mathcal{J}(N)) \neq \emptyset$ , then $\mathcal{J}(N)=\{0,1,\dots,|\mathcal{J}(N)|-1\}$ without loss of generality.

If $\mathcal{J}(N) \neq \mathbb{Z}_0^N$ , we partition $\textbf{\textit{Q}}^{(1,1)}(N)$ and $\textbf{\textit{Q}}^{(2,1)}(N)$ in (2.3) into two matrices:

(2.18) \begin{align}\begin{pmatrix}\textbf{\textit{Q}}^{(1,1)}(N)\\ \\[-5pt] \textbf{\textit{Q}}^{(2,1)}(N)\end{pmatrix}=\kbordermatrix{& \mathcal{J}(N) & \mathbb{Z}_0^N \setminus \mathcal{J}(N)\\ \\[-5pt] \mathbb{Z}_0^N & \textbf{\textit{Q}}_+^{(1,1)}(N) &\quad \textbf{\textit{Q}}_0^{(1,1)}(N)\\ \\[-5pt] \mathbb{Z}_{N+1}^{\infty} & \textbf{\textit{Q}}_+^{(2,1)}(N) &\quad \textbf{\textit{O}}}.\end{align}

(2.10) is then rewritten as

(2.19) \begin{align}\boldsymbol{\pi}(N)\cdot\kbordermatrix{& \mathcal{J}(N) &\quad \mathbb{Z}_0^N \setminus \mathcal{J}(N)\\ \\[-5pt]& -\textbf{\textit{Q}}_+^{(1,1)}(N)&\quad -\textbf{\textit{Q}}_0^{(1,1)}(N)}=\kbordermatrix{& \mathcal{J}(N)&\quad \hbox to 7mm{\hss$\scriptstyle \mathbb{Z}_0^N \setminus \mathcal{J}(N)$\hss}\\ \\[-5pt]&\displaystyle\frac{\boldsymbol{\pi}^{(2)}(N)}{\boldsymbol{\pi}^{(1)}(N)\textbf{\textit{e}}} \cdot \textbf{\textit{Q}}_+^{(2,1)}(N)&\quad \textbf{0}}.\end{align}

We define $\Gamma^+(N)$ as

(2.20) \begin{equation}\Gamma^+(N)=\bigl\{\boldsymbol{\alpha} \in \mathbb{R}^{N+1};\ \boldsymbol{\alpha} \geq \textbf{0},\boldsymbol{\alpha}\textbf{\textit{e}}=1,\ [\boldsymbol{\alpha}]_i=0\ (i \in \mathbb{Z}_0^N\setminus\mathcal{J}(N))\bigr\}.\end{equation}

Theorem 2.2. Suppose $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N), \mathcal{J}(N)) \neq \emptyset$ . For any Markov chain in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N), \mathcal{J}(N))$ , we have

(2.21) \begin{align}\boldsymbol{\pi}(N) \in \textrm{ri}\, \mathcal{P}^+(N),\quad N \in \mathbb{Z}^+,\end{align}

where $\mathcal{P}^+(N)$ $(N \in \mathbb{Z}^+)$ denotes a $(|\mathcal{J}(N)|-1)$ -simplex on the first orthant of $\mathbb{R}^{N+1}$ which is given by $\mathcal{P}^+(N) = \mathcal{P}(N)$ if $\mathcal{J}(N) = \mathbb{Z}_0^N$ , and by

(2.22) \begin{align}\mathcal{P}^+(N)&=\big\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}}(\!-\textbf{\textit{Q}}_+^{(1,1)}(N)) \geq \textbf{0},\ \textbf{\textit{x}}(\!-\textbf{\textit{Q}}_0^{(1,1)}(N)) = \textbf{0},\ \textbf{\textit{x}}\textbf{\textit{e}} = 1\big\}\end{align}
(2.23) \begin{align}&\hspace*{-95pt}=\big\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}}=\boldsymbol{\alpha}\overline{\textbf{\textit{H}}}(N),\boldsymbol{\alpha} \in \Gamma^+(N)\big\}\end{align}

otherwise.

Proof. If $\mathcal{J}(N)=\mathbb{Z}_0^N$ , we have $\boldsymbol{\pi}^{(2)}(N) \textbf{\textit{Q}}^{(2,1)}(N) >\textbf{0}$ since $\boldsymbol{\pi}^{(2)}(N) > \textbf{0}$ . Theorem 2.2 for $\mathcal{J}(N)=\mathbb{Z}_0^N$ then follows from (2.2), (2.10), and Theorem 2.1. On the other hand, if $\mathcal{J}(N) \neq \mathbb{Z}_0^N$ , then $\boldsymbol{\pi}^{(2)}(N) \textbf{\textit{Q}}_+^{(2,1)}(N) > \textbf{0}$ . In this case, (2.21) with (2.22) follows from (2.2) and (2.19). The equivalence between (2.22) and (2.23) for $\mathcal{J}(N) \neq \mathbb{Z}_0^N$ can be shown in a way similar to the proof of Theorem 2.1, as shown in Appendix B. Since $\overline{\textbf{\textit{H}}}(N)$ is nonnegative and nonsingular, $\mathcal{P}^+(N)$ is a $(|\mathcal{J}(N)|-1)$ -simplex on the first orthant of $\mathbb{R}^{N+1}$ . □

By definition, we have $\Gamma^+(N)\subseteq \Gamma(N)$ , and therefore

(2.24) \begin{equation}\mathcal{P}^+(N) \subseteq \mathcal{P}(N),\end{equation}

where $\mathcal{P}^+(N) = \mathcal{P}(N)$ iff $\mathcal{J}(N) = \mathbb{Z}_0^N$. Theorem 2.2 indicates the importance of the structural information $\mathcal{J}(N)$ about direct transitions from $\mathbb{Z}_{N+1}^{\infty}$ to $\mathbb{Z}_0^N$. For example, if $\mathcal{J}(N)$ is a singleton for a certain N, say, $\mathcal{J}(N)=\{i\}$ ( $i \in \mathbb{Z}_0^N$ ), $\boldsymbol{\pi}(N)$ is given by the ith row vector of $\overline{\textbf{\textit{H}}}(N)$ , as pointed out in [Reference Zhao and Liu16, Corollary 3]. Numerical algorithms for block-structured Markov chains of level-dependent M/G/1-type in [Reference Kimura and Takine5, Reference Takine15] also utilize (2.21) with (2.22) implicitly.

The corollary below follows from Theorem 2.2; its proof is omitted because it is almost the same as that of Corollary 2.1.

Corollary 2.3. Suppose $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N), \mathcal{J}(N)) \neq \emptyset$ . For an arbitrary probability vector $\textbf{\textit{x}} \in \Gamma(N)$ , we have

\begin{equation*}\| \textbf{\textit{x}} - \boldsymbol{\pi}(N) \|_1\leq\max_{i \in \mathcal{J}(N)}\| \textbf{\textit{x}} - \overline{\textbf{\textit{h}}}_i(N) \|_1.\end{equation*}

Furthermore, if $\textbf{\textit{x}} \in \mathcal{P}^+(N)$ , we have

\begin{equation*}\max_{i \in \mathcal{J}(N)}\| \textbf{\textit{x}} - \overline{\textbf{\textit{h}}}_i(N) \|_1\leq\max_{i,j \in \mathcal{J}(N)}\| \overline{\textbf{\textit{h}}}_j(N) - \overline{\textbf{\textit{h}}}_i(N) \|_1,\quad\textbf{\textit{x}} \in \mathcal{P}^+(N).\end{equation*}

Example 2.1. Consider an ergodic, level-dependent M ${}^{\text{X}}\!$ /M/1 queue with disasters, whose infinitesimal generator $\textbf{\textit{Q}}$ takes the following form:

\begin{equation}\textbf{\textit{Q}}=\left(\begin{array}{c@{\quad}c@{\quad}c@{\quad}c@{\quad}c@{\quad}c@{\quad}c}-q_0 & q_{0,1} & q_{0,2} & q_{0,3} & q_{0,4} & q_{0,5} & \dots\\ \\[-7pt] q_{1,0} & -q_1 & q_{1,2} & q_{1,3} & q_{1,4} & q_{1,5} & \dots\\ \\[-7pt] q_{2,0} & q_{2,1} & -q_2 & q_{2,3} & q_{2,4} & q_{2,5} & \dots\\ \\[-7pt] q_{3,0} & 0 & q_{3,2} & -q_3 & q_{3,4} & q_{3,5} & \dots\\ \\[-7pt] q_{4,0} & 0 & 0 & q_{4,3} & -q_4 & q_{4,5} & \dots\\ \\[-7pt] q_{5,0} & 0 & 0 & 0 & q_{5,4} & -q_5 & \dots\\ \\[-7pt] \vdots & \vdots & \vdots & \vdots & \vdots & \vdots & \ddots\end{array}\right),\notag\end{equation}

where $q_{i,0} > 0$ and $q_{i,i-1} > 0$ ( $i \in \mathbb{Z}^+ \setminus\{0\}$ ). We then have

\begin{equation*}\mathcal{J}(N) = \{0,N\},\end{equation*}

because the state immediately after a downward transition from state i ( $i \geq N+1$ ) is either state 0 or state $i-1$ . It then follows that

\begin{equation*}\Gamma^+(N)=\bigl\{\bigl(\alpha_0\ \ 0\ \ 0\ \ \cdots\ \ 0\ \ 1-\alpha_0\bigr) \in \mathbb{R}^{N+1};\ \alpha_0 \in [0,1]\bigr\},\end{equation*}

and

\begin{equation*}\boldsymbol{\pi}(N) \in \textrm{ri}\, \mathcal{P}^+(N)=\bigl\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}}=\alpha_0 \overline{\textbf{\textit{h}}}_0(N) + (1-\alpha_0) \overline{\textbf{\textit{h}}}_N(N),\,\alpha_0 \in (0,1)\bigr\}.\end{equation*}

Note here that $\overline{\textbf{\textit{h}}}_i(N)$ ( $i \in \mathbb{Z}_0^N$ ) can be obtained by (i) solving $\textbf{\textit{x}}_i (\!-\textbf{\textit{Q}}^{(1,1)}(N)) = \textbf{\textit{e}}_i(N)$ for $\textbf{\textit{x}}_i \in \mathbb{R}^{N+1}$ and (ii) letting $\overline{\textbf{\textit{h}}}_i(N)=\textbf{\textit{x}}_i/\textbf{\textit{x}}_i\textbf{\textit{e}}$ , where $\textbf{\textit{e}}_i(N) \in \mathbb{R}^{N+1}$ denotes a $1 \times (N+1)$ unit vector whose ith element is equal to one. The simplest way to obtain an approximation $\boldsymbol{\pi}^{\textrm{approx}}(N) \in\textrm{ri}\, \mathcal{P}^+(N)$ to $\boldsymbol{\pi}(N)$ might be to solve $\textbf{\textit{x}}(\!-\textbf{\textit{Q}}^{(1,1)}(N)) = \beta_0 \textbf{\textit{e}}_0(N)+ (1-\beta_0) \textbf{\textit{e}}_N(N)$ for an arbitrary $\beta_0 \in (0,1)$ and let $\boldsymbol{\pi}^{\textrm{approx}}(N) = \textbf{\textit{x}}/\textbf{\textit{x}}\textbf{\textit{e}}$ .
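This procedure can be sketched numerically as follows. The $4 \times 4$ block below is a hypothetical illustration (an arbitrary strictly substochastic generator block with $N=3$), not data from the paper:

```python
import numpy as np

# Hypothetical 4x4 northwest corner block Q^{(1,1)}(N) with N = 3.
# Each row sums to a negative value (some rate escapes to levels > N),
# so -Q^{(1,1)}(N) is a nonsingular M-matrix.
Q11 = np.array([
    [-5.0, 2.0, 1.0, 1.0],
    [ 2.0, -6.0, 2.0, 1.0],
    [ 1.0, 2.0, -6.0, 2.0],
    [ 1.0, 0.0, 2.0, -6.0],
])
N = Q11.shape[0] - 1

def h_bar(i):
    """Steps (i)-(ii): solve x_i (-Q^{(1,1)}(N)) = e_i(N), then normalize."""
    e_i = np.zeros(N + 1)
    e_i[i] = 1.0
    # The row-vector system x (-Q11) = e_i is the transposed system
    # (-Q11)^T x^T = e_i^T.
    x = np.linalg.solve(-Q11.T, e_i)
    return x / x.sum()

h0, hN = h_bar(0), h_bar(N)

# Approximation in ri P^+(N): solve x (-Q^{(1,1)}(N)) =
# beta0 e_0(N) + (1 - beta0) e_N(N) for some beta0 in (0,1), then normalize.
beta0 = 0.5
rhs = np.zeros(N + 1)
rhs[0], rhs[N] = beta0, 1.0 - beta0
x = np.linalg.solve(-Q11.T, rhs)
pi_approx = x / x.sum()
```

Each call requires only one linear solve with $-\textbf{\textit{Q}}^{(1,1)}(N)$, so a single LU factorization can be reused across all right-hand sides.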

We now show that $\mathcal{P}^+(N)$ is tight in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N),\mathcal{J}(N))$ .

Lemma 2.1. If $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N), \mathcal{J}(N)) \neq \emptyset$ and $\textbf{\textit{x}} \in\textrm{ri}\, \mathcal{P}^+(N)$ , we have $\textbf{\textit{x}} > \textbf{0}$ .

Proof. We assume $[\textbf{\textit{x}}]_{j^*} = 0$ for some $j^* \in \mathbb{Z}_0^N$ and derive a contradiction. Since $\textbf{\textit{x}} = \boldsymbol{\alpha}(\textbf{\textit{x}})\overline{\textbf{\textit{H}}}(N)$ for some $\boldsymbol{\alpha}(\textbf{\textit{x}}) \in\textrm{ri}\, \Gamma^+(N)$ , $[\textbf{\textit{x}}]_{j^*} = 0$ is equivalent to $[\boldsymbol{\alpha}(\textbf{\textit{x}}) \overline{\textbf{\textit{H}}}(N)]_{j^*} = 0$ for some $\boldsymbol{\alpha}(\textbf{\textit{x}}) \in \textrm{ri}\, \Gamma^+(N)$ . This implies that $[\overline{\textbf{\textit{H}}}(N)]_{i,j^*} = 0$ for all $i \in \mathcal{J}(N)$ since $[\boldsymbol{\alpha}(\textbf{\textit{x}})]_i > 0$ for all $i \in \mathcal{J}(N)$ and $\overline{\textbf{\textit{H}}}(N) \geq \textbf{\textit{O}}$ . It then follows from (2.15) that $\mathbb{P}(X(0) = j^* \mid I(0) \in \mathcal{J}(N)) = 0$ for any Markov chain in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N),\mathcal{J}(N))$ . Furthermore, $\mathbb{P}(I(0) \in \mathbb{Z}_0^N \setminus \mathcal{J}(N)) = 0$ and $\mathbb{P}(X(0) = j^* \mid I(0) \in \mathbb{Z}_{N+1}^{\infty}) = 0$ by definition. We thus have

\begin{align*}\mathbb{P}(X(0)=j^*)&=\mathbb{P}(I(0) \in \mathcal{J}(N)) \mathbb{P}(X(0)=j^* \mid I(0) \in \mathcal{J}(N))\\[3pt] & \qquad\qquad {}+\mathbb{P}(I(0) \in \mathbb{Z}_{N+1}^{\infty})\mathbb{P}(X(0) = j^* \mid I(0) \in \mathbb{Z}_{N+1}^{\infty})\\[3pt] &= 0,\end{align*}

for any Markov chain in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N), \mathcal{J}(N))$ , which contradicts the ergodicity. □

Theorem 2.3. Suppose $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N), \mathcal{J}(N)) \neq \emptyset$ . For any $\textbf{\textit{x}} \in \textrm{ri}\, \mathcal{P}^+(N)$ , there exists a Markov chain in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N), \mathcal{J}(N))$ whose $\boldsymbol{\pi}(N)$ is given by $\textbf{\textit{x}}$ .

Proof. It follows from (2.24) that if $\textbf{\textit{x}} \in \textrm{ri}\, \mathcal{P}^+(N)$ , we have $\textbf{\textit{x}} \in \mathcal{P}(N)$ , i.e., $\textbf{\textit{x}} (\!-\textbf{\textit{Q}}^{(1,1)}(N)) \geq\textbf{0}$ and $\textbf{\textit{x}}\textbf{\textit{e}}=1$ . Note also that $\textbf{\textit{x}}(\!-\textbf{\textit{Q}}^{(1,1)}(N)) \neq \textbf{0}$ , because if $\textbf{\textit{x}}(\!-\textbf{\textit{Q}}^{(1,1)}(N)) = \textbf{0}$ held, we would have $\textbf{\textit{x}}= \textbf{0}\cdot (\!-\textbf{\textit{Q}}^{(1,1)}(N))^{-1} = \textbf{0}$ , which contradicts $\textbf{\textit{x}}\textbf{\textit{e}}=1$ . We then define $\boldsymbol{\zeta}(\textbf{\textit{x}})$ as

(2.25) \begin{equation}\boldsymbol{\zeta}(\textbf{\textit{x}})=\frac{\textbf{\textit{x}} (\!-\textbf{\textit{Q}}^{(1,1)}(N))}{\textbf{\textit{x}} (\!-\textbf{\textit{Q}}^{(1,1)}(N))\textbf{\textit{e}}}.\end{equation}

It is easy to see that $\boldsymbol{\zeta}(\textbf{\textit{x}}) \in \textrm{ri}\, \Gamma^+(N)$ for $\textbf{\textit{x}} \in \textrm{ri}\, \mathcal{P}^+(N)$ . For an arbitrarily chosen $\textbf{\textit{x}} \in\textrm{ri}\, \mathcal{P}^+(N)$ , we consider a Markov chain whose infinitesimal generator $\textbf{\textit{Q}}$ is given by

(2.26) \begin{equation}\textbf{\textit{Q}}= \kbordermatrix{&\mathbb{Z}_0^N & \mathbb{Z}_{N+1}^{\infty}\\ \\[-7pt]\mathbb{Z}_0^N & \textbf{\textit{Q}}^{(1,1)}(N) &\quad \textbf{\textit{Q}}^{(1,2)}(N)\\ \\[-7pt] \mathbb{Z}_{N+1}^{\infty}& (\!-\textbf{\textit{Q}}^{(2,2)}(N))\textbf{\textit{e}} \boldsymbol{\zeta}(\textbf{\textit{x}}) &\quad \textbf{\textit{Q}}^{(2,2)}(N)}.\end{equation}

The global balance equations are then given by

(2.27) \begin{align}\boldsymbol{\pi}^{(1)}(N) \textbf{\textit{Q}}^{(1,1)}(N)+\boldsymbol{\pi}^{(2)}(N) (\!-\textbf{\textit{Q}}^{(2,2)}(N))\textbf{\textit{e}} \boldsymbol{\zeta}(\textbf{\textit{x}}) &= \textbf{0},\end{align}
(2.28) \begin{align}\hspace*{40pt}\boldsymbol{\pi}^{(1)}(N) \textbf{\textit{Q}}^{(1,2)}(N) + \boldsymbol{\pi}^{(2)}(N) \textbf{\textit{Q}}^{(2,2)}(N)&= \textbf{0}.\end{align}

From (2.25) and (2.27), we observe that the special form of $\textbf{\textit{Q}}^{(2,1)}(N) =(\!-\textbf{\textit{Q}}^{(2,2)}(N))\textbf{\textit{e}} \boldsymbol{\zeta}(\textbf{\textit{x}})$ ensures $\boldsymbol{\pi}^{(1)}(N) = c_1 \textbf{\textit{x}}$ for some positive constant $c_1$ . We now set

(2.29) \begin{equation}\textbf{\textit{Q}}^{(1,2)}(N) = (\!-\textbf{\textit{Q}}^{(1,1)}(N))\textbf{\textit{e}} \textbf{\textit{z}},\qquad\textbf{\textit{Q}}^{(2,2)}(N) = -\textbf{\textit{I}},\end{equation}

where $\textbf{\textit{z}}$ denotes a $1 \times \infty$ positive probability vector. Equation (2.28) is then reduced to

(2.30) \begin{equation}\boldsymbol{\pi}^{(1)}(N) (\!-\textbf{\textit{Q}}^{(1,1)}(N))\textbf{\textit{e}}\textbf{\textit{z}}- \boldsymbol{\pi}^{(2)}(N)= \textbf{0},\end{equation}

which indicates that (2.29) ensures $\boldsymbol{\pi}^{(2)}(N) = c_2 \textbf{\textit{z}} > \textbf{0}$ for some positive constant $c_2$ . In fact, solving (2.27) and (2.30) with $\boldsymbol{\pi}^{(1)}(N)\textbf{\textit{e}} +\boldsymbol{\pi}^{(2)}(N)\textbf{\textit{e}} = 1$ , we obtain

\begin{equation*}\boldsymbol{\pi}=\begin{pmatrix}\boldsymbol{\pi}^{(1)}(N) &\quad \boldsymbol{\pi}^{(2)}(N)\end{pmatrix}=\frac{1}{1+\textbf{\textit{x}}(\!-\textbf{\textit{Q}}^{(1,1)}(N))\textbf{\textit{e}}}\begin{pmatrix}\textbf{\textit{x}} &\quad \textbf{\textit{x}}(\!-\textbf{\textit{Q}}^{(1,1)}(N))\textbf{\textit{e}} \cdot \textbf{\textit{z}}\end{pmatrix}> \textbf{0}.\end{equation*}

We thus conclude that the Markov chain with the infinitesimal generator $\textbf{\textit{Q}}$ defined by (2.26) and (2.29) is a member of $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N),\mathcal{J}(N))$ and it has $\boldsymbol{\pi}(N) = \textbf{\textit{x}}$ . □
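The construction in this proof can be checked numerically on a finite-tail analogue. The paper's construction uses an infinite tail with $\textbf{\textit{Q}}^{(2,2)}(N)=-\textbf{\textit{I}}$ and a positive probability vector $\textbf{\textit{z}}$; the sketch below uses a two-state tail, for which the same algebra goes through. The $2 \times 2$ block and the target $\textbf{\textit{x}}$ are hypothetical:

```python
import numpy as np

# Hypothetical 2x2 block Q^{(1,1)}(N) with N = 1; rows strictly substochastic.
Q11 = np.array([[-3.0, 1.0],
                [ 1.0, -3.0]])
n = Q11.shape[0]

# Target x with x (-Q^{(1,1)}(N)) > 0 and x e = 1, i.e. x in ri P(N).
x = np.array([0.6, 0.4])
zeta_x = x @ (-Q11)
zeta_x = zeta_x / zeta_x.sum()          # zeta(x), cf. (2.25)

# Generator (2.26) with the choices (2.29): Q^{(1,2)} = (-Q^{(1,1)}) e z,
# Q^{(2,2)} = -I, and z a positive probability vector on the tail states.
z = np.array([0.3, 0.7])
deficit = (-Q11) @ np.ones(n)           # (-Q^{(1,1)}(N)) e
Q12 = np.outer(deficit, z)
Q22 = -np.eye(2)
Q21 = np.outer((-Q22) @ np.ones(2), zeta_x)
Q = np.block([[Q11, Q12], [Q21, Q22]])

# Stationary distribution of the 4-state chain: pi Q = 0, pi e = 1,
# solved by replacing one balance equation with the normalization.
A = Q.T.copy()
A[-1, :] = 1.0
b = np.zeros(4)
b[-1] = 1.0
pi = np.linalg.solve(A, b)

# The conditional stationary distribution pi(N) recovers the target x.
pi_N = pi[:n] / pi[:n].sum()
```

Here `pi_N` agrees with `x` up to solver precision, mirroring the conclusion $\boldsymbol{\pi}(N) = \textbf{\textit{x}}$.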

Example 2.2. (Example 2.1 continued.) Consider $\textbf{\textit{Q}}$ in Example 2.1 of Section 2.3. Theorem 2.2 implies that there exists $\alpha_0 = \alpha_0^* \in (0,1)$ such that $\boldsymbol{\pi}(N) = \alpha_0^* \overline{\textbf{\textit{h}}}_0(N) + (1-\alpha_0^*)\overline{\textbf{\textit{h}}}_N(N)$ . As stated in Theorem 2.3, however, $\textrm{ri}\, \mathcal{P}^+(N)$ is tight in $\mathcal{M}( \textbf{\textit{Q}}^{(1,1)}(N), \mathcal{J}(N))$ . Therefore, we cannot identify the exact $\alpha_0^* \in (0, 1)$ if only $\textbf{\textit{Q}}^{(1,1)}(N)$ and $\mathcal{J}(N)=\{0,N\}$ are available.

2.4. Implications for the augmented truncation approximation

We provide some practical implications of the results in Sections 2.2 and 2.3 for the augmented truncation approximation (ATA). As mentioned in Section 1, an ATA solution $\boldsymbol{\pi}^{\textrm{trunc}}(N)$ to $\boldsymbol{\pi}(N)$ is obtained by

(2.31) \begin{equation}\boldsymbol{\pi}^{\textrm{trunc}}(N) [\textbf{\textit{Q}}^{(1,1)}(N)+ \textbf{\textit{Q}}_{\textrm{A}}(N)]=\textbf{0},\quad\boldsymbol{\pi}^{\textrm{trunc}}(N)\textbf{\textit{e}}=1,\end{equation}

where $\textbf{\textit{Q}}_{\textrm{A}}(N)$ denotes an augmentation matrix, i.e., an $(N+1) \times (N+1)$ nonnegative matrix satisfying $\textbf{\textit{Q}}_{\textrm{A}}(N)\textbf{\textit{e}} = (\!-\textbf{\textit{Q}}^{(1,1)}(N))\textbf{\textit{e}}$. The central topic in the literature on the ATA is the convergence of $\boldsymbol{\pi}^{\textrm{approx},N}=(\boldsymbol{\pi}^{\textrm{trunc}}(N)\ \ \textbf{0})$ to the stationary distribution $\boldsymbol{\pi}$ as N goes to infinity. Our interest here is different: we ask what kind of $\textbf{\textit{Q}}_{\textrm{A}}(N)$ is reasonable for a given N.
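As a sketch of (2.31), the fragment below solves the ATA equations for a hypothetical $3 \times 3$ block and a non-rank-one augmentation matrix; both are illustrative choices, not data from the paper:

```python
import numpy as np

# Hypothetical truncated block Q^{(1,1)}(N) with N = 2 (rows substochastic).
Q11 = np.array([
    [-3.0, 1.0, 1.0],
    [ 1.0, -3.0, 1.0],
    [ 0.5, 1.0, -3.0],
])
n = Q11.shape[0]
deficit = (-Q11) @ np.ones(n)           # (-Q^{(1,1)}(N)) e

# An augmentation matrix of rank > 1: each state i redistributes its
# escaping rate according to its own return distribution P_return[i].
P_return = np.array([[1.0, 0.0, 0.0],
                     [0.0, 1.0, 0.0],
                     [0.5, 0.0, 0.5]])
Q_A = deficit[:, None] * P_return       # Q_A >= 0 and Q_A e = (-Q11) e

G = Q11 + Q_A                           # conservative generator of the ATA chain
# Solve pi_trunc G = 0, pi_trunc e = 1 by replacing one balance equation
# with the normalization condition.
A = G.T.copy()
A[-1, :] = 1.0
b = np.zeros(n)
b[-1] = 1.0
pi_trunc = np.linalg.solve(A, b)
```

Since $\boldsymbol{\pi}^{\textrm{trunc}}(N)(\!-\textbf{\textit{Q}}^{(1,1)}(N)) = \boldsymbol{\pi}^{\textrm{trunc}}(N)\textbf{\textit{Q}}_{\textrm{A}}(N) \geq \textbf{0}$, the solution lies in $\mathcal{P}(N)$, consistent with Lemma 2.2 below.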

Although $\textbf{\textit{Q}}_{\textrm{A}}(N)$ is usually chosen in such a way that $[\textbf{\textit{Q}}^{(1,1)}(N)+ \textbf{\textit{Q}}_{\textrm{A}}(N)]$ is irreducible, we allow reducible $[\textbf{\textit{Q}}^{(1,1)}(N)+ \textbf{\textit{Q}}_{\textrm{A}}(N)]$ as well. Note that the error bound for the approximation $\boldsymbol{\pi}^{\textrm{approx},N} =(\boldsymbol{\pi}^{\textrm{trunc}}(N)\ \ \textbf{0})$ to the stationary distribution $\boldsymbol{\pi}$ is given by

\begin{equation*}2 \mathbb{P}(X(0) > N)\leq\| \boldsymbol{\pi}^{\textrm{approx},N} - \boldsymbol{\pi} \|_1\leq\epsilon^{\textrm{trunc}}(N) + 2 \mathbb{P}(X(0) > N)\end{equation*}

(see [Reference Kimura and Takine5]), where $\epsilon^{\textrm{trunc}}(N)$ denotes an upper bound of $\| \boldsymbol{\pi}^{\textrm{trunc}}(N) - \boldsymbol{\pi}(N)\|_1$ :

\begin{equation*}\| \boldsymbol{\pi}^{\textrm{trunc}}(N) - \boldsymbol{\pi}(N)\|_1 \leq \epsilon^{\textrm{trunc}}(N).\end{equation*}

It is clear that $\mathbb{P}(X(0) > N)$ is a decreasing function of N and that for a given N, $\boldsymbol{\pi}^{\textrm{approx},N}$ will be the best approximation to $\boldsymbol{\pi}$ when $\boldsymbol{\pi}^{\textrm{trunc}}(N) = \boldsymbol{\pi}(N)$ (i.e., $\epsilon^{\textrm{trunc}}(N) = 0$ ) [Reference Zhao and Liu16]. In what follows, we first consider Markov chains in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N))$ and discuss some implications of our results in Section 2.2 for the ATA.

We define $\mathcal{T}(N)$ as the set of all possible ATA solutions satisfying (2.31):

\begin{equation*}\mathcal{T}(N)=\bigl\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}}\big[\textbf{\textit{Q}}^{(1,1)}(N)+\textbf{\textit{Q}}_{\textrm{A}}(N)\big]=\textbf{0},\ \textbf{\textit{x}}\textbf{\textit{e}}=1,\ \textbf{\textit{Q}}_{\textrm{A}}(N) \in \mathcal{A}(N)\bigr\},\end{equation*}

where $\mathcal{A}(N)$ denotes the set of all possible augmentation matrices,

\begin{equation*}\mathcal{A}(N)=\bigl\{\textbf{\textit{X}} \in \mathbb{R}^{(N+1)\times(N+1)};\ \textbf{\textit{X}}\geq\textbf{\textit{O}},\ \textbf{\textit{X}}\textbf{\textit{e}}=(\!-\textbf{\textit{Q}}^{(1,1)}(N))\textbf{\textit{e}}\bigr\}.\end{equation*}

In the literature, the ATA is called linear if the rank of $\textbf{\textit{Q}}_{\textrm{A}}(N)$ is equal to one [Reference Gibson and Seneta2]. We then define $\mathcal{T}_{\textrm{L}}(N)$ as the set of all possible approximations obtained by the linear ATA:

\begin{equation*}\mathcal{T}_{\textrm{L}}(N)=\bigl\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}}\big[\textbf{\textit{Q}}^{(1,1)}(N)+\textbf{\textit{Q}}_{\textrm{A}}(N)\big] = \textbf{0},\ \textbf{\textit{x}}\textbf{\textit{e}}=1,\ \textbf{\textit{Q}}_{\textrm{A}}(N) \in \mathcal{A}_{\textrm{L}}(N)\bigr\},\end{equation*}

where $\mathcal{A}_{\textrm{L}}(N)$ denotes the set of all possible linear augmentation matrices,

\begin{equation*}\mathcal{A}_{\textrm{L}}(N)=\bigl\{\textbf{\textit{X}} \in \mathbb{R}^{(N+1)\times(N+1)};\ \textbf{\textit{X}} = (\!-\textbf{\textit{Q}}^{(1,1)}(N))\textbf{\textit{e}} \boldsymbol{\zeta},\boldsymbol{\zeta} \in \Gamma(N)\bigr\}.\end{equation*}

Recall that $\Gamma(N)$ is the set of all $1 \times (N+1)$ probability vectors, which is defined in (2.5).

Lemma 2.2. For any Markov chain in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N))$ , we have

\begin{equation*}\mathcal{T}(N) = \mathcal{T}_{\textrm{L}}(N) = \mathcal{P}(N),\quad N \in \mathbb{Z}^+,\end{equation*}

where $\mathcal{P}(N)$ is given by (2.8).

The proof of Lemma 2.2 is given in Appendix C. Lemma 2.2 implies the following.

Implication 2.1. In view of Remark 2.1, we can consider that the ATA attempts to find an approximation to the weight vector $\boldsymbol{\alpha}^*(N)$ for $\overline{\textbf{\textit{H}}}(N)$ in (2.12) by setting the augmentation matrix $\textbf{\textit{Q}}_{\textrm{A}}(N)$ appropriately. In this sense, the linear ATA has the same degree of freedom as the general ATA does.

We thus restrict our attention to the linear ATA with $\textbf{\textit{Q}}_{\textrm{A}}(N)= (\!-\textbf{\textit{Q}}^{(1,1)}(N))\textbf{\textit{e}} \boldsymbol{\zeta}$ , where $\boldsymbol{\zeta} \in\Gamma(N)$ . In this case, (2.31) is reduced to

(2.32) \begin{equation}\boldsymbol{\pi}^{\textrm{trunc}}(N;\, \boldsymbol{\zeta})[\textbf{\textit{Q}}^{(1,1)}(N) + (\!-\textbf{\textit{Q}}^{(1,1)}(N))\textbf{\textit{e}} \boldsymbol{\zeta}] = \textbf{0},\quad\boldsymbol{\pi}^{\textrm{trunc}}(N;\, \boldsymbol{\zeta})\textbf{\textit{e}}=1.\end{equation}

Note here that $\boldsymbol{\pi}^{\textrm{trunc}}(N;\, \boldsymbol{\zeta})$ in (2.32) is closely related to $\boldsymbol{\pi}(N)$ in a Markov chain with the infinitesimal generator $\textbf{\textit{Q}}$ in (2.26). Specifically, using the cut-flow balance equation

\begin{equation*}\boldsymbol{\pi}^{(1)}(N) (\!-\textbf{\textit{Q}}^{(1,1)}(N)) \textbf{\textit{e}}=\boldsymbol{\pi}^{(2)}(N) (\!-\textbf{\textit{Q}}^{(2,2)}(N)) \textbf{\textit{e}},\end{equation*}

we can rewrite (2.27) as

\begin{equation*}\boldsymbol{\pi}^{(1)}(N) \bigl[\textbf{\textit{Q}}^{(1,1)}(N)+(\!-\textbf{\textit{Q}}^{(1,1)}(N)) \textbf{\textit{e}} \boldsymbol{\zeta}(\textbf{\textit{x}})\bigr] = \textbf{0}.\end{equation*}

We thus have $\boldsymbol{\pi}^{\textrm{trunc}}(N;\, \boldsymbol{\zeta}) = \boldsymbol{\pi}(N) =\textbf{\textit{x}}\in \mathcal{P}(N)$ if we set $\boldsymbol{\zeta} = \boldsymbol{\zeta}(\textbf{\textit{x}})$ in (2.32).

Implication 2.2. The linear ATA solution $\boldsymbol{\pi}^{\textrm{trunc}}(N;\, \boldsymbol{\zeta})$ for a specific $\boldsymbol{\zeta}$ is identical to $\boldsymbol{\pi}(N)$ in an ergodic Markov chain whose infinitesimal generator takes the following form:

\begin{equation*}\textbf{\textit{Q}}= \kbordermatrix{&\mathbb{Z}_0^N & \mathbb{Z}_{N+1}^{\infty}\\ \\[-7pt]\mathbb{Z}_0^N & \textbf{\textit{Q}}^{(1,1)}(N) &\quad \textbf{\textit{Q}}^{(1,2)}(N) \\ \\[-7pt] \mathbb{Z}_{N+1}^{\infty}& (\!-\textbf{\textit{Q}}^{(2,2)}(N))\textbf{\textit{e}} \boldsymbol{\zeta} &\quad \textbf{\textit{Q}}^{(2,2)}(N)}.\end{equation*}

Since $\boldsymbol{\pi}^{\textrm{trunc}}(N;\, \boldsymbol{\zeta}) \in \mathcal{T}_{\textrm{L}}(N) = \mathcal{P}(N)$ , it follows from Theorem 2.1 that

(2.33) \begin{equation}\boldsymbol{\pi}^{\textrm{trunc}}(N;\, \boldsymbol{\zeta})=\boldsymbol{\alpha}(N;\, \boldsymbol{\zeta}) \overline{\textbf{\textit{H}}}(N)\end{equation}

for some $\boldsymbol{\alpha}(N;\, \boldsymbol{\zeta}) \in \Gamma(N)$ . Recall that there is a one-to-one correspondence between $\boldsymbol{\pi}^{\textrm{trunc}}(N;\,\boldsymbol{\zeta})$ and $\boldsymbol{\alpha}(N;\, \boldsymbol{\zeta})$ , as stated in Remark 2.1. Note also that there are one-to-one correspondences between $\boldsymbol{\zeta}$ and $\boldsymbol{\alpha}(N;\,\boldsymbol{\zeta})$ ,

(2.34) \begin{equation}\boldsymbol{\alpha}(N;\, \boldsymbol{\zeta})=\frac{\boldsymbol{\zeta} \textrm{diag}(\textbf{\textit{H}}(N)\textbf{\textit{e}})}{\boldsymbol{\zeta} \textrm{diag}(\textbf{\textit{H}}(N)\textbf{\textit{e}})\textbf{\textit{e}}},\qquad\boldsymbol{\zeta}=\frac{\boldsymbol{\alpha}(N;\, \boldsymbol{\zeta}) \textrm{diag}^{-1}(\textbf{\textit{H}}(N)\textbf{\textit{e}})}{\boldsymbol{\alpha}(N;\, \boldsymbol{\zeta}) \textrm{diag}^{-1}(\textbf{\textit{H}}(N)\textbf{\textit{e}}) \textbf{\textit{e}}},\end{equation}

and between $\boldsymbol{\pi}^{\textrm{trunc}}(N;\, \boldsymbol{\zeta})$ and $\boldsymbol{\zeta}$ ,

\begin{equation*}\boldsymbol{\pi}^{\textrm{trunc}}(N;\, \boldsymbol{\zeta})=\frac{\boldsymbol{\zeta} \textbf{\textit{H}}(N)}{\boldsymbol{\zeta} \textbf{\textit{H}}(N)\textbf{\textit{e}}},\qquad\boldsymbol{\zeta}=\frac{\boldsymbol{\pi}^{\textrm{trunc}}(N;\, \boldsymbol{\zeta})(\!-\textbf{\textit{Q}}^{(1,1)}(N))}{\boldsymbol{\pi}^{\textrm{trunc}}(N;\, \boldsymbol{\zeta})(\!-\textbf{\textit{Q}}^{(1,1)}(N))\textbf{\textit{e}}}.\end{equation*}
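These one-to-one correspondences can be checked numerically; the $3 \times 3$ block and the vector $\boldsymbol{\zeta}$ below are hypothetical illustrations:

```python
import numpy as np

# Hypothetical truncated block (N = 2) with strictly substochastic rows.
Q11 = np.array([[-4.0, 1.0, 2.0],
                [ 2.0, -4.0, 1.0],
                [ 1.0, 2.0, -4.0]])
n = Q11.shape[0]
H = np.linalg.inv(-Q11)                     # H(N)
Hbar = H / H.sum(axis=1, keepdims=True)     # H-bar(N)

zeta = np.array([0.2, 0.3, 0.5])
# pi_trunc(N; zeta) from the linear ATA equation (2.32).
G = Q11 + np.outer((-Q11) @ np.ones(n), zeta)
A = G.T.copy()
A[-1, :] = 1.0
b = np.zeros(n)
b[-1] = 1.0
pi_trunc = np.linalg.solve(A, b)

# Closed forms: pi_trunc = zeta H / (zeta H e), and alpha(N; zeta)
# from (2.34), so that pi_trunc = alpha H-bar(N) as in (2.33).
pi_closed = zeta @ H / (zeta @ H @ np.ones(n))
alpha = zeta * H.sum(axis=1)
alpha = alpha / alpha.sum()

# Recovering zeta back from pi_trunc.
zeta_back = pi_trunc @ (-Q11)
zeta_back = zeta_back / zeta_back.sum()
```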

From (2.33), we also observe that $\boldsymbol{\pi}^{\textrm{trunc}}(N;\,\boldsymbol{\zeta})$ is given by a convex combination of the row vectors $\overline{\textbf{\textit{h}}}_i(N)$ ( $i \in \mathbb{Z}_0^N$ ) of $\overline{\textbf{\textit{H}}}(N)$ with the nonnegative weight vector $\boldsymbol{\alpha}(N;\, \boldsymbol{\zeta})$ . It then follows from (2.13) that

(2.35) \begin{equation}\|\boldsymbol{\pi}^{\textrm{trunc}}(N;\, \boldsymbol{\zeta}) - \boldsymbol{\pi}(N) \|_1\leq\max_{i \in \mathbb{Z}_0^N}\| \boldsymbol{\pi}^{\textrm{trunc}}(N;\, \boldsymbol{\zeta}) - \overline{\textbf{\textit{h}}}_i(N) \|_1.\end{equation}

This error bound tempts us to set $\boldsymbol{\alpha}(N;\, \boldsymbol{\zeta})$ in such a way that $\boldsymbol{\pi}^{\textrm{trunc}}(N;\, \boldsymbol{\zeta})$ is located at the center of the $\mathcal{P}(N)$ spanned by the $\overline{\textbf{\textit{h}}}_i(N)$ ( $i\in \mathbb{Z}_0^N$ ). For example, if we set $\boldsymbol{\alpha}(N;\, \boldsymbol{\zeta}) =\textbf{\textit{e}}^T/(N+1)$ , where T stands for the transpose operator, then $\boldsymbol{\pi}^{\textrm{trunc}}(N;\, \boldsymbol{\zeta})$ is given by the center of gravity of $\mathcal{P}(N)$ . Note here that (2.33) is equivalent to $\boldsymbol{\pi}^{\textrm{trunc}}(N;\, \boldsymbol{\zeta}) (\!-\textbf{\textit{Q}}^{(1,1)}(N)) =\boldsymbol{\alpha}(N;\, \boldsymbol{\zeta}) \textrm{diag}^{-1}(\textbf{\textit{H}}(N)\textbf{\textit{e}})$ .

Implication 2.3. If we have a desirable $\boldsymbol{\alpha}(N;\, \boldsymbol{\zeta})$ rather than $\boldsymbol{\zeta}$ itself, $\boldsymbol{\pi}^{\textrm{trunc}}(N;\, \boldsymbol{\zeta})$ for such an $\boldsymbol{\alpha}(N;\, \boldsymbol{\zeta})$ can be computed as follows:

  1. (i) We first obtain $\boldsymbol{\eta}(N)\coloneqq \textbf{\textit{H}}(N)\textbf{\textit{e}}$ by solving $(\!-\textbf{\textit{Q}}^{(1,1)}(N)) \boldsymbol{\eta}(N) = \textbf{\textit{e}}$ .

  2. (ii) We then obtain $\boldsymbol{\pi}^{\textrm{trunc}}(N;\, \boldsymbol{\zeta})$ by solving

    \begin{equation*}\boldsymbol{\pi}^{\textrm{trunc}}(N;\, \boldsymbol{\zeta}) (\!-\textbf{\textit{Q}}^{(1,1)}(N)) =\boldsymbol{\alpha}(N;\, \boldsymbol{\zeta}) \textrm{diag}^{-1}(\boldsymbol{\eta}(N)).\end{equation*}

In the above procedure, we have to solve two systems of linear equations with $-\textbf{\textit{Q}}^{(1,1)}(N)$ . Therefore, relative to the procedure for solving (2.32) directly for a specific $\boldsymbol{\zeta}$ , the computational cost increases by $(N+1)^2$ in terms of the number of multiplications/divisions, if we utilize the LU decomposition. This increase can be regarded as the cost of specifying the weight vector $\boldsymbol{\alpha}(N;\, \boldsymbol{\zeta})$ directly, instead of specifying $\boldsymbol{\zeta}$ .
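The two-step procedure above can be sketched as follows. The $3 \times 3$ block is hypothetical, and $\boldsymbol{\alpha}(N;\, \boldsymbol{\zeta})$ is taken as the uniform weight vector $\textbf{\textit{e}}^T/(N+1)$ discussed after (2.35):

```python
import numpy as np

# Hypothetical truncated block Q^{(1,1)}(N) with N = 2.
Q11 = np.array([[-4.0, 2.0, 1.0],
                [ 1.0, -4.0, 2.0],
                [ 2.0, 1.0, -4.0]])
n = Q11.shape[0]

# Desired weight vector alpha(N; zeta): uniform weights, which place
# pi_trunc at the center of gravity of P(N).
alpha = np.full(n, 1.0 / n)

# Step (i): eta(N) = H(N) e, obtained by solving (-Q^{(1,1)}(N)) eta = e.
eta = np.linalg.solve(-Q11, np.ones(n))

# Step (ii): solve pi_trunc (-Q^{(1,1)}(N)) = alpha diag^{-1}(eta(N)),
# i.e. the transposed system with right-hand side alpha / eta.
rhs = alpha / eta
pi_trunc = np.linalg.solve(-Q11.T, rhs)

# Cross-check against the convex-combination form (2.33).
H = np.linalg.inv(-Q11)
Hbar = H / H.sum(axis=1, keepdims=True)
```

A single LU factorization of $-\textbf{\textit{Q}}^{(1,1)}(N)$ serves both solves, which is the source of the $(N+1)^2$ cost increase mentioned above.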

Next we consider Markov chains in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N), \mathcal{J}(N))$ , where the structural information $\mathcal{J}(N)$ is assumed to be available. It follows from (2.34) that for all $i \in \mathbb{Z}_0^N$ ,

(2.36) \begin{equation}[\boldsymbol{\alpha}(N;\, \boldsymbol{\zeta})]_i = 0\ \Leftrightarrow\ [\boldsymbol{\zeta}]_i = 0.\end{equation}

We then define $\mathcal{T}_{\textrm{L}}^+(N)$ as the set of all possible approximations obtained by the linear ATA when $\mathcal{J}(N)$ is available:

\begin{equation*}\mathcal{T}_{\textrm{L}}^+(N)=\bigl\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}}\big[\textbf{\textit{Q}}^{(1,1)}(N) + (\!-\textbf{\textit{Q}}^{(1,1)}(N)) \textbf{\textit{e}} \boldsymbol{\zeta}\big]= \textbf{0},\ \textbf{\textit{x}}\textbf{\textit{e}} = 1,\boldsymbol{\zeta} \in \Gamma^+(N)\bigr\},\end{equation*}

where $\Gamma^+(N)$ is given by (2.20). By definition, we have $\mathcal{T}_{\textrm{L}}^+(N) \subseteq \mathcal{T}_{\textrm{L}}(N)$ .

Lemma 2.3. Suppose $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N), \mathcal{J}(N)) \neq \emptyset$ . For any Markov chain in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N), \mathcal{J}(N))$ , we have

\begin{equation*}\mathcal{T}_{\textrm{L}}^+(N) = \mathcal{P}^+(N),\quad N \in \mathbb{Z}^+,\end{equation*}

where $\mathcal{P}^+(N)$ is given by (2.23).

The proof of Lemma 2.3 is given in Appendix D. We thus have $\boldsymbol{\pi}(N) \in \textrm{ri}\, \mathcal{T}_{\textrm{L}}^+(N)$ from Theorem 2.2. It also follows from (2.36) that if $[\boldsymbol{\zeta}]_i > 0$ for some $i \in \mathbb{Z}_0^N \setminus \mathcal{J}(N)$ , we have $\boldsymbol{\pi}^{\textrm{trunc}}(N;\,\boldsymbol{\zeta}) \notin \mathcal{P}^+(N)$ , because the $\overline{\textbf{\textit{h}}}_j(N)$ ( $j\in \mathbb{Z}_0^N$ ) are linearly independent. Moreover, it follows from Corollary 2.3 that

(2.37) \begin{equation}\|\boldsymbol{\pi}^{\textrm{trunc}}(N;\, \boldsymbol{\zeta}) - \boldsymbol{\pi}(N) \|_1\leq\max_{i \in \mathcal{J}(N)}\| \boldsymbol{\pi}^{\textrm{trunc}}(N;\, \boldsymbol{\zeta}) - \overline{\textbf{\textit{h}}}_i(N) \|_1,\quad\boldsymbol{\zeta} \in \Gamma^+(N),\end{equation}

which is tighter than (2.35). In summary, we have the following.

Implication 2.4. If the structural information $\mathcal{J}(N)$ is available, it is natural to choose $\boldsymbol{\zeta}$ from $\textrm{ri}\, \Gamma^+(N)$ when obtaining a linear ATA solution, where $\Gamma^+(N)$ is given by (2.20). For example, we may choose $\boldsymbol{\zeta}$ whose ith ( $i \in \mathbb{Z}_0^N$ ) element is given by $1/|\mathcal{J}(N)|$ if $i\in \mathcal{J}(N)$ and by 0 otherwise.

Implication 2.4 indicates that the last-column augmentation $\boldsymbol{\zeta} =(0\ 0\ \cdots\ 0\ 1)$ , which is one of the common augmentation strategies in the literature, may not be effective unless $\mathcal{J}(N) =\{N\}$ , because if $\mathcal{J}(N) \neq \{N\}$ , we have $\boldsymbol{\zeta} = (0\ 0\ \cdots\ 0\ 1) \not\in \textrm{ri}\, \Gamma^+(N)$ and therefore $\boldsymbol{\pi}^{\textrm{trunc}}(N;\, \boldsymbol{\zeta}) \not\in \textrm{ri}\, \mathcal{P}^+(N)$ , while $\boldsymbol{\pi}(N) \in \textrm{ri}\, \mathcal{P}^+(N)$ .

Example 2.3. (Example 2.1 continued.) Consider $\textbf{\textit{Q}}$ in Example 2.1 of Section 2.3. Since $\mathcal{J}(N) = \{0,N\}$ , we may set $\boldsymbol{\zeta} = (\zeta_0\ \ 0\ \ 0\ \ \cdots\ \ 0\ \ 1-\zeta_0) \in\textrm{ri}\, \Gamma^+(N)$ for an arbitrary $\zeta_0 \in (0,1)$ . The ATA solution $\boldsymbol{\pi}^{\textrm{trunc}}(N;\, \boldsymbol{\zeta})$ is then obtained by solving

\begin{equation*}\boldsymbol{\pi}^{\textrm{trunc}}(N;\, \boldsymbol{\zeta}) [\textbf{\textit{Q}}^{(1,1)}(N) + (\!-\textbf{\textit{Q}}^{(1,1)}(N))\textbf{\textit{e}} \boldsymbol{\zeta}] = \textbf{0}\end{equation*}

with $\boldsymbol{\pi}^{\textrm{trunc}}(N;\, \boldsymbol{\zeta}) \textbf{\textit{e}} = 1$. If we are concerned with the error bound in (2.37), we first compute $\overline{\textbf{\textit{h}}}_i(N)$ ( $i=0,N$ ) by the procedure described in Example 2.1 and then adopt $(\overline{\textbf{\textit{h}}}_0(N)+\overline{\textbf{\textit{h}}}_N(N))/2$ as the ATA solution (or alternatively, we perform the procedure in Implication 2.3 with $\boldsymbol{\alpha}(N;\, \boldsymbol{\zeta}) =(1/2\ 0\ 0\ \cdots\ 0\ 1/2)$ ). Note that this choice yields the minimum error bound of (2.37) in this specific example, which is half the error bound of the last-column augmentation.
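The halving of the error bound can be observed numerically. The $4 \times 4$ block below is a hypothetical illustration mimicking the structure of Example 2.1 (every state may jump to state 0 or move down one level), under the assumption that the unseen tail gives $\mathcal{J}(N)=\{0,N\}$:

```python
import numpy as np

# Hypothetical 4x4 block (N = 3) mimicking Example 2.1; we assume the
# unobserved tail of Q makes J(N) = {0, N}.
Q11 = np.array([
    [-5.0, 2.0, 1.0, 1.0],
    [ 1.0, -5.0, 2.0, 1.0],
    [ 1.0, 2.0, -5.0, 1.0],
    [ 1.0, 0.0, 2.0, -5.0],
])
N = Q11.shape[0] - 1

def h_bar(i):
    """Row i of H-bar(N): solve x (-Q^{(1,1)}(N)) = e_i(N), normalize."""
    e_i = np.zeros(N + 1)
    e_i[i] = 1.0
    x = np.linalg.solve(-Q11.T, e_i)
    return x / x.sum()

h0, hN = h_bar(0), h_bar(N)
midpoint = 0.5 * (h0 + hN)              # the ATA solution (h0 + hN)/2

# Error bound (2.37) for the midpoint, with J(N) = {0, N}.
bound_mid = max(np.abs(midpoint - h0).sum(), np.abs(midpoint - hN).sum())
# The last-column augmentation yields pi_trunc = h_bar_N(N), whose
# bound (2.37) is the full distance ||h_N - h_0||_1.
bound_last = np.abs(hN - h0).sum()
```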

Remark 2.2. As shown in (A.1), we have

\begin{equation*}\boldsymbol{\zeta}^*=\frac{\boldsymbol{\pi}^{(2)}(N)\textbf{\textit{Q}}^{(2,1)}(N)}{\boldsymbol{\pi}^{(2)}(N)\textbf{\textit{Q}}^{(2,1)}(N)\textbf{\textit{e}}},\end{equation*}

which satisfies $\boldsymbol{\pi}(N)[\textbf{\textit{Q}}^{(1,1)}(N)+(\!-\textbf{\textit{Q}}^{(1,1)}(N))\textbf{\textit{e}} \boldsymbol{\zeta}^*] = \textbf{0}$ . Therefore, in order to obtain a good one-point approximation to the conditional stationary distribution $\boldsymbol{\pi}(N)$ , we need some information about $\boldsymbol{\pi}^{(2)}(N)$ , as in the Iterative Aggregation/Disaggregation methods for finite-state Markov chains [Reference Stewart13, Reference Takahashi14].

3. Extensions using state transitions in peripheral states

In this section, we consider extensions of the results in Sections 2.2 and 2.3 by using the $(K+1) \times (K+1)$ northwest corner block of the infinitesimal generator $\textbf{\textit{Q}}$ , where $K > N$ . For this purpose, we partition $\boldsymbol{\pi}$ and $\textbf{\textit{Q}}$ as follows:

(3.1) \begin{align}\boldsymbol{\pi}=\begin{pmatrix}\boldsymbol{\pi}^{(1)}(K,N) &\quad \boldsymbol{\pi}^{(2)}(K,N) &\quad \boldsymbol{\pi}^{(3)}(K,N)\end{pmatrix},\qquad\textbf{\textit{Q}}= \kbordermatrix{& \mathbb{Z}_0^N &\quad \mathbb{Z}_{N+1}^{K} &\quad \mathbb{Z}_{K+1}^{\infty}\\ \\[-7pt]\mathbb{Z}_0^N & \textbf{\textit{Q}}^{(1,1)}(K,N) &\quad \textbf{\textit{Q}}^{(1,2)}(K,N) &\quad \textbf{\textit{Q}}^{(1,3)}(K,N)\\ \\[-7pt]\mathbb{Z}_{N+1}^{K} & \textbf{\textit{Q}}^{(2,1)}(K,N) &\quad \textbf{\textit{Q}}^{(2,2)}(K,N) &\quad \textbf{\textit{Q}}^{(2,3)}(K,N)\\ \\[-7pt]\mathbb{Z}_{K+1}^{\infty} & \textbf{\textit{Q}}^{(3,1)}(K,N) &\quad \textbf{\textit{Q}}^{(3,2)}(K,N) &\quad \textbf{\textit{Q}}^{(3,3)}(K,N)}.\end{align}

Note here that from (1.2) and (2.3),

\begin{gather*}\boldsymbol{\pi}^{(1)}(K,N) = \boldsymbol{\pi}^{(1)}(N),\qquad\boldsymbol{\pi}^{(3)}(K,N) = \boldsymbol{\pi}^{(2)}(K),\\ \textbf{\textit{Q}}^{(1,1)}(K,N) = \textbf{\textit{Q}}^{(1,1)}(N),\qquad\textbf{\textit{Q}}^{(3,3)}(K,N) = \textbf{\textit{Q}}^{(2,2)}(K).\end{gather*}

In order to utilize the results in the preceding section, we consider the censored Markov chain $\{\widetilde{X}(t)\}_{t \in (\!-\infty,\infty)}$ obtained by observing the Markov chain $\{X(t)\}_{t \in (\!-\infty,\infty)}$ only when $X(t) \in\mathbb{Z}_0^N\cup\mathbb{Z}_{K+1}^{\infty}$ . The infinitesimal generator of $\{\widetilde{X}(t)\}_{t \in (\!-\infty,\infty)}$ is then given by

(3.2) \begin{equation}\widetilde{\textbf{\textit{Q}}}(K,N)=\kbordermatrix{& \mathbb{Z}_0^N &\quad \mathbb{Z}_{K+1}^{\infty} \\ \\[-7pt] \mathbb{Z}_0^N & \widetilde{\textbf{\textit{Q}}}^{(1,1)}(K,N)&\quad \widetilde{\textbf{\textit{Q}}}^{(1,3)}(K,N)\\ \\[-7pt] \mathbb{Z}_{K+1}^{\infty} & \widetilde{\textbf{\textit{Q}}}^{(3,1)}(K,N)&\quad \widetilde{\textbf{\textit{Q}}}^{(3,3)}(K,N)},\end{equation}

where for $i,j=1,3$ ,

(3.3) \begin{align}\widetilde{\textbf{\textit{Q}}}^{(i,j)}(K,N)&=\textbf{\textit{Q}}^{(i,j)}(K,N)+\textbf{\textit{Q}}^{(i,2)}(K,N)(\!-\textbf{\textit{Q}}^{(2,2)}(K,N))^{-1}\textbf{\textit{Q}}^{(2,j)}(K,N).\end{align}

In particular, we have

(3.4) \begin{equation}\widetilde{\textbf{\textit{Q}}}^{(1,1)}(K,N)=\textbf{\textit{Q}}^{(1,1)}(N)(\textbf{\textit{I}} - \textbf{\textit{R}}(K,N)),\end{equation}

where

(3.5) \begin{align}\textbf{\textit{R}}(K,N)=(\!-\textbf{\textit{Q}}^{(1,1)}(N))^{-1} \textbf{\textit{Q}}^{(1,2)}(K,N)(\!-\textbf{\textit{Q}}^{(2,2)}(K,N))^{-1}\textbf{\textit{Q}}^{(2,1)}(K,N).\end{align}

Note that $(\textbf{\textit{I}} - \textbf{\textit{R}}(K,N))$ in (3.4) is nonsingular because $\widetilde{\textbf{\textit{Q}}}^{(1,1)}(K,N)$ and $\textbf{\textit{Q}}^{(1,1)}(N)$ are nonsingular.
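The identities (3.3)-(3.5), together with Remark 3.1 below, can be verified numerically. The $(K+1) \times (K+1)$ block below is a hypothetical illustration with randomly generated rates:

```python
import numpy as np

# Hypothetical (K+1)x(K+1) northwest corner block with K = 4, N = 1;
# each state sends an extra rate 0.2 to levels above K, so the rows are
# strictly substochastic and every inverse below exists.
K, N = 4, 1
rng = np.random.default_rng(7)
A = rng.uniform(0.1, 1.0, size=(K + 1, K + 1))
np.fill_diagonal(A, 0.0)
QK = A - np.diag(A.sum(axis=1) + 0.2)    # Q^{(1,1)}(K)

Q11 = QK[:N + 1, :N + 1]                 # Q^{(1,1)}(N)
Q12 = QK[:N + 1, N + 1:]                 # Q^{(1,2)}(K,N)
Q21 = QK[N + 1:, :N + 1]                 # Q^{(2,1)}(K,N)
Q22 = QK[N + 1:, N + 1:]                 # Q^{(2,2)}(K,N)

# (3.5): R(K,N) = (-Q^{(1,1)})^{-1} Q^{(1,2)} (-Q^{(2,2)})^{-1} Q^{(2,1)}.
R = np.linalg.solve(-Q11, Q12) @ np.linalg.solve(-Q22, Q21)

# (3.3): censored block; (3.4) says it equals Q^{(1,1)}(N)(I - R(K,N)).
Qtilde11 = Q11 + Q12 @ np.linalg.solve(-Q22, Q21)

# (3.7) and Remark 3.1: H~(K,N) is the northwest corner block of H(K).
Htilde = np.linalg.inv(-Qtilde11)
HK = np.linalg.inv(-QK)
```

The agreement of `Htilde` with the northwest corner of `HK` is the block-inverse (Schur complement) identity underlying Remark 3.1.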

Clearly, Theorem 2.1 is applicable to the censored Markov chain $\{\widetilde{X}(t)\}_{t \in (\!-\infty,\infty)}$ on $\mathbb{Z}_0^N \cup \mathbb{Z}_{K+1}^{\infty}$ . Note also that the conditional stationary distribution of $\{\widetilde{X}(t)\}_{t \in (\!-\infty,\infty)}$ given $\widetilde{X}(0) \in \mathbb{Z}_0^N$ is identical to $\boldsymbol{\pi}(N)$ . We define $\widetilde{\Gamma}(K,N)$ ( $K > N$ ) as

(3.6) \begin{align}\widetilde{\Gamma}(K,N)&=\bigl\{\boldsymbol{\alpha} \in \mathbb{R}^{N+1};\ \boldsymbol{\alpha}=\boldsymbol{\beta} \textrm{diag}^{-1}(\widetilde{\textbf{\textit{H}}}(K,N)\textbf{\textit{e}})(\textbf{\textit{I}} - \textbf{\textit{R}}(K,N))^{-1} \textrm{diag}(\textbf{\textit{H}}(N)\textbf{\textit{e}}),\ \nonumber\\[3pt] &\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\boldsymbol{\beta} \in \Gamma(N)\bigr\},\end{align}

where

(3.7) \begin{equation}\widetilde{\textbf{\textit{H}}}(K,N)=(\!-\widetilde{\textbf{\textit{Q}}}^{(1,1)}(K,N))^{-1}.\end{equation}

Remark 3.1. It can be verified that $\widetilde{\textbf{\textit{H}}}(K,N)$ is identical to the $(N+1) \times (N+1)$ northwest corner block of the $(K+1) \times(K+1)$ matrix $\textbf{\textit{H}}(K) = (\!-\textbf{\textit{Q}}^{(1,1)}(K))^{-1}$ .

For a specific $(K+1) \times (K+1)$ northwest corner block $\textbf{\textit{Q}}^{(1,1)}(K)$ , we define $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K))$ as the collection of ergodic, continuous-time Markov chains on $\mathbb{Z}^+$ whose infinitesimal generators have $\textbf{\textit{Q}}^{(1,1)}(K)$ .

Theorem 3.1. For any Markov chain in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K))$ , we have

\begin{equation*}\boldsymbol{\pi}(N) \in \widetilde{\mathcal{P}}(K,N),\quad K, N \in \mathbb{Z}^+,\ K > N,\end{equation*}

where $\widetilde{\mathcal{P}}(K,N)$ denotes an N-simplex on the first orthant of $\mathbb{R}^{N+1}$ which is given by

(3.8) \begin{align}\widetilde{\mathcal{P}}(K,N)&=\big\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}} (\!-\textbf{\textit{Q}}^{(1,1)}(N)) (\textbf{\textit{I}} - \textbf{\textit{R}}(K,N)) \geq \textbf{0},\textbf{\textit{x}}\textbf{\textit{e}}=1\big\}\end{align}
(3.9) \begin{align}&\hspace*{-30pt}=\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}}=\boldsymbol{\alpha}\overline{\textbf{\textit{H}}}(N),\boldsymbol{\alpha} \in \widetilde{\Gamma}(K,N)\},\end{align}

with $\textbf{\textit{R}}(K,N)$ as in (3.5).

Proof. Associated with $\widetilde{\textbf{\textit{H}}}(K,N)$ in (3.7), we define $\overline{\widetilde{\textbf{\textit{H}}}}(K,N)$ as

(3.10) \begin{align}\overline{\widetilde{\textbf{\textit{H}}}}(K,N)&=\textrm{diag}^{-1}(\widetilde{\textbf{\textit{H}}}(K,N)\textbf{\textit{e}})\widetilde{\textbf{\textit{H}}}(K,N).\end{align}

Applying Theorem 2.1 to (3.2), we obtain $\boldsymbol{\pi}(N) \in \widehat{\mathcal{P}}(K,N)$ , where $\widehat{\mathcal{P}}(K,N)$ has two equivalent expressions:

(3.11) \begin{align}\widehat{\mathcal{P}}(K,N)&=\big\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}} (\!-\widetilde{\textbf{\textit{Q}}}^{(1,1)}(K,N)) \geq \textbf{0},\textbf{\textit{x}}\textbf{\textit{e}}=1\big\}\nonumber\\[3pt] &=\big\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}}=\boldsymbol{\beta} \overline{\widetilde{\textbf{\textit{H}}}}(K,N),\boldsymbol{\beta} \in \Gamma(N)\big\}.\end{align}

Because of (3.4), $\widetilde{\mathcal{P}}(K,N)$ in (3.8) is identical to $\widehat{\mathcal{P}}(K,N)$ .

Next, we show the equivalence between (3.9) and (3.11). It follows from (3.4) and (3.7) that

(3.12) \begin{align}\widetilde{\textbf{\textit{H}}}(K,N)=(\!-\widetilde{\textbf{\textit{Q}}}^{(1,1)}(K,N))^{-1}&=(\textbf{\textit{I}} - \textbf{\textit{R}}(K,N))^{-1} (\!-\textbf{\textit{Q}}^{(1,1)}(N))^{-1}\nonumber\\[3pt] &=(\textbf{\textit{I}} - \textbf{\textit{R}}(K,N))^{-1} \textbf{\textit{H}}(N).\end{align}

Therefore, for an arbitrary $\boldsymbol{\beta} \in \Gamma(N)$ , we have from (3.10) and (3.12) that

(3.13) \begin{align}\boldsymbol{\beta} \overline{\widetilde{\textbf{\textit{H}}}}(K,N)&=\boldsymbol{\beta} \textrm{diag}^{-1}(\widetilde{\textbf{\textit{H}}}(K,N)\textbf{\textit{e}})(\textbf{\textit{I}} - \textbf{\textit{R}}(K,N))^{-1} \textbf{\textit{H}}(N)\nonumber\\[3pt] &= \boldsymbol{\beta} \textrm{diag}^{-1}(\widetilde{\textbf{\textit{H}}}(K,N)\textbf{\textit{e}})(\textbf{\textit{I}} - \textbf{\textit{R}}(K,N))^{-1} \textrm{diag}(\textbf{\textit{H}}(N)\textbf{\textit{e}})\overline{\textbf{\textit{H}}}(N)\nonumber\\[3pt] &=\boldsymbol{\alpha} \overline{\textbf{\textit{H}}}(N),\end{align}

where

(3.14) \begin{equation}\boldsymbol{\alpha} = \boldsymbol{\beta} \textrm{diag}^{-1}(\widetilde{\textbf{\textit{H}}}(K,N)\textbf{\textit{e}})(\textbf{\textit{I}} - \textbf{\textit{R}}(K,N))^{-1} \textrm{diag}(\textbf{\textit{H}}(N)\textbf{\textit{e}}),\end{equation}

which shows the equivalence between (3.9) and (3.11). We thus conclude that

(3.15) \begin{equation}\widetilde{\mathcal{P}}(K,N) = \widehat{\mathcal{P}}(K,N).\end{equation}

$\widetilde{\mathcal{P}}(K,N)$ is an N-simplex on the first orthant of $\mathbb{R}^{N+1}$ by (3.11), (3.15), and the nonsingularity of the nonnegative matrix $\overline{\widetilde{\textbf{\textit{H}}}}(K,N)$. □
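The membership asserted in Theorem 3.1 can be illustrated numerically. In the sketch below (assumed data: a small finite ergodic chain on $\{0,\ldots,M\}$ with random rates serves as a computational stand-in for a chain on $\mathbb{Z}^+$), membership of $\boldsymbol{\pi}(N)$ in $\widetilde{\mathcal{P}}(K,N)$ is checked through the equivalent description (3.11)/(3.15), i.e. $\boldsymbol{\pi}(N)(-\widetilde{\textbf{\textit{Q}}}^{(1,1)}(K,N)) \geq \textbf{0}$.

```python
import numpy as np

rng = np.random.default_rng(0)
M, K, N = 12, 6, 3                      # illustrative sizes
Q = rng.uniform(0.1, 1.0, (M + 1, M + 1))
np.fill_diagonal(Q, 0.0)
np.fill_diagonal(Q, -Q.sum(axis=1))     # valid ergodic generator on {0,...,M}

# Stationary distribution: pi Q = 0, pi e = 1
Asys = np.vstack([Q.T, np.ones(M + 1)])
rhs = np.zeros(M + 2)
rhs[-1] = 1.0
pi = np.linalg.lstsq(Asys, rhs, rcond=None)[0]
piN = pi[:N + 1] / pi[:N + 1].sum()     # conditional distribution pi(N)

# Censored block Qtilde^{(1,1)}(K,N), built from Q^{(1,1)}(K) only
Q11 = Q[:K + 1, :K + 1]
A11, B12 = Q11[:N + 1, :N + 1], Q11[:N + 1, N + 1:]
C21, D22 = Q11[N + 1:, :N + 1], Q11[N + 1:, N + 1:]
Qt = A11 + B12 @ np.linalg.solve(-D22, C21)

# Membership pi(N) in P~(K,N), via (3.11)/(3.15)
assert abs(piN.sum() - 1.0) < 1e-9
assert (piN @ (-Qt) >= -1e-9).all()
```

The inequality holds here because the generator of the chain censored on $\mathbb{Z}_0^N$ equals $\widetilde{\textbf{\textit{Q}}}^{(1,1)}(K,N)$ plus a nonnegative correction capturing excursions above state K.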

Corollary 3.1. $\widetilde{\Gamma}(K,N)$ in (3.6) is a subset of $\Gamma(N)$ :

(3.16) \begin{equation}\widetilde{\Gamma}(K,N) \subseteq \Gamma(N),\quad N, K \in \mathbb{Z}^+,\ N < K.\end{equation}

Furthermore,

(3.17) \begin{equation}\widetilde{\Gamma}(K_2,N) \subseteq \widetilde{\Gamma}(K_1, N),\quad N, K_1, K_2 \in \mathbb{Z}^+,\ N < K_1 < K_2,\end{equation}

and therefore

(3.18) \begin{equation}\widetilde{\mathcal{P}}(K_2,N)\subseteq\widetilde{\mathcal{P}}(K_1,N)\subseteq\mathcal{P}(N),\end{equation}

where $\mathcal{P}(N)$ is as given in Theorem 2.1.

The proof of Corollary 3.1 is given in Appendix E.

Next, we consider an extension of the results in Section 2.3. For a specific $(K+1) \times (K+1)$ northwest corner block $\textbf{\textit{Q}}^{(1,1)}(K)$ and specific structural information $\mathcal{J}(K)$ ( $K > N$ ), we define $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K),\mathcal{J}(K))$ as the collection of ergodic, continuous-time Markov chains on $\mathbb{Z}^+$ whose infinitesimal generators have both $\textbf{\textit{Q}}^{(1,1)}(K)$ and $\mathcal{J}(K)$. For Markov chains in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K), \mathcal{J}(K))$, we define $\widetilde{\mathcal{J}}(K,N)$ ( $K > N$ ) as the set of states in $\mathbb{Z}_0^N$ which are reachable from some states in $\mathbb{Z}_{K+1}^{\infty}$ either directly or only via some states in $\mathbb{Z}_{N+1}^K$:

(3.19) \begin{equation}\widetilde{\mathcal{J}}(K,N)=\{j \in \mathbb{Z}_0^N;\ [\textbf{\textit{e}}^T \widetilde{\textbf{\textit{Q}}}^{(3,1)}(K,N)]_j > 0\}.\end{equation}

Note here that $\widetilde{\mathcal{J}}(K,N) \neq \emptyset$ because of the ergodicity. Note also that $\widetilde{\mathcal{J}}(K,N)$ is determined completely by $\textbf{\textit{Q}}^{(1,1)}(K)$ and $\mathcal{J}(K)$ , because from (1.4), (3.1), and (3.3),

\begin{gather*}\mathcal{J}(K)=\{j \in \mathbb{Z}_0^N;\ \bigl[\textbf{\textit{e}}^T \textbf{\textit{Q}}^{(3,1)}(K,N) \bigr]_j > 0\}\cup\{j \in \mathbb{Z}_{N+1}^K;\ \bigl[ \textbf{\textit{e}}^T \textbf{\textit{Q}}^{(3,2)}(K,N) \bigr]_j > 0\},\\[3pt] \widetilde{\textbf{\textit{Q}}}^{(3,1)}(K,N)=\textbf{\textit{Q}}^{(3,1)}(K,N)+\textbf{\textit{Q}}^{(3,2)}(K,N)\cdot(\!-\textbf{\textit{Q}}^{(2,2)}(K,N))^{-1}\textbf{\textit{Q}}^{(2,1)}(K,N).\end{gather*}

Furthermore, by definition,

(3.20) \begin{equation}\widetilde{\mathcal{J}}(K,N) \subseteq \mathcal{J}(N),\end{equation}

and states in $\mathcal{J}(N) \setminus \widetilde{\mathcal{J}}(K,N)$ , if any, can be reached from $\mathbb{Z}_{K+1}^{\infty}$ only after visiting some states in $\widetilde{\mathcal{J}}(K,N)$ .

Example 3.1. (Example 2.1 continued.) Consider $\textbf{\textit{Q}}$ in Example 2.1 of Section 2.3. By definition, $\mathcal{J}(K)=\{0,K\}$ and $\widetilde{\mathcal{J}}(K,N) = \{0,N\} = \mathcal{J}(N)$ .

Without loss of generality, we assume $\widetilde{\mathcal{J}}(K,N) =\{0,1,\dots,|\widetilde{\mathcal{J}}(K,N)|-1\}$ . We then partition $\widetilde{\textbf{\textit{Q}}}^{(1,1)}(K,N)$ in (3.2) into two matrices,

(3.21) \begin{align}\widetilde{\textbf{\textit{Q}}}^{(1,1)}(K,N)&=\kbordermatrix{&\widetilde{\mathcal{J}}(K,N) &\quad \mathbb{Z}_0^N\setminus\widetilde{\mathcal{J}}(K,N)\\ \\[-9pt] &\widetilde{\textbf{\textit{Q}}}^{(1,1)}_+(K,N)&\quad \widetilde{\textbf{\textit{Q}}}_0^{(1,1)}(K,N)\\},\end{align}

and define $\widetilde{\Gamma}^+(K,N)$ as

(3.22) \begin{align}\widetilde{\Gamma}^+(K,N)&=\bigl\{\boldsymbol{\alpha} \in \mathbb{R}^{N+1};\ \boldsymbol{\alpha}=\boldsymbol{\beta} \textrm{diag}^{-1}(\widetilde{\textbf{\textit{H}}}(K,N)\textbf{\textit{e}})(\textbf{\textit{I}} - \textbf{\textit{R}}(K,N))^{-1} \textrm{diag}(\textbf{\textit{H}}(N)\textbf{\textit{e}}),\ \nonumber\\&{\boldsymbol{\beta}} \in \Gamma(N),\ [\boldsymbol{\beta}]_i=0\ (i \in \mathbb{Z}_0^N\setminus\widetilde{\mathcal{J}}(K,N))\bigr\}.\end{align}

Theorem 3.2. Suppose $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K), \mathcal{J}(K)) \neq \emptyset$ . For any Markov chain in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K), \mathcal{J}(K))$ ( $K > N$ ), we have

(3.23) \begin{align}\boldsymbol{\pi}(N) \in \textrm{ri}\, \widetilde{\mathcal{P}}^+(K,N),\quad K, N \in \mathbb{Z}^+,\ K > N,\end{align}

where $\widetilde{\mathcal{P}}^+(K,N)$ denotes a $(|\widetilde{\mathcal{J}}(K,N)|-1)$ -simplex on the first orthant of $\ \mathbb{R}^{N+1}$ which is given by $\widetilde{\mathcal{P}}(K,N)$ if $\widetilde{\mathcal{J}}(K,N) = \mathbb{Z}_0^N$ , and by

(3.24) \begin{align}\widetilde{\mathcal{P}}^+(K,N)&=\big\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}}(\!-\widetilde{\textbf{\textit{Q}}}_+^{(1,1)}(K,N)) \geq \textbf{0},\nonumber\\& \quad \textbf{\textit{x}}(\!-\widetilde{\textbf{\textit{Q}}}_0^{(1,1)}(K,N)) = \textbf{0},\ \textbf{\textit{x}}\textbf{\textit{e}} = 1\big\} \end{align}
(3.25) \begin{align}\hspace*{60pt}&=\bigl\{ \textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}} = \boldsymbol{\alpha}\overline{\textbf{\textit{H}}}(N),\boldsymbol{\alpha} \in \widetilde{\Gamma}^+(K,N)\bigr\}\end{align}

otherwise.

Proof. If $\widetilde{\mathcal{J}}(K,N) = \mathbb{Z}_0^N$ , Theorem 3.2 can be shown in the same way as Theorem 2.2 for $\mathcal{J}(N)=\mathbb{Z}_0^N$ . We thus assume that $\widetilde{\mathcal{J}}(K,N) \neq \mathbb{Z}_0^N$ . Applying Theorem 2.2 to the censored Markov chain with infinitesimal generator $\widetilde{\textbf{\textit{Q}}}(K,N)$ in (3.2) with partition (3.21), we obtain (3.23) with (3.24), where (3.24) can also be expressed as follows:

(3.26) \begin{align}\widetilde{\mathcal{P}}^+(K,N) = & \bigl\{ \textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}} = \boldsymbol{\beta} \overline{\widetilde{\textbf{\textit{H}}}}(K,N),\nonumber\\[3pt] &\boldsymbol{\beta} \in \Gamma(N),\ [\boldsymbol{\beta}]_i=0\ (i \in \mathbb{Z}_0^N \setminus \widetilde{\mathcal{J}}(K,N))\bigr\},\end{align}

where $\overline{\widetilde{\textbf{\textit{H}}}}(K,N)$ is given by (3.10). Equation (3.25) now follows from (3.13) and (3.26). Since $\overline{\widetilde{\textbf{\textit{H}}}}(K,N)$ is nonnegative and nonsingular, (3.26) implies that $\widetilde{\mathcal{P}}^+(K,N)$ is a $(|\widetilde{\mathcal{J}}(K,N)|-1)$ -simplex on the first orthant of $\mathbb{R}^{N+1}$ . □

By definition,

(3.27) \begin{equation}\widetilde{\Gamma}^+(K,N) \subseteq \widetilde{\Gamma}(K,N),\end{equation}

where $\widetilde{\Gamma}(K,N)$ is given by (3.6). We thus have

(3.28) \begin{equation}\widetilde{\mathcal{P}}^+(K,N) \subseteq \widetilde{\mathcal{P}}(K,N),\end{equation}

where $\widetilde{\mathcal{P}}(K,N)$ is given in Theorem 3.1.

Corollary 3.2. $\widetilde{\Gamma}^+(K,N)$ in (3.22) is a subset of $\Gamma^+(N)$ in (2.20):

(3.29) \begin{align}\widetilde{\Gamma}^+(K,N) \subseteq \Gamma^+(N).\end{align}

Furthermore,

(3.30) \begin{equation}\widetilde{\Gamma}^+(K_2,N) \subseteq \widetilde{\Gamma}^+(K_1, N),\quad N, K_1, K_2 \in \mathbb{Z}^+,\ N < K_1 < K_2,\end{equation}

and therefore

(3.31) \begin{equation}\widetilde{\mathcal{P}}^+(K_2,N)\subseteq\widetilde{\mathcal{P}}^+(K_1, N)\subseteq\mathcal{P}^+(N).\end{equation}

The proof of Corollary 3.2 is given in Appendix F. Since $\widetilde{\mathcal{P}}(K,N)$ in Theorem 3.1 and $\widetilde{\mathcal{P}}^+(K, N)$ in Theorem 3.2 are compact, Corollaries 3.1 and 3.2 imply that these simplices converge to certain limit sets as K goes to infinity. In fact, we have the following proposition.

Proposition 3.1. ([8, Theorem 2.3]) In any ergodic, continuous-time Markov chain $\{X(t)\}_{t \in (\!-\infty,\infty)}$ on $\mathbb{Z}^+$ ,

\begin{equation*}\lim_{K \to \infty} \overline{\widetilde{\textbf{\textit{H}}}}(K,N)=\textbf{\textit{e}}\boldsymbol{\pi}(N),\quad N \in \mathbb{Z}^+.\end{equation*}

The corollary below follows immediately from (3.11), (3.15), (3.18), (3.26), (3.28), (3.31), and Proposition 3.1.

Corollary 3.3. In any ergodic, continuous-time Markov chain $\{X(t)\}_{t \in (\!-\infty,\infty)}$ on $\mathbb{Z}^+$ ,

\begin{equation*}\bigcap_{K=N+1}^{\infty} \widetilde{\mathcal{P}}(K,N)=\lim_{K \to \infty} \widetilde{\mathcal{P}}(K,N)=\{\boldsymbol{\pi}(N)\},\quad N \in \mathbb{Z}^+,\end{equation*}

and

\begin{equation*}\bigcap_{K=N+1}^{\infty} \widetilde{\mathcal{P}}^+(K,N)=\lim_{K \to \infty} \widetilde{\mathcal{P}}^+(K,N)=\{\boldsymbol{\pi}(N)\},\quad N \in \mathbb{Z}^+.\end{equation*}
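The shrinkage described by Corollary 3.3 can be observed numerically. In the sketch below (assumed data: a finite ergodic chain with random rates is used as a computational stand-in, so K only runs up to $M-1$, where the truncated block still has probability leakage), the vertices of $\widetilde{\mathcal{P}}(K,N)$ are the normalized rows of $\overline{\widetilde{\textbf{\textit{H}}}}(K,N)$, and the nesting (3.18) makes their $\ell_1$-diameters nonincreasing in K.

```python
import numpy as np

rng = np.random.default_rng(1)
M, N = 15, 3                            # finite stand-in (illustrative)
Q = rng.uniform(0.05, 1.0, (M + 1, M + 1))
np.fill_diagonal(Q, 0.0)
np.fill_diagonal(Q, -Q.sum(axis=1))

def simplex_vertices(K):
    """Normalized rows of Htilde-bar(K,N): the vertices of P~(K,N)."""
    Q11 = Q[:K + 1, :K + 1]
    A, B = Q11[:N + 1, :N + 1], Q11[:N + 1, N + 1:]
    C, D = Q11[N + 1:, :N + 1], Q11[N + 1:, N + 1:]
    Ht = np.linalg.inv(-(A + B @ np.linalg.solve(-D, C)))
    return Ht / Ht.sum(axis=1, keepdims=True)

def diameter(V):
    """Maximum pairwise L1 distance among the rows of V."""
    return max(np.abs(v - w).sum() for v in V for w in V)

# K runs up to M-1 so that -Qtilde^{(1,1)}(K,N) stays nonsingular
diams = [diameter(simplex_vertices(K)) for K in range(N + 1, M)]
assert all(d2 <= d1 + 1e-9 for d1, d2 in zip(diams, diams[1:]))
```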

Since $\textrm{ri}\, \widetilde{\mathcal{P}}^+(K,N) \subseteq\widetilde{\mathcal{P}}^+(K,N)$ , $\textrm{ri}\, \widetilde{\mathcal{P}}^+(K,N)$ also converges to $\{\boldsymbol{\pi}(N)\}$ as K goes to infinity. Note here that $\textrm{ri}\, \widetilde{\mathcal{P}}^+(K,N)$ may not be tight in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K), \mathcal{J}(K))$ for a finite K. For example, we consider $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N+1), \mathcal{J}(N+1))$ , where $K=N+1$ , $\mathcal{J}(N+1) = \{N+1\}$ , and $q_{N+1,j} > 0$ for all $j \in \mathbb{Z}_0^N$ . We then have $\widetilde{\mathcal{J}}(N+1,N) = \mathbb{Z}_0^N$ , and therefore $\widetilde{\mathcal{P}}^+(N+1,N)$ is an N-simplex spanned by $(N+1)$ row vectors of $\overline{\widetilde{\textbf{\textit{H}}}}(N+1,N)$ , as shown in the proof of Theorem 3.2. On the other hand, it follows from Theorem 2.2 that $\mathcal{P}^+(N+1)$ is given by a singleton with the $(N+1)$ th row vector $\overline{\textbf{\textit{h}}}_{N+1}(N+1)$ of $\overline{\textbf{\textit{H}}}(N+1)$ . In other words, all Markov chains in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(N+1), \{N+1\})$ have the same $\boldsymbol{\pi}(N)$ (i.e. the normalized vector of the first $N+1$ elements of $\overline{\textbf{\textit{h}}}_{N+1}(N+1)$ ), so that $\textrm{ri}\, \widetilde{\mathcal{P}}^+(N+1,N)$ is not tight. In the next section, we find the minimum convex polytope that contains $\boldsymbol{\pi}(N)$ , in the sense that it is tight in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K), \mathcal{J}(K))$ .

4. Minimum convex polytopes for $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K), \mathcal{J}(K))$

In this section, we consider minimum convex polytopes that contain $\boldsymbol{\pi}(N)$ for Markov chains in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K), \mathcal{J}(K))$ . The outline of our approach can be summarized as follows. For convenience, we partition $\textbf{\textit{H}}(K)=(\!-\textbf{\textit{Q}}^{(1,1)}(K))^{-1}$ and $\boldsymbol{\pi}(K)$ as follows:

(4.1) \begin{align}\textbf{\textit{H}}(K) &=\kbordermatrix{&\mathbb{Z}_0^N &\quad \mathbb{Z}_{N+1}^K\\& \textbf{\textit{H}}^{(*,1)}(K;\, N) &\quad \textbf{\textit{H}}^{(*,2)}(K;\, N)},\end{align}
\begin{align}\hspace*{-16pt}\boldsymbol{\pi}(K) &=\kbordermatrix{&\mathbb{Z}_0^N &\quad \mathbb{Z}_{N+1}^K\\& \boldsymbol{\pi}^{(1)}(K;\, N) &\quad \boldsymbol{\pi}^{(2)}(K;\, N)}.\nonumber\end{align}

Since $\boldsymbol{\pi}(K) \in \textrm{ri}\, \mathcal{P}^+(K)$ by Theorem 2.2, $\boldsymbol{\pi}(N)=\boldsymbol{\pi}^{(1)}(K;\,N)/\boldsymbol{\pi}^{(1)}(K;\, N)\textbf{\textit{e}}$ is given by a convex combination of normalized row vectors of $\textbf{\textit{H}}^{(*,1)}(K;\, N)$ corresponding to $\mathcal{J}(K)$ .

Note here that in all sample paths of the first passage from $\mathbb{Z}_{K+1}^{\infty}$ to $\mathbb{Z}_0^N$, $\{X(t)\}_{t \in (-\infty, \infty)}$ must visit at least one state in $\mathcal{J}(K)$. In view of this fact, we first introduce (K,N)-skip-free sets. Roughly speaking, a (K, N)-skip-free set $\mathcal{X}$ is a subset of $\mathbb{Z}_0^K$ such that in all sample paths of the first passage from $\mathbb{Z}_{K+1}^{\infty}$ to $\mathbb{Z}_{0}^{N}$, $\{X(t)\}_{t \in (-\infty, \infty)}$ must visit at least one state in $\mathcal{X}$. We then show in Theorem 4.1 that $\boldsymbol{\pi}(N)=\boldsymbol{\pi}^{(1)}(K;\,N)/\boldsymbol{\pi}^{(1)}(K;\, N)\textbf{\textit{e}}$ is given by a convex combination of normalized row vectors of $\textbf{\textit{H}}^{(*,1)}(K;\, N)$ corresponding to members of $\mathcal{X}$.

To find the minimum convex polytopes that contain $\boldsymbol{\pi}(N)$ , we introduce a partial order among (K, N)-skip-free sets, considering the first passage time from $\mathbb{Z}_{K+1}^{\infty}$ to them. We then show the inclusion relation among convex polytopes associated with (K, N)-skip-free sets in Lemma 4.1. Finally, in Theorem 4.2, we find the smallest (K, N)-skip-free set and the corresponding minimum convex polytope, which is shown to be tight in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K), \mathcal{J}(K))$ .

4.1. (K, N)-skip-free sets

For an arbitrary non-empty subset $\mathcal{B}$ of $\mathbb{Z}^+$ , we define $F(\mathcal{B})$ as the first passage time to $\mathcal{B}$ :

\begin{equation*}F(\mathcal{B}) = \inf\{t \geq 0;\ X(t) \in \mathcal{B}\},\qquad\mathcal{B} \subseteq \mathbb{Z}^+.\end{equation*}

Definition 4.1. ((K, N)-skip-free set.) For arbitrarily fixed $K, N \in \mathbb{Z}^+$ ( $K > N$ ), consider a Markov chain $\{X(t)\}_{t \in (\!-\infty,\infty)}$ in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K),\mathcal{J}(K))$ . We refer to a subset $\mathcal{X}$ of $\mathbb{Z}_0^K$ as a (K, N)-skip-free set if $\{X(t)\}_{t \in (\!-\infty,\infty)}$ starting from any state in $\mathbb{Z}_{K+1}^{\infty}$ must visit $\mathcal{X}$ by the first passage time to $\mathbb{Z}_0^N$ , i.e.,

\begin{equation*}\mathbb{P}(X(t) \in \mathcal{X} \mbox{ for some } t \in (0,F(\mathbb{Z}_0^N)]\mid X(0) \in \mathbb{Z}_{K+1}^{\infty}) = 1.\end{equation*}

In particular, a (K, N)-skip-free set $\mathcal{X}$ is called proper if for any proper subset $\mathcal{Y}$ of $\mathcal{X}$ ,

\begin{equation*}\mathbb{P}(X(t) \in \mathcal{Y} \mbox{ for some } t \in (0,F(\mathbb{Z}_0^N)]\mid X(0) \in \mathbb{Z}_{K+1}^{\infty}) < 1.\end{equation*}

Let $\mathcal{S}(K,N)$ denote the family of all (K, N)-skip-free sets and $\mathcal{S}^*(K,N)$ the family of all proper (K, N)-skip-free sets.

Remark 4.1. By definition,

(i) (K, N)-skip-free sets are determined only by $\textbf{\textit{Q}}^{(1,1)}(K)$ and $\mathcal{J}(K)$,

(ii) $\mathcal{S}^*(K,N)\subseteq\mathcal{S}(K,N)$,

(iii) $\mathcal{J}(K) \in \mathcal{S}(K,N)$ and $\widetilde{\mathcal{J}}(K,N) \in \mathcal{S}^*(K,N)$, where $\widetilde{\mathcal{J}}(K,N)$ is given by (3.19), and

(iv) $\mathcal{J}(K) \cap \mathbb{Z}_0^N \subseteq \mathcal{X}\ \mbox{for all}\ \mathcal{X} \in\mathcal{S}(K,N)$.

We can provide an equivalent definition of (K, N)-skip-free sets by considering a directed graph associated with $\textbf{\textit{Q}}^{(1,1)}(K)$ and $\mathcal{J}(K)$ . Specifically, for arbitrarily fixed $K, N \in \mathbb{Z}^+$ ( $K > N$ ), we define the directed graph

(4.2) \begin{equation}G(K,N)=(V(K,N),E(K,N)),\end{equation}

where V(K, N) and E(K, N) denote sets of nodes and arcs defined as

\begin{align*}V(K,N) &= \{s,t\} \cup \mathbb{Z}_0^K,\\[3pt] E(K,N) &=\{(s,j);\ j \in \mathcal{J}(K)\} \cup \{(i,t);\ i \in \mathbb{Z}_0^N\}\\[3pt]&\quad {}\cup\{(i,j);\ i \in \mathbb{Z}_{N+1}^K,\ j \in \mathbb{Z}_0^K, q_{i,j}>0\}.\end{align*}

Node s represents the set of states in $\mathbb{Z}_{K+1}^{\infty}$ , and node t is a virtual terminal node for the first passage to $\mathbb{Z}_0^N$ . It is readily seen from Definition 4.1 that a subset $\mathcal{X}$ of $\mathbb{Z}_0^K$ is a (K, N)-skip-free set iff $\mathcal{X}$ is an (s,t)-node cut in G(K, N), and that $\mathcal{X}$ is proper iff it is a minimal (s, t)-node cut.
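The node-cut characterization can be made concrete with a small reachability check. The sketch below assumes the transition structure of Examples 2.1 and 4.2 (upward jumps to any higher state, one-step downward transitions, and a direct return to state 0 from states $i \geq 2$, so that $\mathcal{J}(K) = \{0, K\}$); it tests whether a candidate set $\mathcal{X}$ is an (s, t)-node cut in G(K, N) by breadth-first search avoiding $\mathcal{X}$.

```python
from collections import deque

# Assumed transition structure of Examples 2.1/4.2: up-jumps to every
# higher state, one-step down moves, and a direct return to 0 from i >= 2.
K, N = 8, 3
JK = {0, K}                             # J(K) for this chain

def succ(i):
    """Arcs (i, j) of G(K,N) for i in Z_{N+1}^K."""
    out = set(range(i + 1, K + 1))      # up-jumps within Z_0^K
    out.add(i - 1)                      # one-step down
    if i >= 2:
        out.add(0)                      # direct return to 0
    return out

def is_node_cut(X):
    """True iff every s -> t path in G(K,N) meets X (BFS avoiding X)."""
    seen = set(JK - X)                  # arcs (s, j) for j in J(K)
    queue = deque(seen)
    while queue:
        i = queue.popleft()
        if i <= N:
            return False                # arc (i, t) reached while avoiding X
        for j in succ(i) - X - seen:
            seen.add(j)
            queue.append(j)
    return True

# S*(K,N) = { {0,k} ; k in Z_N^K } (cf. Example 4.1): each {0,k} cuts,
# while neither {0} nor {N} alone does.
assert all(is_node_cut({0, k}) for k in range(N, K + 1))
assert not is_node_cut({0}) and not is_node_cut({N})
```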

For a given G(K, N), nodes in $\mathbb{Z}_0^K$ can be classified into two classes, depending on whether or not they lie on at least one path from node s to node t. That is,

\begin{equation*}\mathbb{Z}_0^K=\mathcal{N}_+(K,N) \cup \mathcal{N}_0(K,N),\qquad\mathcal{N}_+(K,N) \cap \mathcal{N}_0(K,N) = \emptyset,\end{equation*}

where nodes in $\mathcal{N}_+(K,N)$ are reachable from node s and each of them has at least one path to node t. Note here that $\mathcal{N}_0(K,N)= \mathbb{Z}_0^K \setminus \mathcal{N}_+(K,N)$ is given by

\begin{equation*}\mathcal{N}_0(K,N) = \mathcal{N}_0^{(1)}(K,N) \cup \mathcal{N}_0^{(2)}(K,N) \cup \mathcal{N}_0^{(3)}(K,N),\end{equation*}

where $\mathcal{N}_0^{(\ell)}(K,N)$ ( $\ell=1,2,3$ ) are defined as follows:

\begin{align*}\mathcal{N}_0^{(1)}(K,N)&=\mathbb{Z}_0^N \setminus \widetilde{\mathcal{J}}(K,N)=\{i \in \mathbb{Z}_0^N;\ \mathbb{P}(X(F(\mathbb{Z}_0^N)) = i \mid X(0) \in \mathbb{Z}_{K+1}^{\infty}) = 0\},\\[3pt] \mathcal{N}_0^{(2)}(K,N)&=\{i \in \mathbb{Z}_{N+1}^K;\ \\[3pt] &\qquad\qquad\mathbb{P}(X(t) \in \mathbb{Z}_{N+1}^{\infty} \mbox{ for all } t \in (0,F(\{i\})]\mid X(0) \in \mathbb{Z}_{K+1}^{\infty}) = 0\},\\[3pt] \mathcal{N}_0^{(3)}(K,N)&=\{i \in \mathbb{Z}_{N+1}^K;\ \mathbb{P}(X(t) \in \mathbb{Z}_{N+1}^K \mbox{ for all } t \in (0,F(\mathbb{Z}_0^N)]\mid X(0) = i) = 0\}.\end{align*}

Roughly speaking, states in $\mathcal{N}_0^{(1)}(K,N) \subseteq \mathbb{Z}_0^N$ and states in $\mathcal{N}_0^{(2)}(K,N) \subseteq \mathbb{Z}_{N+1}^K$ can be reached from any state in $\mathbb{Z}_{K+1}^{\infty}$ only after visiting $\widetilde{\mathcal{J}}(K,N) \subseteq \mathbb{Z}_0^N$. On the other hand, states in $\mathcal{N}_0^{(3)}(K,N) \subseteq \mathbb{Z}_{N+1}^K$ have no paths to $\widetilde{\mathcal{J}}(K,N)$ without visiting $\mathbb{Z}_{K+1}^{\infty}$.

By definition, $\mathcal{X} \cap \mathcal{N}_0(K,N) = \emptyset$ if $\mathcal{X} \in\mathcal{S}^*(K,N)$ , whereas there may exist $\mathcal{X} \in \mathcal{S}(K,N)$ such that $\mathcal{X}\cap \mathcal{N}_0(K,N) \neq \emptyset$ . As we will see, all states in $\mathcal{N}_0(K,N)$ can be excluded from consideration when we study $\boldsymbol{\pi}(N)$ for Markov chains in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K),\mathcal{J}(K))$ .

Before considering minimum convex polytopes for Markov chains in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K),\mathcal{J}(K))$, we provide a remark on (K, N)-skip-free sets. Note that (K, N)-skip-free sets can be regarded as a natural extension of levels with the skip-free-to-the-left property in Markov chains of M/G/1-type. Suppose a Markov chain $\{X(t)\}_{t\in(\!-\infty, \infty)}$ is of M/G/1-type; i.e., its state space $\mathbb{Z}^+$ is partitioned into disjoint levels $\{\mathcal{L}_m;\ m\in \mathbb{Z}^+\}$, and transitions between two levels are skip-free to the left. We now consider the first passage time from $\mathcal{L}_{K+1}$ to $\mathcal{L}_N$ ( $K > N$ ). It then follows that each level $\mathcal{L}_m$ ( $m=N,N+1,\ldots,K$ ) can be regarded as a $(\hat{K},\hat{N})$ -skip-free set, where $\hat{K}$ and $\hat{N}$ denote the total numbers of states in $\bigcup_{m=0}^K \mathcal{L}_m$ and $\bigcup_{m=0}^N \mathcal{L}_m$ , respectively. One of the most notable differences between (K, N)-skip-free sets and levels with the skip-free-to-the-left property is that (K, N)-skip-free sets need not be disjoint, while levels are disjoint.

Example 4.1. (Example 2.1 continued.) Consider $\textbf{\textit{Q}}$ in Example 2.1 of Section 2.3. $\mathcal{S}^*(K,N)$ is given by

\begin{equation*}\mathcal{S}^*(K,N) = \bigl\{ \{0,k\};\ k \in \mathbb{Z}_N^K \bigr\}.\end{equation*}

4.2. Minimum convex polytopes

We consider convex polytopes associated with (K, N)-skip-free sets. We partition $\textbf{\textit{H}}(K)=(\!-\textbf{\textit{Q}}^{(1,1)}(K))^{-1}$ into two matrices as in (4.1). Note here that

(4.3) \begin{equation}[\textbf{\textit{H}}^{(*,1)}(K;\, N)\textbf{\textit{e}}]_{i} = 0,\quad i \in \mathcal{N}_0^{(3)}(K,N),\end{equation}

and $[\textbf{\textit{H}}^{(*,1)}(K;\, N)\textbf{\textit{e}}]_{i} > 0$ for all $i \in \mathbb{Z}_0^K\setminus \mathcal{N}_0^{(3)}(K,N)$ . For any $(K+1)$ -dimensional vector $\textbf{\textit{x}}$ , we define $\textrm{diag}^*(\textbf{\textit{x}})$ as a $(K+1)$ -dimensional diagonal matrix whose ith ( $i \in \mathbb{Z}_0^K$ ) diagonal element is given by

\begin{equation*}[\textrm{diag}^*(\textbf{\textit{x}})]_{i,i}=\begin{cases}\displaystyle\frac{1}{[\textbf{\textit{x}}]_i}, & [\textbf{\textit{x}}]_i \neq 0,\\[3pt] 0, & \textrm{otherwise}.\end{cases}\end{equation*}

We then define $\Gamma(K,N;\,\mathcal{B})$ ( $K,N \in \mathbb{Z}^+$ , $K > N$ , $\mathcal{B} \subseteq\mathbb{Z}_0^K$ ) as

\begin{align*}\Gamma(K,N;\,\mathcal{B})&=\big\{\boldsymbol{\alpha} \in \mathbb{R}^{N+1};\ \boldsymbol{\alpha}=\boldsymbol{\beta}\textbf{\textit{U}}(K,N),\boldsymbol{\beta} \in \Gamma(K),\\[3pt]& {}[\boldsymbol{\beta}]_i \geq 0\ (i \in \mathcal{B}),\ [\boldsymbol{\beta}]_i = 0\ (i \in \mathbb{Z}_0^K\setminus\mathcal{B})\},\end{align*}

where $\Gamma(n)$ ( $n \in \mathbb{Z}^+$ ) is defined in (2.5) and $\textbf{\textit{U}}(K,N)$ is given by

\begin{align*}\textbf{\textit{U}}(K,N)&=\textrm{diag}^*(\textbf{\textit{H}}^{(*,1)}(K;\, N)\textbf{\textit{e}})\begin{pmatrix}\textbf{\textit{I}}\\ (\!-\textbf{\textit{Q}}^{(2,2)}(K,N))^{-1}\textbf{\textit{Q}}^{(2,1)}(K,N)\end{pmatrix}\\&\cdot(\textbf{\textit{I}} - \textbf{\textit{R}}(K,N))^{-1} \textrm{diag}(\textbf{\textit{H}}(N)\textbf{\textit{e}}).\end{align*}

Moreover, for $\mathcal{X} \in \mathcal{S}(K,N)$ , we define $\mathcal{X}^*$ as

\begin{equation*}\mathcal{X}^* = \mathcal{X} \setminus \mathcal{N}_0(K,N).\end{equation*}

Note that

(4.4) \begin{equation}\mathcal{X}^* \in \mathcal{S}(K,N)\ \mbox{ if }\ \mathcal{X} \in \mathcal{S}(K,N),\qquad\mathcal{X}^* = \mathcal{X}\ \mbox{ if }\ \mathcal{X} \in \mathcal{S}^*(K,N).\end{equation}

We then have the following theorem, whose proof is given in Appendix G.

Theorem 4.1. Suppose $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K), \mathcal{J}(K)) \neq \emptyset$ . For any Markov chain in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K),\break \mathcal{J}(K))$ , we have for $\mathcal{X} \in\mathcal{S}(K,N)$ ( $K>N$ ) that

\begin{equation*}\boldsymbol{\pi}(N) \in \mathcal{P}^+(K,N;\,\mathcal{X}^*),\end{equation*}

where $\mathcal{P}^+(K,N;\, \mathcal{X}^*)$ denotes a convex polytope on the first orthant of $\mathbb{R}^{N+1}$ which is given by

(4.5) \begin{align}\mathcal{P}^+(K,N;\, \mathcal{X}^{*})&=\big\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \big[(\textbf{\textit{x}}\ \textbf{\textit{y}})(\!-\textbf{\textit{Q}}^{(1,1)}(K))]_i \geq 0\ (i \in \mathcal{X}^*),\nonumber\\[3pt] &\quad\,\big[(\textbf{\textit{x}}\ \textbf{\textit{y}})(\!-\textbf{\textit{Q}}^{(1,1)}(K))]_i = 0\ (i \in \mathbb{Z}_0^K \setminus \mathcal{X}^{*}),\nonumber\\[3pt] &\quad \textbf{\textit{x}}\textbf{\textit{e}}=1,\ \textbf{\textit{y}}\geq\textbf{0}\big\}\end{align}
(4.6) \begin{align}\hspace*{40pt}&=\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}} = \boldsymbol{\alpha}\overline{\textbf{\textit{H}}}(N),\boldsymbol{\alpha} \in \Gamma(K,N;\, \mathcal{X}^*)\}.\end{align}

In particular,

\begin{equation*}\boldsymbol{\pi}(N) \in \textrm{ri}\, \mathcal{P}^+(K,N;\,\mathcal{X}^*) \ \mbox{ if }\ \mathcal{X}^* \in \mathcal{S}^*(K,N).\end{equation*}

Remark 4.2. Consider two (K, N)-skip-free sets $\mathcal{X}_A, \mathcal{X}_B \in \mathcal{S}(K,N)$ . If $\mathcal{X}_A \subseteq \mathcal{X}_B$ , we have $\mathcal{X}_A^* \subseteq \mathcal{X}_B^*$ by definition. Therefore, it follows from (4.5) that $\mathcal{P}^+(K,N;\, \mathcal{X}_A^*) \subseteq \mathcal{P}^+(K,N;\, \mathcal{X}_B^*)$ if $\mathcal{X}_A\subseteq \mathcal{X}_B$ .

Lemma 4.1. For Markov chains in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K),\mathcal{J}(K))$ with $\mathcal{X}_A,\mathcal{X}_B \in \mathcal{S}(K,N)$ , we consider the first passage times $F(\mathcal{X}_A^*)$ , $F(\mathcal{X}_B^*)$ , and $F(\mathbb{Z}_0^N)$ . If the statements

\begin{equation*}F(\mathcal{X}_A^*) \leq F(\mathcal{X}_B^*)\ \mbox{ given that }\ X(0) \in \mathbb{Z}_{K+1}^{\infty}\end{equation*}

and

(4.7) \begin{equation}F(\mathcal{X}_B^*) \leq F(\mathbb{Z}_0^N)\ \mbox{ given that }\ X(0) \in \mathcal{X}_A^*,\end{equation}

hold sample-path-wise, we have

\begin{equation*}\mathcal{P}^+(K,N;\, \mathcal{X}_A^*) \subseteq \mathcal{P}^+(K,N;\, \mathcal{X}_B^*).\end{equation*}

The proof of Lemma 4.1 is given in Appendix H. By definition, $\widetilde{\mathcal{J}}(K,N) \in \mathcal{S}^*(K,N)$ , and $\mathcal{P}^+(K,N;\,\widetilde{\mathcal{J}}(K,N))$ is identical to $\widetilde{\mathcal{P}}^+(K,N)$ in (3.24). Recall that $\widetilde{\mathcal{J}}(K,N) \subseteq \mathbb{Z}_0^N$ . Therefore, for any $\mathcal{X}\in \mathcal{S}(K,N)$ , $F(\mathcal{X}) \leq F(\widetilde{\mathcal{J}}(K,N))$ holds sample-path-wise, given $X(0) \in \mathbb{Z}_{K+1}^{\infty}$ . On the other hand, $\mathcal{J}(K) \in\mathcal{S}(K,N)$ , and each state in $\mathcal{J}(K)$ is reachable from some states in $\mathbb{Z}_{K+1}^{\infty}$ by direct transitions. Note here that

(4.8) \begin{equation}\mathcal{J}^*(K)=\mathcal{J}(K) \setminus \mathcal{N}_0(K,N)=\mathcal{J}(K) \setminus \mathcal{N}_0^{(3)}(K,N).\end{equation}

We thus have $F(\mathcal{J}^*(K)) \leq F(\mathcal{X}^*)$ sample-path-wise, given $X(0) \in \mathbb{Z}_{K+1}^{\infty}$ . The corollary below follows from Theorem 4.1 and the above observations.

Corollary 4.1. Among (K, N)-skip-free sets, $\widetilde{\mathcal{J}}(K,N)$ in (3.19) gives a maximum convex polytope, i.e.,

\begin{equation*}\mathcal{P}^+(K,N;\, \mathcal{X}^*) \subseteq \mathcal{P}^+(K,N;\,\widetilde{\mathcal{J}}(K,N)),\quad\mathcal{X} \in \mathcal{S}(K,N).\end{equation*}

On the other hand, among (K, N)-skip-free sets, $\mathcal{J}^*(K)$ gives a minimum convex polytope, i.e.,

\begin{equation*}\mathcal{P}^+(K,N;\,\mathcal{J}^*(K)) \subseteq \mathcal{P}^+(K,N;\, \mathcal{X}^*),\quad\mathcal{X} \in \mathcal{S}(K,N).\end{equation*}

Note here that $\mathcal{J}^*(K)$ is not proper in general. We thus consider a proper (K, N)-skip-free set that gives the minimum convex polytope $\mathcal{P}^+(K,N;\,\mathcal{J}^*(K))$ . Specifically, we introduce a subset $\mathcal{D}(K,N)$ of $\mathcal{J}(K)$ , which is generated by the following procedure.

Procedure 4.1. ([9].)

(i) We obtain a directed graph $G^*(K,N)$ from G(K, N) in (4.2) by removing all incoming arcs to nodes in $\mathcal{J}(K)$.

(ii) We then find the subset $\mathcal{D}(K,N)$ of nodes in $\mathcal{J}(K)$ from which node t is reachable in $G^*(K,N)$.

Note that Step (ii) can be performed with O(K) operations, by considering reverse paths from node t to node s in the graph $G^*(K,N)$ .
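A minimal implementation of Procedure 4.1 is sketched below, assuming the transition structure of Examples 2.1 and 4.2, for which $\mathcal{J}(K) = \{0, K\}$; the BFS here is a plain reachability test, not the O(K) reverse-search variant mentioned above.

```python
from collections import deque

# Assumed transition structure of Examples 2.1/4.2: up-jumps to every
# higher state, one-step down moves, direct return to 0 from i >= 2.
K, N = 8, 3
JK = {0, K}                             # J(K) for this chain

def succ_star(i):
    """Successors of i in G*(K,N): arcs of G(K,N) for i in Z_{N+1}^K,
    with every arc entering J(K) removed (step (i))."""
    out = set(range(i + 1, K + 1))      # up-jumps within Z_0^K
    out.add(i - 1)                      # one-step down
    if i >= 2:
        out.add(0)                      # direct return to 0
    return out - JK                     # cut all incoming arcs to J(K)

def reaches_t(j):
    """Is node t reachable from node j in G*(K,N)? (plain BFS)"""
    seen, queue = {j}, deque([j])
    while queue:
        i = queue.popleft()
        if i <= N:
            return True                 # arc (i, t)
        for v in succ_star(i) - seen:
            seen.add(v)
            queue.append(v)
    return False

D = {j for j in JK if reaches_t(j)}     # step (ii)
assert D == {0, K}                      # matches Example 4.2
```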

Lemma 4.2. ([9].) Suppose $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K), \mathcal{J}(K)) \neq \emptyset$ . For any Markov chain in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K),\mathcal{J}(K))$ ( $K>N$ ), we have $\mathcal{D}(K,N) \in \mathcal{S}^*(K,N)$ . Furthermore, $\mathcal{D}(K,N) \subseteq \mathcal{X}$ for all $\mathcal{X} \subseteq \mathcal{J}(K)$ such that $\mathcal{X} \in \mathcal{S}(K,N)$ .

The proof of Lemma 4.2 is given in Appendix I.

Lemma 4.3. Suppose $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K), \mathcal{J}(K)) \neq \emptyset$ . For any Markov chain in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K),\break \mathcal{J}(K))$ ( $K>N$ ), we have

\begin{equation*}\mathcal{P}^+(K,N;\,\mathcal{D}(K,N)) = \mathcal{P}^+(K,N;\,\mathcal{J}^*(K)),\quad K,N\in\mathbb{Z}^+,\ K>N,\end{equation*}

where $\mathcal{J}^*(K)$ is given by (4.8).

Proof. Since $\mathcal{D}(K,N) \in \mathcal{S}^*(K,N)$ , we have $\mathcal{D}^*(K,N) = \mathcal{D}(K,N)$ from (4.4). It then follows from Remark 4.2 and $\mathcal{D}(K,N) \subseteq \mathcal{J}(K)$ that

\begin{equation*}\mathcal{P}^+(K,N;\, \mathcal{D}(K,N)) = \mathcal{P}^+(K,N;\, \mathcal{D}^*(K,N)) \subseteq\mathcal{P}^+(K,N;\, \mathcal{J}^*(K)).\end{equation*}

On the other hand, it follows from Lemma 4.2 that $F(\mathcal{J}^*(K)) \leq F(\mathcal{D}(K,N))$ if $X(0) \in\mathbb{Z}_{K+1}^{\infty}$ and $F(\mathcal{D}(K,N)) \leq F(\mathbb{Z}_0^N)$ if $X(0) \in\mathcal{J}^*(K)$ . We thus have $\mathcal{P}^+(K,N;\, \mathcal{J}^*(K)) \subseteq \mathcal{P}^+(K,N;\,\mathcal{D}(K,N))$ from Lemma 4.1, which completes the proof. □

The following theorem shows that the minimum convex polytope $\mathcal{P}^+(K,N;\, \mathcal{D}(K,N))$ is tight in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K), \mathcal{J}(K))$ .

Theorem 4.2. Suppose $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K), \mathcal{J}(K)) \neq \emptyset$ . For any $\textbf{\textit{x}} \in \textrm{ri}\, \mathcal{P}^+(K,N;\,\mathcal{D}(K,N))$ , there exists a Markov chain in $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K), \mathcal{J}(K))$ whose $\boldsymbol{\pi}(N)$ is given by $\textbf{\textit{x}}$ .

The proof of Theorem 4.2 is given in Appendix J. As shown in the proof, if $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K), \mathcal{J}(K)) \neq \emptyset$ and $\textbf{\textit{x}} \in\textrm{ri}\, \mathcal{P}^+(K,N;\,\mathcal{D}(K,N))$ , we have $\textbf{\textit{x}} > \textbf{0}$ . The following corollary follows from the fact that $\mathcal{P}^+(K,N;\,\mathcal{D}(K,N))$ is a convex polytope spanned by $\overline{\textbf{\textit{h}}}_i^{(*,1)}(K;\, N)$ for $i \in\mathcal{D}(K,N)$ (cf. (H.1)), where $\overline{\textbf{\textit{h}}}_i^{(*,1)}(K;\, N)$ ( $i \in \mathbb{Z}_0^K$ ) denotes the ith normalized row vector of $\textbf{\textit{H}}^{(*,1)}(K;\, N)$ in (4.1).

Corollary 4.2. Suppose $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K), \mathcal{J}(K)) \neq \emptyset$ . For an arbitrary probability vector $\textbf{\textit{x}} \in \Gamma(N)$ , we have

\begin{equation*}\| \textbf{\textit{x}} - \boldsymbol{\pi}(N) \|_1\leq\max_{i \in \mathcal{D}(K,N)}\| \textbf{\textit{x}} - \overline{\textbf{\textit{h}}}_i^{(*,1)}(K;\, N) \|_1.\end{equation*}

Furthermore, if $\textbf{\textit{x}} \in \mathcal{P}^+(K,N;\, \mathcal{D}(K,N))$ , we have

\begin{equation*}\max_{i \in \mathcal{D}(K,N)}\| \textbf{\textit{x}} - \overline{\textbf{\textit{h}}}_i^{(*,1)}(K;\, N) \|_1\leq\max_{i,j \in \mathcal{D}(K,N)}\| \overline{\textbf{\textit{h}}}_j^{(*,1)}(K;\, N)-\overline{\textbf{\textit{h}}}_i^{(*,1)}(K;\, N) \|_1.\end{equation*}

Example 4.2. (Example 2.1 continued.) Consider $\textbf{\textit{Q}}$ in Example 2.1 of Section 2.3. Note that $\mathcal{J}(N)=\{0,N\}$ and $\mathcal{J}(K)= \mathcal{J}^*(K) = \mathcal{D}(K,N) = \{0,K\}$ . We set $q_{i,j} = \lambda_i b_{j-i}$ ( $i \in \mathbb{Z}^+, j > i$ ), $q_{i,i-1} = \mu_i$ ( $i \in \mathbb{Z}^+ \setminus\{0\}$ ), and $q_{i,0} = \gamma_i$ ( $i \in \mathbb{Z}^+ \setminus \{0,1\}$ ), where

\begin{gather*}\lambda_i = 2 - \frac{1}{i+1}\quad (i \in \mathbb{Z}^+),\qquad b_i = \frac{1}{2^i}\quad (i \in \mathbb{Z}^+ \setminus \{0\}),\\ \mu_i = \frac{i}{10}\quad (i \in \mathbb{Z}^+ \setminus \{0\}),\qquad\gamma_i = \frac{\lambda_i}{i}\quad (i \in \mathbb{Z}^+ \setminus \{0,1\}).\end{gather*}

Table 1 shows the worst-case error bound

\begin{equation*}\max_{i,j \in \mathcal{D}(K,N)} \| \overline{\textbf{\textit{h}}}_j^{(*,1)}(K;\, N) -\overline{\textbf{\textit{h}}}_i^{(*,1)}(K;\, N) \|_1 = \|\overline{\textbf{\textit{h}}}_0^{(*,1)}(K;\, N) - \overline{\textbf{\textit{h}}}_K^{(*,1)}(K;\,N) \|_1\end{equation*}

of Corollary 4.2, where $N=50$ ; the entry for $K=N$ corresponds to the worst-case error bound

\begin{equation*}\max_{i,j \in \mathcal{J}(N)} \| \overline{\textbf{\textit{h}}}_j(N) - \overline{\textbf{\textit{h}}}_i(N) \|_1 =\|\overline{\textbf{\textit{h}}}_0(N) - \overline{\textbf{\textit{h}}}_N(N) \|_1\end{equation*}

of Corollary 2.3. We observe that the worst-case error bound steadily decreases as K increases and that if $K=150$ , any probability vector $\textbf{\textit{x}} \in \textrm{ri}\, \mathcal{P}^+(K,N;\, \mathcal{D}(K,N))$ is a good approximation to $\boldsymbol{\pi}(N)$ in this specific case.

Table 1. The worst-case error bounds in Example 4.2 ( $N=50$ ).

5. Concluding remarks

This paper studied the conditional stationary distribution $\boldsymbol{\pi}(N)$ in ergodic, continuous-time Markov chains on $\mathbb{Z}^+$ via systems of linear inequalities. Specifically, we first obtained an N-simplex $\mathcal{P}(N)$ containing $\boldsymbol{\pi}(N)$ , assuming only the knowledge of the $(N+1) \times (N+1)$ northwest corner block $\textbf{\textit{Q}}^{(1,1)}(N)$ of the infinitesimal generator. We then refined this result and eliminated unnecessary vertices from $\mathcal{P}(N)$ , using the structural information $\mathcal{J}(N)$ . These results are closely related to the augmented truncation approximation (ATA), and we provided some practical implications for it. In particular, we observed that the linear ATA is sufficient for computing an ATA solution, and the structural information $\mathcal{J}(N)$ , if available, is useful for obtaining a reasonable ATA solution.

Next, we extended our results to the cases where the $(K+1) \times (K+1)$ ( $K > N$ ) northwest corner block (and $\mathcal{J}(K)$ ) are available. Furthermore, we introduced new state transition structures called (K, N)-skip-free sets, and we obtained a tight convex polytope that contains $\boldsymbol{\pi}(N)$ , under the condition that only $\textbf{\textit{Q}}^{(1,1)}(K)$ and $\mathcal{J}(K)$ are available.

Note that the results in this paper are also applicable to a time-homogeneous, discrete-time Markov chain on $\mathbb{Z}^+$ . Specifically, in a discrete-time Markov chain with a state transition probability matrix $\textbf{\textit{P}}_D$ , the stationary distribution $\boldsymbol{\pi}_D$ , if it exists, satisfies $\boldsymbol{\pi}_D = \boldsymbol{\pi}_D \textbf{\textit{P}}_D$ and $\boldsymbol{\pi}_D \textbf{\textit{e}}=1$ . Therefore, we can study $\boldsymbol{\pi}_D$ by considering the corresponding continuous-time Markov chain with the infinitesimal generator $\textbf{\textit{Q}}=\textbf{\textit{P}}_D-\textbf{\textit{I}}$ .
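This reduction is easy to exercise numerically. The sketch below (assuming NumPy, with an arbitrary $4 \times 4$ stochastic matrix as a stand-in for $\textbf{\textit{P}}_D$) computes $\boldsymbol{\pi}_D$ by power iteration and recovers the same vector as the stationary solution of the continuous-time chain with $\textbf{\textit{Q}}=\textbf{\textit{P}}_D-\textbf{\textit{I}}$:

```python
import numpy as np

rng = np.random.default_rng(7)
P = rng.random((4, 4)) + 0.1        # strictly positive entries, hence ergodic
P /= P.sum(axis=1, keepdims=True)   # row-stochastic P_D

# pi_D = pi_D P_D with pi_D e = 1, computed by power iteration.
pi = np.full(4, 0.25)
for _ in range(5000):
    pi = pi @ P

# The corresponding continuous-time chain has generator Q = P_D - I.
# Solve pi Q = 0 together with the normalization pi e = 1.
Q = P - np.eye(4)
A = np.vstack([Q.T, np.ones(4)])
b = np.concatenate([np.zeros(4), [1.0]])
pi_ct, *_ = np.linalg.lstsq(A, b, rcond=None)

print(np.allclose(pi, pi_ct, atol=1e-8))  # True
```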

In this paper, the state space of a Markov chain $\{X(t)\}_{t \in (\!-\infty,\infty)}$ was assumed to be countably infinite. Except for Corollary 3.3, however, all the results in this paper also hold for Markov chains with finite state space $\mathbb{Z}_0^{K^*}$ for some $K^* \in \mathbb{Z}^+$ such that $N < K < K^*$ , if we replace $\mathbb{Z}_N^{\infty}$ and $\mathbb{Z}_K^{\infty}$ by $\mathbb{Z}_N^{K^*}$ and $\mathbb{Z}_K^{K^*}$ , respectively.

Except in Section 2.4, we have scarcely discussed one-point approximations to the conditional stationary distribution $\boldsymbol{\pi}(N)$ . In view of the error bounds in Corollaries 2.3 and 4.2, it seems reasonable to choose the center of a convex polytope that contains $\boldsymbol{\pi}(N)$ as a one-point approximation to $\boldsymbol{\pi}(N)$ . Note, however, that we cannot identify the exact location of $\boldsymbol{\pi}(N)$ within these convex polytopes unless further information is available, so for a specific problem the choice of the center may or may not work well. The development of new approximation algorithms for the (un)conditional stationary distributions of Markov chains remains future work.
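The centroid heuristic and the error bound of Corollary 2.3 can be checked on a small finite chain. In the sketch below (assuming NumPy; the $6 \times 6$ generator is an arbitrary stand-in, with $N=2$), the vertices of $\mathcal{P}(N)$ are the normalized rows of $\textbf{\textit{H}}(N)=(\!-\textbf{\textit{Q}}^{(1,1)}(N))^{-1}$, the one-point approximation is their centroid, and the bound holds because $\boldsymbol{\pi}(N)$ is a convex combination of those vertices:

```python
import numpy as np

rng = np.random.default_rng(1)
n, N = 6, 2

# Hypothetical ergodic generator on {0,...,5}: positive off-diagonal rates.
Q = rng.random((n, n)) + 0.05
np.fill_diagonal(Q, 0.0)
np.fill_diagonal(Q, -Q.sum(axis=1))

# Stationary distribution pi of the full chain (pi Q = 0, pi e = 1),
# and the conditional distribution pi(N) on {0,...,N}.
A = np.vstack([Q.T, np.ones(n)])
b = np.concatenate([np.zeros(n), [1.0]])
pi, *_ = np.linalg.lstsq(A, b, rcond=None)
piN = pi[: N + 1] / pi[: N + 1].sum()

# Vertices of the simplex P(N): normalized rows of H(N) = (-Q^{(1,1)}(N))^{-1}.
H = np.linalg.inv(-Q[: N + 1, : N + 1])
hbar = H / H.sum(axis=1, keepdims=True)

# One-point approximation: the centroid of the vertices.
x = hbar.mean(axis=0)
err = np.abs(x - piN).sum()
bound = max(np.abs(x - h).sum() for h in hbar)
print(err <= bound + 1e-12)  # True: pi(N) is a convex combination of the vertices
```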

Appendix A. Proof of Corollary 2.2

Substituting (2.15) and (2.16) into (2.12) yields (2.17). We thus consider (2.15) and (2.16) below. Because

\begin{equation*}\textbf{\textit{H}}(N) = (\!-\textbf{\textit{Q}}^{(1,1)}(N))^{-1} = \int_0^{\infty} \exp(\textbf{\textit{Q}}^{(1,1)}(N) t) \textrm{d} t,\end{equation*}

we have

\begin{equation*}[\textbf{\textit{H}}(N)]_{i,j}=\mathbb{E}\Bigl[ \int_{T_n^{(1)}}^{T_n^{(2)}} \mathbb{I}(X(t)=j) \textrm{d} t\mid X(T_n^{(1)}) = i \Bigr],\quad i,j \in \mathbb{Z}_0^N,\end{equation*}

and therefore

\begin{equation*}[\overline{\textbf{\textit{H}}}(N)]_{i,j}=\displaystyle\frac{1}{\mathbb{E}\bigl[ T_n^{(2)} - T_n^{(1)} \mid X(T_n^{(1)}) = i \bigr]}\mathbb{E}\Bigl[ \int_{T_n^{(1)}}^{T_n^{(2)}} \mathbb{I}(X(t)=j) \textrm{d} t\mid X(T_n^{(1)}) = i \Bigr],\end{equation*}

from which (2.15) follows.
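The identity $\textbf{\textit{H}}(N) = \int_0^{\infty} \exp(\textbf{\textit{Q}}^{(1,1)}(N) t)\, \textrm{d} t$ used above can also be verified numerically: if $\textbf{\textit{Q}}^{(1,1)}(N)=\textbf{\textit{V}} \boldsymbol{\Lambda} \textbf{\textit{V}}^{-1}$ is diagonalizable with eigenvalues in the open left half-plane, then $\int_0^{\infty} \textrm{e}^{\lambda t}\, \textrm{d} t = -1/\lambda$ entrywise, so the integral collapses to $\textbf{\textit{V}}(\!-\boldsymbol{\Lambda}^{-1})\textbf{\textit{V}}^{-1} = (\!-\textbf{\textit{Q}}^{(1,1)}(N))^{-1}$. A sketch with a hypothetical $3 \times 3$ block (assuming NumPy):

```python
import numpy as np

# Hypothetical northwest-corner block: row sums <= 0 with strictly negative
# row sums (escape to truncated states), so -Q is a nonsingular M-matrix.
Q = np.array([[-2.0, 1.0, 0.5],
              [0.3, -1.5, 0.7],
              [0.2, 0.6, -1.8]])
H = np.linalg.inv(-Q)

# exp(Qt) = V exp(Lambda t) V^{-1}, and int_0^infty e^{lambda t} dt = -1/lambda
# for Re(lambda) < 0, so the integral equals V (-Lambda^{-1}) V^{-1}.
L, V = np.linalg.eig(Q)
assert np.all(L.real < 0)
H_int = (V * (-1.0 / L)) @ np.linalg.inv(V)

print(np.allclose(H, H_int.real, atol=1e-10))  # True
```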

We also have $\boldsymbol{\pi}^{(1)}(N)=\boldsymbol{\pi}^{(2)}(N) \textbf{\textit{Q}}^{(2,1)}(N)\textbf{\textit{H}}(N)$ from (2.4) and (2.9), so that

\begin{equation*}\boldsymbol{\pi}^{(1)}(N)\textbf{\textit{e}}=\boldsymbol{\pi}^{(2)}(N) \textbf{\textit{Q}}^{(2,1)}(N)\textrm{diag}(\textbf{\textit{H}}(N)\textbf{\textit{e}})\textbf{\textit{e}}.\end{equation*}

It then follows from (2.10) and (2.12) that

\begin{equation*}\boldsymbol{\alpha}^*(N)=\frac{\boldsymbol{\pi}^{(2)}(N)}{\boldsymbol{\pi}^{(1)}(N)\textbf{\textit{e}}}\cdot\textbf{\textit{Q}}^{(2,1)}(N) \textrm{diag} (\textbf{\textit{H}}(N) \textbf{\textit{e}})=\frac{\boldsymbol{\zeta}^*(N) \textrm{diag}(\textbf{\textit{H}}(N)\textbf{\textit{e}})}{\boldsymbol{\zeta}^*(N) \textrm{diag}(\textbf{\textit{H}}(N)\textbf{\textit{e}}) \textbf{\textit{e}}},\end{equation*}

where

(A.1) \begin{equation}\boldsymbol{\zeta}^*(N)=\frac{\boldsymbol{\pi}^{(2)}(N) \textbf{\textit{Q}}^{(2,1)}(N)}{\boldsymbol{\pi}^{(2)}(N) \textbf{\textit{Q}}^{(2,1)}(N)\textbf{\textit{e}}}.\end{equation}

It is readily seen that

\begin{equation*}[\boldsymbol{\zeta}^*(N)]_i = \mathbb{P}(X(T_n^{(1)}) = i),\quad i \in \mathbb{Z}_0^N.\end{equation*}

We thus have

\begin{equation*}[\boldsymbol{\alpha}^*(N)]_i=\displaystyle\frac{1}{\mathbb{E}\bigl[ T_n^{(2)} - T_n^{(1)} \bigr]}\mathbb{E}\Bigl[ \displaystyle\int_{T_n^{(1)}}^{T_n^{(2)}} \mathbb{I}( I(t) = i) \textrm{d} t \Bigr],\quad i \in \mathbb{Z}_0^N,\end{equation*}

from which (2.16) follows.

Appendix B. Equivalence between (2.22) and (2.23) for $\mathcal{J}(N) \neq \mathbb{Z}_0^N$

Suppose $\mathcal{J}(N) \neq \mathbb{Z}_0^N$ . We partition $\overline{\textbf{\textit{H}}}(N)$ into two matrices:

\begin{equation*}\overline{\textbf{\textit{H}}}(N)=\kbordermatrix{ & \mathbb{Z}_0^N\\ \mathcal{J}(N) & \overline{\textbf{\textit{H}}}_+(N)\\ \mathbb{Z}_0^N \setminus \mathcal{J}(N)\quad & \overline{\textbf{\textit{H}}}_0(N)}.\end{equation*}

Note that (2.23) for $\mathcal{J}(N) \neq \mathbb{Z}_0^N$ is equivalent to

\begin{equation*}\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}}= \boldsymbol{\alpha} \overline{\textbf{\textit{H}}}_+(N),\boldsymbol{\alpha} \geq \textbf{0},\ \boldsymbol{\alpha}\textbf{\textit{e}} =1\}.\end{equation*}

We now rewrite (2.22) as

\begin{align*}\mathcal{P}^+(N)&=\big\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}}(\!-\textbf{\textit{Q}}_+^{(1,1)}(N)) \geq \textbf{0},\ \textbf{\textit{x}}(\!-\textbf{\textit{Q}}_0^{(1,1)}(N)) = \textbf{0},\ \textbf{\textit{x}}\textbf{\textit{e}} = 1\big\}\\[2pt]&=\bigl\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}} (\!-\textbf{\textit{Q}}_+^{(1,1)}(N)) = \textbf{\textit{y}},\ \textbf{\textit{x}} (\!-\textbf{\textit{Q}}_0^{(1,1)}(N)) = \textbf{0},\ \textbf{\textit{y}} \geq \textbf{0},\ \textbf{\textit{x}}\textbf{\textit{e}} =1\bigr\}\\[2pt] &=\bigl\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}} (\!- \textbf{\textit{Q}}_+^{(1,1)}(N)\ \ -\textbf{\textit{Q}}_0^{(1,1)}(N)) = (\textbf{\textit{y}}\ \ \textbf{0}),\ \textbf{\textit{y}} \geq \textbf{0},\ \textbf{\textit{x}}\textbf{\textit{e}} =1\bigr\}\\[2pt] &=\bigl\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}} (\!-\textbf{\textit{Q}}^{(1,1)}(N)) = (\textbf{\textit{y}}\ \ \textbf{0}),\ \textbf{\textit{y}} \geq \textbf{0},\ \textbf{\textit{x}}\textbf{\textit{e}} =1\bigr\}\\[2pt] &=\bigl\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}}= (\textbf{\textit{y}}\ \ \textbf{0}) \textbf{\textit{H}}(N),\ \textbf{\textit{y}} \geq \textbf{0},\ \textbf{\textit{x}}\textbf{\textit{e}} =1\bigr\}\\[2pt] &=\bigl\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}}= (\textbf{\textit{y}}\ \ \textbf{0}) \textrm{diag}(\textbf{\textit{H}}(N)\textbf{\textit{e}}) \overline{\textbf{\textit{H}}}(N),\ \textbf{\textit{y}} \geq \textbf{0},\ \textbf{\textit{x}}\textbf{\textit{e}} =1\bigr\}\\[2pt] &=\bigl\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}}= (\textbf{\textit{y}}\textrm{diag}(\textbf{\textit{H}}_+(N)\textbf{\textit{e}})\ \ \textbf{0}) \overline{\textbf{\textit{H}}}(N),\ \textbf{\textit{y}} \geq \textbf{0},\ \textbf{\textit{x}}\textbf{\textit{e}} =1\bigr\}\\[2pt] &=\bigl\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ 
\textbf{\textit{x}}= \textbf{\textit{y}}\textrm{diag}(\textbf{\textit{H}}_+(N)\textbf{\textit{e}}) \overline{\textbf{\textit{H}}}_+(N),\ \textbf{\textit{y}} \geq \textbf{0},\ \textbf{\textit{x}}\textbf{\textit{e}} =1\bigr\}\\[2pt] & =\bigl\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}}= \boldsymbol{\alpha} \overline{\textbf{\textit{H}}}_+(N),\boldsymbol{\alpha} \geq \textbf{0},\ \boldsymbol{\alpha}\textbf{\textit{e}} =1\bigr\},\end{align*}

where the last equality follows from the same reasoning as in (2.11).

Appendix C. Proof of Lemma 2.2

Since $\mathcal{A}_{\textrm{L}}(N)\subseteq\mathcal{A}(N)$ , we have $\mathcal{T}_{\textrm{L}}(N) \subseteq\mathcal{T}(N)$ . We thus show $\mathcal{T}(N) \subseteq \mathcal{P}(N)$ and $\mathcal{P}(N) \subseteq\mathcal{T}_{\textrm{L}}(N)$ , from which the lemma follows.

We first prove $\mathcal{T}(N) \subseteq \mathcal{P}(N)$ . Suppose $\textbf{\textit{x}}\in\mathcal{T}(N)$ ; i.e., $\textbf{\textit{x}}[\textbf{\textit{Q}}^{(1,1)}(N)+ \textbf{\textit{Q}}_{\textrm{A}}(N)] =\textbf{0}$ for some $\textbf{\textit{Q}}_{\textrm{A}}(N) \in \mathcal{A}(N)$ . Post-multiplying both sides of this equation by $\textbf{\textit{H}}(N)=(\!-\textbf{\textit{Q}}^{(1,1)}(N))^{-1}$ and rearranging terms, we obtain

\begin{equation*}\textbf{\textit{x}}=\textbf{\textit{x}} \textbf{\textit{Q}}_{\textrm{A}}(N) \textbf{\textit{H}}(N)=\boldsymbol{\alpha}\overline{\textbf{\textit{H}}}(N),\end{equation*}

where $\boldsymbol{\alpha} = \textbf{\textit{x}} \textbf{\textit{Q}}_{\textrm{A}}(N) \textrm{diag}(\textbf{\textit{H}}(N)\textbf{\textit{e}})$ . Note here that $\boldsymbol{\alpha} \in \Gamma(N)$ because $\boldsymbol{\alpha} \geq\textbf{0}$ , $\textbf{\textit{x}} \textbf{\textit{e}} = 1$ , and $\textbf{\textit{x}} \textbf{\textit{e}} = \boldsymbol{\alpha}\overline{\textbf{\textit{H}}}(N) \textbf{\textit{e}} = \boldsymbol{\alpha} \textbf{\textit{e}}$ . We thus obtain $\textbf{\textit{x}} \in \mathcal{P}(N)$ from (2.8), so that $\mathcal{T}(N)\subseteq \mathcal{P}(N)$ .

Next, we prove $\mathcal{P}(N) \subseteq \mathcal{T}_{\textrm{L}}(N)$ . For an arbitrarily fixed $\textbf{\textit{x}} \in \mathcal{P}(N)$ , let $\textbf{\textit{y}} = \textbf{\textit{x}}(\!-\textbf{\textit{Q}}^{(1,1)}(N)) \geq \textbf{0}$ (cf. (2.7)). Since $\textbf{\textit{x}}\textbf{\textit{e}} = 1$ and $\textbf{\textit{Q}}^{(1,1)}(N)$ is nonsingular, we have $\textbf{\textit{y}} \neq \textbf{0}$ . We then consider a linear augmentation matrix $\textbf{\textit{Q}}_{\textrm{A}}^{\dag}(N)$ given by

\begin{equation*}\textbf{\textit{Q}}_{\textrm{A}}^{\dag}(N)=(\!-\textbf{\textit{Q}}^{(1,1)}(N))\textbf{\textit{e}} \cdot \frac{\textbf{\textit{y}}}{\textbf{\textit{y}}\textbf{\textit{e}}}=(\!-\textbf{\textit{Q}}^{(1,1)}(N))\textbf{\textit{e}}\cdot\frac{\textbf{\textit{y}}}{\textbf{\textit{x}}(\!-\textbf{\textit{Q}}^{(1,1)}(N))\textbf{\textit{e}}}.\end{equation*}

It is clear that $\textbf{\textit{Q}}_{\textrm{A}}^{\dag}(N) \in \mathcal{A}_{\textrm{L}}(N)$ and

\begin{equation*}\textbf{\textit{x}} [ \textbf{\textit{Q}}^{(1,1)}(N) + \textbf{\textit{Q}}_{\textrm{A}}^{\dag}(N) ]=- \textbf{\textit{y}} + \textbf{\textit{x}} \cdot\frac{(\!-\textbf{\textit{Q}}^{(1,1)}(N))\textbf{\textit{e}}}{\textbf{\textit{x}}(\!-\textbf{\textit{Q}}^{(1,1)}(N))\textbf{\textit{e}}}\cdot \textbf{\textit{y}} = \textbf{0},\end{equation*}

which implies $\textbf{\textit{x}} \in \mathcal{T}_{\textrm{L}}(N)$ , so that $\mathcal{P}(N) \subseteq \mathcal{T}_{\textrm{L}}(N)$ .
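The construction of $\textbf{\textit{Q}}_{\textrm{A}}^{\dag}(N)$ above is easy to verify numerically. The sketch below (assuming NumPy; the $3 \times 3$ block is a hypothetical stand-in for $\textbf{\textit{Q}}^{(1,1)}(N)$) takes an $\textbf{\textit{x}} \in \mathcal{P}(N)$, forms the rank-one linear augmentation, and checks both that the augmented matrix has zero row sums and that $\textbf{\textit{x}}$ is stationary for it:

```python
import numpy as np

# Hypothetical truncated block Q^{(1,1)}(N): every row has positive escape
# rate, so -Q^{(1,1)}(N) is nonsingular and H(N) >= O.
Q11 = np.array([[-2.0, 1.0, 0.5],
                [0.3, -1.5, 0.7],
                [0.2, 0.6, -1.8]])
H = np.linalg.inv(-Q11)

# Any x in P(N) works; take a normalized row of H(N).
x = H[1] / H[1].sum()

y = x @ (-Q11)                                    # y >= 0, y != 0
QA = np.outer((-Q11) @ np.ones(3), y / y.sum())   # Q_A^dag(N), rank one

# Q_A^dag(N) redistributes the escaped mass: zero row sums overall.
assert np.allclose((Q11 + QA).sum(axis=1), 0.0)
print(np.allclose(x @ (Q11 + QA), 0.0))  # True: x is stationary
```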

Appendix D. Proof of Lemma 2.3

If $\mathcal{J}(N) = \mathbb{Z}_0^N$ , Lemma 2.3 immediately follows from Lemma 2.2. We thus assume $\mathcal{J}(N)\neq \mathbb{Z}_0^N$ . Implication 2.2 with $\boldsymbol{\zeta} \in \Gamma^+(N)$ implies $\mathcal{T}_{\textrm{L}}^+(N) \subseteq\mathcal{P}^+(N)$ . We thus show $\mathcal{P}^+(N) \subseteq \mathcal{T}_{\textrm{L}}^+(N)$ below. For an arbitrarily fixed $\textbf{\textit{x}} \in \mathcal{P}^+(N)$ , let $\textbf{\textit{y}} = \textbf{\textit{x}}(\!-\textbf{\textit{Q}}^{(1,1)}(N))$ . Since $\textbf{\textit{x}}\textbf{\textit{e}}=1$ and $\textbf{\textit{Q}}^{(1,1)}(N)$ is nonsingular, $\textbf{\textit{y}} \neq \textbf{0}$ . Furthermore, by definition, $\textbf{\textit{x}}(\!-\textbf{\textit{Q}}_+^{(1,1)}(N)) \geq \textbf{0}$ and $\textbf{\textit{x}}(\!-\textbf{\textit{Q}}_0^{(1,1)}(N)) = \textbf{0}$ , where $\textbf{\textit{Q}}_+^{(1,1)}(N)$ and $\textbf{\textit{Q}}_0^{(1,1)}(N)$ are given in (2.18). We then partition $\textbf{\textit{y}}$ into two parts:

\begin{equation*}\textbf{\textit{y}} = \kbordermatrix{&\mathcal{J}(N) &\quad \mathbb{Z}_0^N \setminus \mathcal{J}(N)\\& \textbf{\textit{y}}_+ &\quad \textbf{0}},\end{equation*}

where $\textbf{\textit{y}}_+ \geq \textbf{0}$ and $\textbf{\textit{y}}_+ \textbf{\textit{e}} > 0$ . We now consider a linear augmentation matrix $\textbf{\textit{Q}}_{\textrm{A}}^{\dag}(N)$ given by

\begin{equation*}\textbf{\textit{Q}}_{\textrm{A}}^{\dag}(N)=(\!-\textbf{\textit{Q}}^{(1,1)}(N))\textbf{\textit{e}} \cdot \frac{\textbf{\textit{y}}}{\textbf{\textit{y}}\textbf{\textit{e}}}=(\!-\textbf{\textit{Q}}^{(1,1)}(N))\textbf{\textit{e}}\cdot\frac{(\textbf{\textit{y}}_+\ \ \textbf{0})}{\textbf{\textit{x}} (\!-\textbf{\textit{Q}}^{(1,1)}(N))\textbf{\textit{e}}}.\end{equation*}

It is clear that

\begin{equation*}\frac{(\textbf{\textit{y}}_+\ \ \textbf{0})}{\textbf{\textit{x}} (\!-\textbf{\textit{Q}}^{(1,1)}(N))\textbf{\textit{e}}}\in \Gamma^+(N),\end{equation*}

and

\begin{equation*}\textbf{\textit{x}} [ \textbf{\textit{Q}}^{(1,1)}(N) + \textbf{\textit{Q}}_{\textrm{A}}^{\dag}(N) ]=- \textbf{\textit{y}} + \textbf{\textit{x}} \cdot\frac{(\!-\textbf{\textit{Q}}^{(1,1)}(N))\textbf{\textit{e}}}{\textbf{\textit{x}}(\!-\textbf{\textit{Q}}^{(1,1)}(N))\textbf{\textit{e}}}\cdot \textbf{\textit{y}} = \textbf{0},\end{equation*}

which implies $\textbf{\textit{x}} \in \mathcal{T}_{\textrm{L}}^+(N)$ , so that $\mathcal{P}^+(N) \subseteq \mathcal{T}_{\textrm{L}}^+(N)$ .

Appendix E. Proof of Corollary 3.1

We first prove (3.16) by showing that $\boldsymbol{\beta} \in \Gamma(N) \Rightarrow \boldsymbol{\alpha}\in \Gamma(N)$ in (3.14), i.e., that

\begin{equation*}\boldsymbol{\beta} \geq \textbf{0},\ \boldsymbol{\beta}\textbf{\textit{e}} = 1\quad \Rightarrow\quad \boldsymbol{\alpha} \geq \textbf{0},\ \boldsymbol{\alpha}\textbf{\textit{e}} = \boldsymbol{\beta}\textbf{\textit{e}}\ (= 1).\end{equation*}

For this purpose, we consider (3.5). By definition, we have for $i,j \in \mathbb{Z}_0^N$ ,

\begin{equation*}[\textbf{\textit{R}}(K,N)]_{i,j}=\mathbb{P}(X(T_{n+1}^{(1)}) = j,\ X(t) \in \mathbb{Z}_{N+1}^K\, (T_n^{(2)} \leq t < T_{n+1}^{(1)})\mid X(T_n^{(1)}) = i),\end{equation*}

and therefore $\textbf{\textit{R}}(K,N)$ is substochastic; i.e.,

\begin{equation*}\textbf{\textit{R}}(K,N) \geq \textbf{\textit{O}},\qquad\textbf{\textit{R}}(K,N)\textbf{\textit{e}} \leq \textbf{\textit{e}},\qquad\textbf{\textit{R}}(K,N) \textbf{\textit{e}} \neq \textbf{\textit{e}}.\end{equation*}

It follows that

\begin{equation}(\textbf{\textit{I}}-\textbf{\textit{R}}(K,N))^{-1}=\sum_{n=0}^{\infty}\big(\textbf{\textit{R}}(K,N)\big)^{n}\geq \textbf{\textit{O}}.\nonumber\end{equation}

Recall that $\textbf{\textit{H}}(N) \geq \textbf{\textit{O}}$ and $\widetilde{\textbf{\textit{H}}}(K,N) \geq\textbf{\textit{O}}$ . We thus conclude $\boldsymbol{\beta} \geq\textbf{0}\ \Rightarrow\ \boldsymbol{\alpha} \geq \textbf{0}$ from (3.14). Furthermore, we have

\begin{align*}\boldsymbol{\alpha}\textbf{\textit{e}}&=\boldsymbol{\beta} \textrm{diag}^{-1}(\widetilde{\textbf{\textit{H}}}(K,N)\textbf{\textit{e}})(\textbf{\textit{I}} - \textbf{\textit{R}}(K,N))^{-1} \textbf{\textit{H}}(N)\textbf{\textit{e}}\\&=\boldsymbol{\beta} \textrm{diag}^{-1}(\widetilde{\textbf{\textit{H}}}(K,N)\textbf{\textit{e}})\widetilde{\textbf{\textit{H}}}(K,N) \textbf{\textit{e}}\\&=\boldsymbol{\beta}\textbf{\textit{e}}.\end{align*}

Next we show (3.17). For this purpose, we consider the censored process on $\mathbb{Z}_0^N \cup \mathbb{Z}_{K_1+1}^{\infty}$ whose generator is given by $\widetilde{\textbf{\textit{Q}}}(K_1,N)$ . We then apply Theorem 2.1 with $N = K = K_1$ and obtain $\widetilde{\Gamma}(K_1,N)$ . Moreover, we apply Theorem 3.1 to the above censored process on $\mathbb{Z}_0^N \cup\mathbb{Z}_{K_1+1}^{\infty}$ with $K=K_2$ and obtain $\widetilde{\Gamma}(K_2,N)$ . Now (3.17) follows from (3.16), with $\widetilde{\Gamma}(K_1,N)$ and $\widetilde{\Gamma}(K_2,N)$ corresponding to $\Gamma(N)$ and $\widetilde{\Gamma}(K,N)$ in (3.16).
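The nonnegativity of $(\textbf{\textit{I}}-\textbf{\textit{R}}(K,N))^{-1}$ used in this proof rests on the Neumann series of a substochastic matrix. A minimal sketch (assuming NumPy; $\textbf{\textit{R}}$ is a hypothetical substochastic stand-in) shows the partial sums converging to the inverse, which is entrywise nonnegative:

```python
import numpy as np

# Hypothetical substochastic matrix R: nonnegative, row sums < 1.
R = np.array([[0.1, 0.3, 0.2],
              [0.4, 0.0, 0.5],
              [0.2, 0.2, 0.1]])
assert np.all(R >= 0) and np.all(R.sum(axis=1) < 1)

# Partial sums of the Neumann series sum_{n>=0} R^n converge to (I - R)^{-1};
# every partial sum, and hence the limit, is entrywise nonnegative.
S, term = np.eye(3), np.eye(3)
for _ in range(200):
    term = term @ R
    S += term

inv = np.linalg.inv(np.eye(3) - R)
print(np.allclose(S, inv), bool(np.all(inv >= 0)))  # True True
```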

Appendix F. Proof of Corollary 3.2

We first consider (3.29). If $\mathcal{J}(N) = \mathbb{Z}_0^N$ , (3.29) is immediate because

\begin{equation*}\widetilde{\Gamma}^+(K,N)\subseteq\widetilde{\Gamma}(K,N)\subseteq\Gamma(N)=\Gamma^+(N).\end{equation*}

We thus assume $\mathcal{J}(N)\neq\mathbb{Z}_0^N$ . Noting (3.20), we partition $\mathbb{Z}_0^N$ into the three subsets $\widetilde{\mathcal{J}}(K,N)$ , $\mathcal{J}(N)\setminus\widetilde{\mathcal{J}}(K,N)$ , and $\mathbb{Z}_0^N\setminus\mathcal{J}(N)$ :

(F.1) \begin{equation}\textbf{\textit{Q}}^{(2,1)}(K,N)=\bordermatrix{& {\scriptstyle \widetilde{\mathcal{J}}(K,N)}& {\scriptstyle \mathcal{J}(N) \setminus \widetilde{\mathcal{J}}(K,N)}& {\scriptstyle \mathbb{Z}_0^N\setminus\mathcal{J}(N)} \cr& \textbf{\textit{Q}}_+^{(2,1)}(K,N) & \textbf{\textit{Q}}_{++}^{(2,1)}(K,N)& \textbf{\textit{O}}}.\end{equation}

Note here that $\mathcal{J}(N)\setminus\widetilde{\mathcal{J}}(K,N)$ may be empty; if this is the case, we simply ignore the corresponding terms. It follows from (3.5) that

(F.2) \begin{align}(\textbf{\textit{I}}-\textbf{\textit{R}}(K,N))^{-1}= \textbf{\textit{I}} + (\textbf{\textit{I}}-\textbf{\textit{R}}(K,N))^{-1}\textbf{\textit{R}}(K,N)= \textbf{\textit{I}} + \textbf{\textit{B}} \textbf{\textit{Q}}^{(2,1)}(K,N),\end{align}

where

\begin{equation*}\textbf{\textit{B}}=(\textbf{\textit{I}}-\textbf{\textit{R}}(K,N))^{-1}(\!-\textbf{\textit{Q}}^{(1,1)}(K,N))^{-1}\textbf{\textit{Q}}^{(1,2)}(K,N)(\!-\textbf{\textit{Q}}^{(2,2)}(K,N))^{-1} \geq \textbf{\textit{O}}.\end{equation*}

Suppose $\boldsymbol{\alpha} \in \widetilde{\Gamma}^+(K,N)$ , i.e., there exists a nonnegative vector $\boldsymbol{\beta}$ such that

\begin{equation*}\boldsymbol{\beta} =\kbordermatrix{& \widetilde{\mathcal{J}}(K,N) &\quad \mathcal{J}(N) \setminus \widetilde{\mathcal{J}}(K,N) &\quad \mathbb{Z}_0^N\setminus\mathcal{J}(N)\\ & \boldsymbol{\beta}_1 &\quad \textbf{0} &\quad \textbf{0}},\quad\boldsymbol{\beta}_1 \geq \textbf{0},\quad\boldsymbol{\beta}_1 \textbf{\textit{e}} = 1,\end{equation*}

and

\begin{align*}\boldsymbol{\alpha}&= \kbordermatrix{& \widetilde{\mathcal{J}}(K,N)&\quad \mathcal{J}(N) \setminus \widetilde{\mathcal{J}}(K,N)&\quad \mathbb{Z}_0^N\setminus\mathcal{J}(N)\\[3pt] & \boldsymbol{\alpha}_1 &\quad \boldsymbol{\alpha}_2 &\quad \boldsymbol{\alpha}_3}\\[3pt] &= (\boldsymbol{\beta}_1\ \ \textbf{0}\ \ \textbf{0})\textrm{diag}^{-1}(\widetilde{\textbf{\textit{H}}}(K,N)\textbf{\textit{e}})(\textbf{\textit{I}} - \textbf{\textit{R}}(K,N))^{-1} \textrm{diag}(\textbf{\textit{H}}(N)\textbf{\textit{e}}).\end{align*}

It follows from (3.16) and (3.27) that $\widetilde{\Gamma}^+(K,N) \subseteq \Gamma(N)$ , so that $\boldsymbol{\alpha}\textbf{\textit{e}}=1$ and $\boldsymbol{\alpha} \geq \textbf{0}$ . Moreover, using (F.1) and (F.2), we can rewrite $\boldsymbol{\alpha}$ as

\begin{align*}\boldsymbol{\alpha}&= (\boldsymbol{\beta}_1\ \ \textbf{0}\ \ \textbf{0})\textrm{diag}^{-1}(\widetilde{\textbf{\textit{H}}}(K,N)\textbf{\textit{e}})\textrm{diag}(\textbf{\textit{H}}(N)\textbf{\textit{e}})\\[3pt] & +\boldsymbol{\beta}\textrm{diag}^{-1}(\widetilde{\textbf{\textit{H}}}(K,N)\textbf{\textit{e}})\textbf{\textit{B}}\bigl(\textbf{\textit{Q}}_+^{(2,1)}(K,N)\ \ \textbf{\textit{Q}}_{++}^{(2,1)}(K,N)\ \ \textbf{\textit{O}}\bigr)\textrm{diag}(\textbf{\textit{H}}(N)\textbf{\textit{e}}),\end{align*}

and therefore $\boldsymbol{\alpha}_3=\textbf{0}$ , which completes the proof of (3.29).

We omit the proof of (3.30) because it is almost the same as the proof of (3.17) in Corollary 3.1, except that the former uses Theorem 2.2, Theorem 3.2, and (3.29) where the latter uses Theorem 2.1, Theorem 3.1, and (3.16), respectively.

Appendix G. Proof of Theorem 4.1

Without loss of generality, we assume $\mathbb{Z}_0^N \cup \mathcal{X}^* = \mathbb{Z}_0^M$ , where $M = |\mathbb{Z}_0^N \cup \mathcal{X}^*|-1$ . It then follows from Theorem 3.1 with $N\coloneqq M$ that

\begin{equation*}\boldsymbol{\pi}(M) \in \widetilde{\mathcal{P}}(K,M),\end{equation*}

where from (3.11) and (3.15),

\begin{align*}\widetilde{\mathcal{P}}(K,M)=\widehat{\mathcal{P}}(K,M)&=\big\{\textbf{\textit{x}} \in \mathbb{R}^{M+1};\ \textbf{\textit{x}}=\boldsymbol{\beta}_M \overline{\widetilde{\textbf{\textit{H}}}}(K,M),\boldsymbol{\beta}_M \in \Gamma(M)\big\}.\end{align*}

Note here that if $X(0) \in \mathbb{Z}_{K+1}^{\infty}$ , the first passage time to $\mathbb{Z}_0^M$ ends on $\mathcal{X}^*$ . Therefore, following a discussion similar to the proof of Theorem 3.2, we obtain

(G.1) \begin{equation}\boldsymbol{\pi}(M) \in \widehat{\mathcal{P}}^+(K,M;\, \mathcal{X}^*)\quad \mbox{ for }\quad\mathcal{X} \in \mathcal{S}(K,N),\end{equation}

where

(G.2) \begin{align}\widehat{\mathcal{P}}^+(K,M;\, \mathcal{X}^*)&=\big\{\textbf{\textit{x}} \in \mathbb{R}^{M+1};\ \textbf{\textit{x}}=\boldsymbol{\beta}_M \overline{\widetilde{\textbf{\textit{H}}}}(K,M),\ \nonumber\\& \boldsymbol{\beta}_M \in \Gamma(M),[\boldsymbol{\beta}_M]_i = 0\ (i \in \mathbb{Z}_0^M \setminus \mathcal{X}^*)\big\}.\end{align}

For a proper $\mathcal{X}^*$ , we have $\mathcal{X}^* \cap \mathcal{N}_0(K,N) = \emptyset$ ; i.e., the probability that the first passage time from $\mathbb{Z}_{K+1}^{\infty}$ to $\mathbb{Z}_0^M$ ends on state i is strictly positive for all $i \in \mathcal{X}^*$ . We thus have $\mathcal{X}^* = \widetilde{\mathcal{J}}(K,M)$ for $\mathcal{X}^* \in \mathcal{S}^*(K,N)$ , and therefore from Theorem 3.2,

\begin{equation*}\boldsymbol{\pi}(M) \in \textrm{ri}\, \widetilde{\mathcal{P}}^+(K,M;\, \mathcal{X}^*)\quad \mbox{ for }\quad\mathcal{X}^* \in \mathcal{S}^*(K,N).\end{equation*}

Since $\mathbb{Z}_0^N \cup \mathcal{X}^* = \mathbb{Z}_0^M$ , $\boldsymbol{\pi}(N)$ can be expressed in terms of $\boldsymbol{\pi}(M)$ . Specifically, we first partition $\boldsymbol{\pi}(M)$ and $\widetilde{\textbf{\textit{H}}}(K,M)$ as follows:

\begin{align*}\boldsymbol{\pi}(M)&=\kbordermatrix{&\mathbb{Z}_0^N &\quad \mathbb{Z}_{N+1}^{M}\\& \boldsymbol{\pi}^{(1)}(M;\, N) &\quad \boldsymbol{\pi}^{(2)}(M;\, N)},\\ \widetilde{\textbf{\textit{H}}}(K,M)&=\kbordermatrix{&\mathbb{Z}_0^N &\quad \mathbb{Z}_{N+1}^{M}\\& \widetilde{\textbf{\textit{H}}}^{(*,1)}(K,M;\, N)&\quad \widetilde{\textbf{\textit{H}}}^{(*,2)}(K,M;\, N)}.\end{align*}

It then follows from (G.1) and (G.2) that $\boldsymbol{\pi}^{(1)}(M;\, N) \in\widehat{\mathcal{P}}_M^{+(1)}(K,M;\, \mathcal{X}^*)$ , where

(G.3) \begin{align}\widehat{\mathcal{P}}_M^{+(1)}(K,M;\, \mathcal{X}^*)&=\big\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}}=\boldsymbol{\beta}_M \textrm{diag}^{-1}(\widetilde{\textbf{\textit{H}}}(K,M)\textbf{\textit{e}})\widetilde{\textbf{\textit{H}}}^{(*,1)}(K,M;\, N),\ \nonumber\\&\quad \boldsymbol{\beta}_M \in \Gamma(M),\ [\boldsymbol{\beta}_M]_i = 0\ (i \in \mathbb{Z}_0^M \setminus \mathcal{X}^*)\big\}.\end{align}

By definition, $\boldsymbol{\pi}(N) = \boldsymbol{\pi}^{(1)}(M;\, N) /\boldsymbol{\pi}^{(1)}(M;\, N) \textbf{\textit{e}}$ . It follows from (G.3) that $\boldsymbol{\pi}^{(1)}(M;\, N)$ is given by a weighted sum of row vectors of $\widetilde{\textbf{\textit{H}}}^{(*,1)}(K,M;\,N)$ . On the other hand, it follows from Remark 3.1 that if $\mathbb{Z}_0^N \cup \mathcal{X}^* = \mathbb{Z}_0^M$ , then $\widetilde{\textbf{\textit{H}}}(K,M)$ is identical to the $(M+1) \times(M+1)$ northwest corner block of $\textbf{\textit{H}}(K)$ . Therefore, the row vectors corresponding to each state $i \in \mathcal{X}^*$ in $\widetilde{\textbf{\textit{H}}}^{(*,1)}(K,M;\, N)$ and in $\textbf{\textit{H}}^{(*,1)}(K;\, N)$ are identical. Hence, replacing $\widetilde{\textbf{\textit{H}}}^{(*,1)}(K,M;\,N)$ by $\textbf{\textit{H}}^{(*,1)}(K;\, N)$ and normalizing $\boldsymbol{\pi}^{(1)}(M;\,N)$ , we obtain

\begin{equation*}\boldsymbol{\pi}(N) \in \widehat{\mathcal{P}}^+(K,N;\, \mathcal{X}^*)\ \mbox{ for }\ \mathcal{X} \in \mathcal{S}(K,N),\end{equation*}

where

(G.4) \begin{align}\widehat{\mathcal{P}}^+(K,N;\, \mathcal{X}^*)&=\big\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}}=\boldsymbol{\beta}\textrm{diag}^*(\textbf{\textit{H}}^{(*,1)}(K;\, N)\textbf{\textit{e}})\textbf{\textit{H}}^{(*,1)}(K;\, N),\nonumber\\& \quad\boldsymbol{\beta}\in\Gamma(K),\ [\boldsymbol{\beta}]_i = 0\ (i \in \mathbb{Z}_0^K \setminus \mathcal{X}^*)\big\}.\end{align}

In particular, for a proper $\mathcal{X}^*$ , we have

\begin{equation*}\boldsymbol{\pi}(N) \in \textrm{ri}\, \widehat{\mathcal{P}}^+(K,N;\, \mathcal{X}^*)\ \mbox{ for }\ \mathcal{X}^* \in \mathcal{S}^*(K,N).\end{equation*}

Note that $\widehat{\mathcal{P}}^+(K,N;\, \mathcal{X}^*)$ in (G.4) is a convex polytope on the first orthant of $\mathbb{R}^{N+1}$ .

The proof of (4.5) will be complete if we can show that

(G.5) \begin{equation}\widehat{\mathcal{P}}^+(K,N;\, \mathcal{X}^*) = \mathcal{P}^+(K,N;\, \mathcal{X}^*);\end{equation}

this will be done below. Note that $\textbf{\textit{H}}^{(*,1)}(K;\, N)$ is given in terms of $\widetilde{\textbf{\textit{H}}}(K,N)$ in (3.7):

(G.6) \begin{equation}\textbf{\textit{H}}^{(*,1)}(K;\, N)=\kbordermatrix{\\ \mathbb{Z}_0^N & \textbf{\textit{I}}\\ \mathbb{Z}_{N+1}^K &(\!-\textbf{\textit{Q}}^{(2,2)}(K,N))^{-1}\textbf{\textit{Q}}^{(2,1)}(K,N)}\widetilde{\textbf{\textit{H}}}(K,N).\end{equation}

It then follows from (3.12) and (G.6) that

\begin{align*}&\boldsymbol{\beta}\textrm{diag}^*(\textbf{\textit{H}}^{(*,1)}(K;\, N)\textbf{\textit{e}})\textbf{\textit{H}}^{(*,1)}(K;\, N)\quad \\&=\boldsymbol{\beta}\textrm{diag}^*(\textbf{\textit{H}}^{(*,1)}(K;\, N)\textbf{\textit{e}})\begin{pmatrix}\textbf{\textit{I}}\nonumber\\ (\!-\textbf{\textit{Q}}^{(2,2)}(K,N))^{-1}\textbf{\textit{Q}}^{(2,1)}(K,N)\end{pmatrix}\widetilde{\textbf{\textit{H}}}(K,N)\\&=\boldsymbol{\beta}\textbf{\textit{U}}(K,N)\overline{\textbf{\textit{H}}}(N),\end{align*}

so that $\widehat{\mathcal{P}}^+(K,N;\, \mathcal{X}^*)$ is identical to $\mathcal{P}^+(K,N;\,\mathcal{X}^*)$ in (4.6). The equivalence between (4.5) and (4.6) can be shown in the same way as in the proof of Theorem 2.2, and we omit it.

Appendix H. Proof of Lemma 4.1

If $\mathcal{X}_A^* = \mathcal{X}_B^*$ , Lemma 4.1 trivially holds. We thus assume $\mathcal{X}_A^* \neq \mathcal{X}_B^*$ . Note that for $\mathcal{X} \in \mathcal{S}(K,N)$ , $\mathcal{P}^+(K,N;\, \mathcal{X}^*)$ is equivalent to $\widehat{\mathcal{P}}^+(K,N;\,\mathcal{X}^*)$ in (G.4); i.e.,

(H.1) \begin{align}\widehat{\mathcal{P}}^+(K,N;\, \mathcal{X}^*)&=\big\{\textbf{\textit{x}} \in \mathbb{R}^{N+1};\ \textbf{\textit{x}}=\boldsymbol{\beta} \overline{\textbf{\textit{H}}}^{(*,1)}(K;\, N),\nonumber\\&\quad \boldsymbol{\beta}\in\Gamma(K),\ [\boldsymbol{\beta}]_i = 0\ (i \in \mathbb{Z}_0^K \setminus \mathcal{X}^*)\big\},\end{align}

where

\begin{equation*}\overline{\textbf{\textit{H}}}^{(*,1)}(K;\, N)=\textrm{diag}^*(\textbf{\textit{H}}^{(*,1)}(K;\, N)\textbf{\textit{e}})\textbf{\textit{H}}^{(*,1)}(K;\, N).\end{equation*}

Let $\textbf{\textit{h}}_j^{(*,1)}(K;\, N)$ and $\overline{\textbf{\textit{h}}}_j^{(*,1)}(K;\,N)$ ( $j \in \mathbb{Z}_0^K$ ) denote the jth row vectors of $\textbf{\textit{H}}^{(*,1)}(K;\, N)$ and $\overline{\textbf{\textit{H}}}^{(*,1)}(K;\, N)$ , respectively. Note that the lemma is proven if

(H.2) \begin{align}\overline{\textbf{\textit{h}}}_i^{(*,1)}(K;\, N) \in \widehat{\mathcal{P}}^+(K,N;\, \mathcal{X}_B^*),\quad i \in \mathcal{X}_A^*.\end{align}

It is clear that (H.2) holds for $i \in \mathcal{X}_A^* \cap\mathcal{X}_B^*$ . We thus show (H.2) for $i \in \mathcal{X}_A^*\setminus \mathcal{X}_B^*$ below.

For a fixed $i \in \mathcal{X}_A^* \setminus \mathcal{X}_B^*$ , we consider a censored Markov chain $\{\widetilde{X}(t)\}_{t \in (\!-\infty,\infty)}$ by observing $\{X(t)\}_{t \in (\!-\infty,\infty)}$ only when $X(t) \in\mathbb{Z}_0^N \cup \mathcal{X}_B^* \cup \{i\} \cup \mathbb{Z}_{K+1}^{\infty}$ . The infinitesimal generator $\widetilde{\textbf{\textit{Q}}}\coloneqq \widetilde{\textbf{\textit{Q}}}(K,N,\mathcal{X}_B^*,i)$ of the censored Markov chain $\{\widetilde{X}(t)\}_{t \in (\!-\infty,\infty)}$ takes the following form:

\begin{equation*}\widetilde{\textbf{\textit{Q}}}=\kbordermatrix{&\mathbb{Z}_0^N \cup \mathcal{X}_B^* \cup \{i\} &\quad \mathbb{Z}_{K+1}^{\infty}\\ \\[-9pt] \mathbb{Z}_0^N \cup \mathcal{X}_B^* \cup \{i\}\quad &\widetilde{\textbf{\textit{Q}}}^{(1,1)} &\quad \widetilde{\textbf{\textit{Q}}}^{(1,2)} \\ \\[-9pt] \mathbb{Z}_{K+1}^{\infty} & \widetilde{\textbf{\textit{Q}}}^{(2,1)} &\quad \widetilde{\textbf{\textit{Q}}}^{(2,2)}},\end{equation*}

where, with $\mathbb{Z}_B=\mathbb{Z}_0^N \cup \mathcal{X}_B^*$ ,

\begin{equation*}\widetilde{\textbf{\textit{Q}}}^{(1,1)}=\kbordermatrix{& \mathbb{Z}_B \quad& \{i\}\quad \\ \mathbb{Z}_B &\widetilde{\textbf{\textit{Q}}}_{\mathbb{Z}_B,\mathbb{Z}_B}^{(1,1)} &\quad \widetilde{\textbf{\textit{q}}}_{\mathbb{Z}_B,i}^{(1,1)}\\ \\[-9pt] \{i\} &\widetilde{\textbf{\textit{q}}}_{i,\mathbb{Z}_B}^{(1,1)} &\quad -\widetilde{q}_{i,i}^{(1,1)}}.\end{equation*}

Note here that

(H.3) \begin{equation}\widetilde{\textbf{\textit{q}}}_{i,\mathbb{Z}_B}^{(1,1)}=\kbordermatrix{&\mathbb{Z}_0^N \setminus \mathcal{X}_B^* &\quad \mathcal{X}_B^*\\ \\[-9pt] & \textbf{0} &\quad \widetilde{\textbf{\textit{q}}}_{i,\mathbb{Z}_B,+}^{(1,1)}},\end{equation}

since $i \in \mathcal{X}_A^* \setminus \mathcal{X}_B^*$ and (4.7) holds sample-path-wise. We define $\widetilde{\textbf{\textit{H}}}\coloneqq \widetilde{\textbf{\textit{H}}}(K,N,\mathcal{X}_B^*,i)$ as

\begin{align*}\widetilde{\textbf{\textit{H}}}=\big(\!- \widetilde{\textbf{\textit{Q}}}^{(1,1)} \big)^{-1}&=\kbordermatrix{& \mathbb{Z}_B & \{i\}\\ \mathbb{Z}_B &\quad \widetilde{\textbf{\textit{H}}}_{\mathbb{Z}_B,\mathbb{Z}_B} &\quad \widetilde{\textbf{\textit{h}}}_{\mathbb{Z}_B,i}\\ \\[-9pt]\{i\} & \widetilde{\textbf{\textit{h}}}_{i,\mathbb{Z}_B} &\quad \widetilde{h}_{i,i}}\\ \\[-9pt] &=\kbordermatrix{& \mathbb{Z}_B & \{i\}\\ \\[-9pt] \mathbb{Z}_B & \widetilde{\textbf{\textit{H}}}_{\mathbb{Z}_B,\mathbb{Z}_B} & \widetilde{\textbf{\textit{h}}}_{\mathbb{Z}_B,i}\\ \\[-9pt] \{i\} & (\widetilde{q}_{i,i}^{(1,1)})^{-1}\widetilde{\textbf{\textit{q}}}_{i,\mathbb{Z}_B}^{(1,1)} \widetilde{\textbf{\textit{H}}}_{\mathbb{Z}_B,\mathbb{Z}_B}&\quad \widetilde{h}_{i,i}}.\end{align*}
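The expression for the bottom-left block of $\widetilde{\textbf{\textit{H}}}$ above follows from the row of $(\!-\widetilde{\textbf{\textit{Q}}}^{(1,1)})\widetilde{\textbf{\textit{H}}} = \textbf{\textit{I}}$ corresponding to state i, and it can be checked numerically on any finite subgenerator. The following NumPy sketch is purely illustrative (the size and rates are arbitrary, not taken from the paper): it builds a random subgenerator, treats the last state as $\{i\}$ and the remaining states as $\mathbb{Z}_B$, and verifies $\widetilde{\textbf{\textit{h}}}_{i,\mathbb{Z}_B} = (\widetilde{q}_{i,i}^{(1,1)})^{-1}\widetilde{\textbf{\textit{q}}}_{i,\mathbb{Z}_B}^{(1,1)} \widetilde{\textbf{\textit{H}}}_{\mathbb{Z}_B,\mathbb{Z}_B}$.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 6                                   # hypothetical size of the censored set

# Subgenerator Q11: positive off-diagonal rates, diagonal = -(row sum) - leak,
# so that -Q11 is a nonsingular M-matrix and H = (-Q11)^{-1} exists.
Q11 = rng.uniform(0.0, 1.0, (n, n))
np.fill_diagonal(Q11, 0.0)
leak = rng.uniform(0.1, 1.0, n)         # rates out of the censored set
np.fill_diagonal(Q11, -(Q11.sum(axis=1) + leak))

H = np.linalg.inv(-Q11)                 # H-tilde = (-Q-tilde^{(1,1)})^{-1}

# Partition: Z_B = states 0..n-2, {i} = last state.
H_BB = H[:-1, :-1]
h_iB = H[-1, :-1]
q_iB = Q11[-1, :-1]                     # q~_{i,Z_B}^{(1,1)}: rates from i into Z_B
q_ii = -Q11[-1, -1]                     # q~_{i,i}^{(1,1)} > 0

# Row identity from (-Q11) H = I:  h_{i,B} = q_ii^{-1} q_{i,B} H_BB.
assert np.allclose(h_iB, q_iB @ H_BB / q_ii)
```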

Note here that if we partition $\widetilde{\textbf{\textit{H}}}_{\mathbb{Z}_B,\mathbb{Z}_B}$ as

\begin{equation*}\widetilde{\textbf{\textit{H}}}_{\mathbb{Z}_B,\mathbb{Z}_B}=\kbordermatrix{& \mathbb{Z}_0^N &\quad \mathbb{Z}_B \setminus \mathbb{Z}_0^N\\ \\[-9pt] \mathbb{Z}_0^N \setminus \mathcal{X}_B^* &\widetilde{\textbf{\textit{H}}}_{\mathbb{Z}_B,\mathbb{Z}_B}^{(1,1)} &\quad \widetilde{\textbf{\textit{H}}}_{\mathbb{Z}_B,\mathbb{Z}_B}^{(1,2)}\\ \\[-7pt] \mathcal{X}_B^* &\widetilde{\textbf{\textit{H}}}_{\mathbb{Z}_B,\mathbb{Z}_B}^{(2,1)} &\quad \widetilde{\textbf{\textit{H}}}_{\mathbb{Z}_B,\mathbb{Z}_B}^{(2,2)}},\end{equation*}

the row vector of $\widetilde{\textbf{\textit{H}}}_{\mathbb{Z}_B,\mathbb{Z}_B}^{(2,1)}$ corresponding to $j \in \mathcal{X}_B^*$ is identical to $\textbf{\textit{h}}_j^{(*,1)}(K;\, N)$ . Furthermore, noting (H.3), we have

\begin{align*}\widetilde{\textbf{\textit{q}}}_{i,\mathbb{Z}_B}^{(1,1)} \widetilde{\textbf{\textit{H}}}_{\mathbb{Z}_B,\mathbb{Z}_B}&=\left(\widetilde{\textbf{\textit{q}}}_{i,\mathbb{Z}_B}^{(1,1)} \begin{pmatrix}\widetilde{\textbf{\textit{H}}}_{\mathbb{Z}_B,\mathbb{Z}_B}^{(1,1)} \\ \\[-9pt] \widetilde{\textbf{\textit{H}}}_{\mathbb{Z}_B,\mathbb{Z}_B}^{(2,1)}\end{pmatrix} \quad \widetilde{\textbf{\textit{q}}}_{i,\mathbb{Z}_B}^{(1,1)}\begin{pmatrix}\widetilde{\textbf{\textit{H}}}_{\mathbb{Z}_B,\mathbb{Z}_B}^{(1,2)} \\ \\[-9pt] \widetilde{\textbf{\textit{H}}}_{\mathbb{Z}_B,\mathbb{Z}_B}^{(2,2)}\end{pmatrix}\right)\\ \\[-8pt] &=\big(\widetilde{\textbf{\textit{q}}}_{i,\mathbb{Z}_B,+}^{(1,1)}\widetilde{\textbf{\textit{H}}}_{\mathbb{Z}_B,\mathbb{Z}_B}^{(2,1)}\quad \widetilde{\textbf{\textit{q}}}_{i,\mathbb{Z}_B,+}^{(1,1)}\widetilde{\textbf{\textit{H}}}_{\mathbb{Z}_B,\mathbb{Z}_B}^{(2,2)}\big),\end{align*}

and

\begin{equation*}\textbf{\textit{h}}_i^{(*,1)}(K;\, N) = (\widetilde{q}_{i,i}^{(1,1)})^{-1}\widetilde{\textbf{\textit{q}}}_{i,\mathbb{Z}_B,+}^{(1,1)} \widetilde{\textbf{\textit{H}}}_{\mathbb{Z}_B,\mathbb{Z}_B}^{(2,1)}.\end{equation*}

Therefore, $\textbf{\textit{h}}_i^{(*,1)}(K;\, N)$ ( $i \in \mathcal{X}_A^* \setminus\mathcal{X}_B^*$ ) is given by a linear combination of the $\textbf{\textit{h}}_j^{(*,1)}(K;\,N)$ ( $j \in \mathcal{X}_B^*$ ) with nonnegative weights $\alpha_{i,j}$ :

\begin{equation*}\textbf{\textit{h}}_i^{(*,1)}(K;\, N)=\sum_{j \in \mathcal{X}_B^*} \alpha_{i,j} \textbf{\textit{h}}_j^{(*,1)}(K;\, N).\end{equation*}

We thus conclude that $\overline{\textbf{\textit{h}}}_i^{(*,1)}(K;\, N)$ ( $i \in\mathcal{X}_A^* \setminus \mathcal{X}_B^*$ ) is given by a convex combination of $\overline{\textbf{\textit{h}}}_j^{(*,1)}(K;\, N)$ ( $j \in \mathcal{X}_B^*$ ); i.e.,

\begin{equation*}\overline{\textbf{\textit{h}}}_i^{(*,1)}(K;\, N)=\sum_{j \in \mathcal{X}_B^*} \beta_{i,j} \overline{\textbf{\textit{h}}}_j^{(*,1)}(K;\, N),\end{equation*}

where

\begin{equation*}\beta_{i,j}=\frac{\alpha_{i,j} \textbf{\textit{h}}_j^{(*,1)}(K;\, N) \textbf{\textit{e}}}{\textbf{\textit{h}}_i^{(*,1)}(K;\, N) \textbf{\textit{e}}},\quad j \in \mathcal{X}_B^*.\end{equation*}
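Indeed, summing these weights and using the nonnegative-combination representation of $\textbf{\textit{h}}_i^{(*,1)}(K;\, N)$ above yields

\begin{equation*}\sum_{j \in \mathcal{X}_B^*} \beta_{i,j}=\frac{\sum_{j \in \mathcal{X}_B^*} \alpha_{i,j} \textbf{\textit{h}}_j^{(*,1)}(K;\, N) \textbf{\textit{e}}}{\textbf{\textit{h}}_i^{(*,1)}(K;\, N) \textbf{\textit{e}}}=\frac{\textbf{\textit{h}}_i^{(*,1)}(K;\, N) \textbf{\textit{e}}}{\textbf{\textit{h}}_i^{(*,1)}(K;\, N) \textbf{\textit{e}}}=1.\end{equation*}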

Note that $\beta_{i,j} \geq 0$ and $\sum_{j \in \mathcal{X}_B^*} \beta_{i,j} =1$ , which completes the proof.

Appendix I. Proof of Lemma 4.2

We first prove that $\mathcal{D}(K,N)$ is an (s, t)-node cut in G(K, N), and then we show its minimality. To prove the former, we show that the graph $G(K,N;\, -\mathcal{D}(K,N))$ , which is obtained from G(K, N) by removing all nodes in $\mathcal{D}(K,N)$ , has no directed paths from s to t. To prove this, we assume that $G(K,N;\, -\mathcal{D}(K,N))$ has a directed path P from s to t and derive a contradiction. Since $\mathcal{J}(K)$ is the set of all neighboring nodes of s, P must contain a node $v \in\mathcal{J}(K) \setminus \mathcal{D}(K,N)$ , where we assume that if P contains more than one node in $\mathcal{J}(K) \setminus\mathcal{D}(K,N)$ , then v is the node closest to t. Because the directed path from v to t has no edges incoming to any nodes in $\mathcal{J}(K)$ , $G^*(K,N)$ must contain the subpath $P_{v,t}$ of P from v to t. This, however, contradicts the fact that $\mathcal{D}(K,N)$ contains all nodes in $\mathcal{J}(K)$ from which t is reachable in $G^*(K,N)$ .

Next we show the minimality of $\mathcal{D}(K,N)$ . For each node $v \in\mathcal{D}(K,N)$ , $G^*(K,N)$ contains a directed path $P_{v,t}$ from v that visits t without passing through any other nodes in $\mathcal{J}(K)$ . As a result, the edge $(s,v) \in E(K,N)$ and $P_{v,t}$ form a directed path from s to t in G(K, N), and therefore $v \in \mathcal{D}(K,N)$ must be included in any (s,t)-node cut $\mathcal{X} \subseteq \mathcal{J}(K)$ in G(K, N). We thus conclude that $\mathcal{D}(K,N) \subseteq \mathcal{X}$ for any $\mathcal{X}$ such that $\mathcal{X} \subseteq \mathcal{J}(K)$ and $\mathcal{X} \in S(K,N)$ , which completes the proof.
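The construction of $\mathcal{D}(K,N)$ used in this proof — collect every node of $\mathcal{J}(K)$ from which t is reachable in $G^*(K,N)$ — is straightforward to implement by a reverse reachability search from t. The following Python sketch assumes that $G^*(K,N)$ is obtained from G(K, N) by deleting all edges entering nodes of $\mathcal{J}(K)$ , as in the argument above; the function name and the toy graph are illustrative only.

```python
from collections import deque

def min_node_cut_in_J(adj, J, s, t):
    """D = {v in J : t is reachable from v in G*}, where G* is the graph adj
    with every edge entering a node of J removed (s itself is ignored).
    Per Lemma 4.2, D is the minimum (s, t)-node cut contained in J."""
    # Build G*: drop edges (u, w) with w in J, and drop the source s.
    star = {u: [w for w in nbrs if w not in J]
            for u, nbrs in adj.items() if u != s}
    # Reverse adjacency of G*, then BFS backwards from t.
    radj = {}
    for u, nbrs in star.items():
        for w in nbrs:
            radj.setdefault(w, []).append(u)
    seen, queue = {t}, deque([t])
    while queue:
        w = queue.popleft()
        for u in radj.get(w, []):
            if u not in seen:
                seen.add(u)
                queue.append(u)
    return {v for v in J if v in seen}

# Toy example: s -> {a, b}; a -> c -> t; b -> a.  Only a reaches t in G*,
# and removing node a indeed disconnects s from t.
adj = {'s': ['a', 'b'], 'a': ['c'], 'b': ['a'], 'c': ['t'], 't': []}
print(min_node_cut_in_J(adj, {'a', 'b'}, 's', 't'))   # -> {'a'}
```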

Appendix J. Proof of Theorem 4.2

Note that $\textbf{\textit{x}} \in \textrm{ri}\, \mathcal{P}^+(K,N;\, \mathcal{J}^*(K))$ from Lemma 4.3, and that $\mathcal{P}^+(K,N;\,\mathcal{J}^*(K))=\widehat{\mathcal{P}}^+(K,N;\, \mathcal{J}^*(K))$ , where $\widehat{\mathcal{P}}^+(K,N;\, \mathcal{J}^*(K))$ is given in (G.5). We thus have $\textbf{\textit{x}} \in\textrm{ri}\, \widehat{\mathcal{P}}^+(K,N;\, \mathcal{J}^*(K))$ ; therefore there exists $\boldsymbol{\beta} \in \Gamma(K)$ such that $[\boldsymbol{\beta}]_i > 0\ (i\in \mathcal{J}^*(K))$ , $[\boldsymbol{\beta}]_i = 0\ (i \in \mathbb{Z}_0^K \setminus\mathcal{J}^*(K))$ , and

\begin{equation*}\textbf{\textit{x}}=\boldsymbol{\beta} \textrm{diag}^*(\textbf{\textit{H}}^{(*,1)}(K;\, N)\textbf{\textit{e}}) \textbf{\textit{H}}^{(*,1)}(K;\, N).\end{equation*}

On the other hand, it follows from (4.8) that $\mathcal{J}(K) \setminus \mathcal{J}^*(K) = \mathcal{J}(K) \cap \mathcal{N}_0^{(3)}(K,N)$ . Therefore, we have from (4.3) that

\begin{equation*}\textbf{\textit{h}}_i^{(*,1)}(K;\, N) = \textbf{0},\quad i \in \mathcal{J}(K) \setminus \mathcal{J}^*(K),\end{equation*}

where $\textbf{\textit{h}}_i^{(*,1)}(K;\, N)$ ( $i \in \mathbb{Z}_0^K$ ) denotes the ith row vector of $\textbf{\textit{H}}^{(*,1)}(K;\, N)$ . Let $\boldsymbol{\gamma} \in\mathbb{R}^{K+1}$ be defined by

\begin{equation*}\boldsymbol{\gamma}=\boldsymbol{\beta} \textrm{diag}^*(\textbf{\textit{H}}^{(*,1)}(K;\, N)\textbf{\textit{e}})+\sum_{i \in \mathcal{J}(K) \setminus \mathcal{J}^*(K)} \textbf{\textit{e}}_i,\end{equation*}

where $\textbf{\textit{e}}_i \in \Gamma(K)$ ( $i \in \mathbb{Z}_0^K$ ) denotes the unit vector whose ith element is equal to one. Note here that

(J.1) \begin{align}[\boldsymbol{\gamma}]_i & > 0, \quad i \in \mathcal{J}(K),\end{align}
(J.2) \begin{align}[\boldsymbol{\gamma}]_i & = 0, \quad i \in \mathbb{Z}_0^K \setminus \mathcal{J}(K).\end{align}

We thus have

\begin{equation*}\boldsymbol{\gamma} \textbf{\textit{H}}^{(*,1)}(K;\, N)=\textbf{\textit{x}}+\sum_{i\in\mathcal{J}(K)\setminus\mathcal{J}^*(K)}\textbf{\textit{h}}_i^{(*,1)}(K;\, N)=\textbf{\textit{x}}.\end{equation*}

Let $\textbf{\textit{y}}=\boldsymbol{\gamma}\textbf{\textit{H}}^{(*,2)}(K;\, N)$ . We then have

\begin{equation*}(\textbf{\textit{x}}\ \textbf{\textit{y}}) (\!-\textbf{\textit{Q}}^{(1,1)}(K)) = \boldsymbol{\gamma}.\end{equation*}

It follows from (J.1) and (J.2) that

\begin{align*}[(\textbf{\textit{x}}\ \textbf{\textit{y}}) (\!-\textbf{\textit{Q}}^{(1,1)}(K))]_i &> 0,\quad i \in \mathcal{J}(K),\\ [(\textbf{\textit{x}}\ \textbf{\textit{y}}) (\!-\textbf{\textit{Q}}^{(1,1)}(K))]_i &= 0,\quad i \in \mathbb{Z}_0^K \setminus \mathcal{J}(K).\end{align*}

Furthermore, $(\textbf{\textit{x}}\ \textbf{\textit{y}}) \textbf{\textit{e}} = \boldsymbol{\gamma} \textbf{\textit{H}}(K)\textbf{\textit{e}} > 0$ since $\textbf{\textit{H}}(K)\textbf{\textit{e}} > \textbf{0}$ . We thus have $(\textbf{\textit{x}}\ \textbf{\textit{y}})/((\textbf{\textit{x}}\ \textbf{\textit{y}})\textbf{\textit{e}}) \in \textrm{ri}\, \mathcal{P}^+(K)$ , and therefore

\begin{equation*}(\textbf{\textit{x}}\ \textbf{\textit{y}}) > \textbf{0},\end{equation*}

from Lemma 2.1. We then define $\boldsymbol{\zeta}(\textbf{\textit{x}},\textbf{\textit{y}})$ as

\begin{equation*}\boldsymbol{\zeta}(\textbf{\textit{x}},\textbf{\textit{y}})=\frac{(\textbf{\textit{x}}\ \textbf{\textit{y}})(\!-\textbf{\textit{Q}}^{(1,1)}(K))}{(\textbf{\textit{x}}\ \textbf{\textit{y}})(\!-\textbf{\textit{Q}}^{(1,1)}(K))\textbf{\textit{e}}}=\frac{\boldsymbol{\gamma}}{\boldsymbol{\gamma}\textbf{\textit{e}}}.\end{equation*}

Note that $\boldsymbol{\zeta}(\textbf{\textit{x}},\textbf{\textit{y}}) \in \textrm{ri}\, \Gamma^+(K)$ , where

\begin{equation*}\Gamma^+(K)=\bigl\{\boldsymbol{\alpha} \in \mathbb{R}^{K+1};\ \boldsymbol{\alpha} \geq \textbf{0},\boldsymbol{\alpha}\textbf{\textit{e}}=1,\ [\boldsymbol{\alpha}]_i=0\ (i \in \mathbb{Z}_0^K\setminus\mathcal{J}(K))\bigr\}.\end{equation*}

We now consider a Markov chain whose infinitesimal generator $\textbf{\textit{Q}}$ is given by

(J.3) \begin{equation}\textbf{\textit{Q}}= \kbordermatrix{&\mathbb{Z}_0^K & \mathbb{Z}_{K+1}^{\infty}\\ \\[-9pt] \mathbb{Z}_0^K & \textbf{\textit{Q}}^{(1,1)}(K) &\quad (\!-\textbf{\textit{Q}}^{(1,1)}(K))\textbf{\textit{e}}\textbf{\textit{z}}\\ \\[-9pt] \mathbb{Z}_{K+1}^{\infty}& \textbf{\textit{e}} \boldsymbol{\zeta}(\textbf{\textit{x}},\textbf{\textit{y}}) &\quad -\textbf{\textit{I}}},\end{equation}

where $\textbf{\textit{z}}$ denotes a $1 \times \infty$ positive probability vector. Note here that the Markov chain with $\textbf{\textit{Q}}$ given by (J.3) is a member of $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K), \mathcal{J}(K))$ if it is ergodic, i.e., if the global balance equation $\boldsymbol{\pi}\textbf{\textit{Q}}=\textbf{0}$ has a unique positive solution $\boldsymbol{\pi}$ satisfying $\boldsymbol{\pi}\textbf{\textit{e}} = 1$ , where $\boldsymbol{\pi}\textbf{\textit{Q}}=\textbf{0}$ is written as

\begin{align*}\boldsymbol{\pi}^{(1)}(K) \textbf{\textit{Q}}^{(1,1)}(K)+\boldsymbol{\pi}^{(2)}(K) \textbf{\textit{e}} \boldsymbol{\zeta}(\textbf{\textit{x}},\textbf{\textit{y}}) &= \textbf{0},\\ \boldsymbol{\pi}^{(1)}(K) (\!-\textbf{\textit{Q}}^{(1,1)}(K))\textbf{\textit{e}} \textbf{\textit{z}} - \boldsymbol{\pi}^{(2)}(K)&= \textbf{0}.\end{align*}

Solving the above with $\boldsymbol{\pi}^{(1)}(K)\textbf{\textit{e}} +\boldsymbol{\pi}^{(2)}(K)\textbf{\textit{e}} = 1$ , we obtain

\begin{align*}\boldsymbol{\pi}&=\begin{pmatrix}\boldsymbol{\pi}^{(1)}(K) & \boldsymbol{\pi}^{(2)}(K)\end{pmatrix}\\&=\frac{1}{(\textbf{\textit{x}}\ \textbf{\textit{y}})\textbf{\textit{e}} +(\textbf{\textit{x}}\ \textbf{\textit{y}})(\!-\textbf{\textit{Q}}^{(1,1)}(K))\textbf{\textit{e}}}\begin{pmatrix}(\textbf{\textit{x}}\ \textbf{\textit{y}}) &\quad (\textbf{\textit{x}}\ \textbf{\textit{y}})(\!-\textbf{\textit{Q}}^{(1,1)}(K))\textbf{\textit{e}} \cdot \textbf{\textit{z}}\end{pmatrix}\\&=\frac{1}{1 + \textbf{\textit{y}}\textbf{\textit{e}} + \boldsymbol{\gamma}\textbf{\textit{e}}}\begin{pmatrix}(\textbf{\textit{x}}\ \textbf{\textit{y}}) &\quad \boldsymbol{\gamma}\textbf{\textit{e}} \cdot \textbf{\textit{z}}\end{pmatrix}> \textbf{0};\end{align*}

in particular,

\begin{equation*}\boldsymbol{\pi}^{(1)}(N)=\boldsymbol{\pi}^{(1)}(K,N)=\frac{1}{1 + \textbf{\textit{y}}\textbf{\textit{e}} + \boldsymbol{\gamma}\textbf{\textit{e}}}\cdot \textbf{\textit{x}}.\end{equation*}

We thus conclude that the Markov chain with $\textbf{\textit{Q}}$ in (J.3) is a member of $\mathcal{M}(\textbf{\textit{Q}}^{(1,1)}(K), \mathcal{J}(K))$ , and it has $\boldsymbol{\pi}(N) = \textbf{\textit{x}}$ .
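The balance equations above and their closed-form solution can be checked numerically on a finite analogue of (J.3), in which the infinite block $\mathbb{Z}_{K+1}^{\infty}$ is replaced by finitely many states. The NumPy sketch below uses arbitrary (hypothetical) sizes, rates, and probability vectors; it illustrates the structure of the solution, not the infinite construction itself.

```python
import numpy as np

rng = np.random.default_rng(0)
n, m = 5, 3                             # sizes of Z_0^K and the truncated exterior

# Subgenerator Q11 with positive leak, and probability vectors zeta and z.
Q11 = rng.uniform(0.0, 1.0, (n, n))
np.fill_diagonal(Q11, 0.0)
np.fill_diagonal(Q11, -(Q11.sum(axis=1) + rng.uniform(0.1, 1.0, n)))
zeta = rng.dirichlet(np.ones(n))
z = rng.dirichlet(np.ones(m))
e_m = np.ones(m)

# Generator of the form (J.3), with the infinite block replaced by m states.
leak = (-Q11) @ np.ones(n)              # (-Q11) e
Q = np.block([[Q11,                 np.outer(leak, z)],
              [np.outer(e_m, zeta), -np.eye(m)]])

# Solve the global balance equation pi Q = 0 with pi e = 1.
A = np.vstack([Q.T, np.ones(n + m)])
b = np.zeros(n + m + 1)
b[-1] = 1.0
pi, *_ = np.linalg.lstsq(A, b, rcond=None)
pi1, pi2 = pi[:n], pi[n:]

# The two balance equations from the proof of Theorem 4.2.
assert np.allclose(pi1 @ Q11 + (pi2 @ e_m) * zeta, 0.0)
assert np.allclose((pi1 @ leak) * z, pi2)
assert (pi > 0).all()                   # the chain is ergodic and positive
```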

Acknowledgements

The authors would like to thank Dr. Hiroshi Nagamochi for his instructive comments on the minimal node cuts of directed graphs. The research of the second author was supported in part by JSPS KAKENHI Grant No. 19K11841.

References

Bright, L. and Taylor, P. G. (1995). Calculating the equilibrium distribution in level dependent quasi-birth-and-death processes. Stoch. Models 11, 497–525.
Gibson, D. and Seneta, E. (1987). Augmented truncations of infinite stochastic matrices. J. Appl. Prob. 24, 600–608.
Grünbaum, B. (2003). Convex Polytopes, 2nd edn. Springer, New York, NY.
Hart, A. G. and Tweedie, R. L. (2012). Convergence of invariant measures of truncation approximations to Markov processes. Appl. Math. 3, 2205–2215.
Kimura, M. and Takine, T. (2018). Computing the conditional stationary distribution in Markov chains of level-dependent M/G/1-type. Stoch. Models 34, 207–238.
Latouche, G. and Ramaswami, V. (1999). Introduction to Matrix Analytic Methods in Stochastic Modeling. SIAM, Philadelphia, PA.
Liu, Y., Li, W. and Masuyama, H. (2018). Error bounds for augmented truncation approximations of continuous-time Markov chains. Operat. Res. Lett. 46, 409–413.
Masuyama, H. (2018). Limit formulas for the normalized fundamental matrix of the northwest-corner truncation of Markov chains: Matrix-infinite-product-form solutions of block-Hessenberg Markov chains. Preprint. Available at https://arxiv.org/pdf/1603.07787v7.pdf.
Nagamochi, H. (2018). Personal communication.
Neuts, M. F. (1981). Matrix-Geometric Solutions in Stochastic Models. Johns Hopkins University Press.
Neuts, M. F. (1989). Structured Stochastic Matrices of M/G/1 Type and Their Applications. Marcel Dekker, New York, NY.
Seneta, E. (1980). Computing the stationary distribution for infinite Markov chains. Linear Algebra Appl. 34, 259–267.
Stewart, W. J. (1994). Introduction to the Numerical Solution of Markov Chains. Princeton University Press.
Takahashi, Y. (1975). A lumping method for numerical calculations of stationary distributions of Markov chains. Res. Rep. B-18, Department of Information Sciences, Tokyo Institute of Technology.
Takine, T. (2016). Analysis and computation of the stationary distribution in a special class of level-dependent M/G/1-type and its application to BMAP/M/ $\infty$ and BMAP/M/c+M queues. Queueing Systems 84, 49–77.
Zhao, Y. Q. and Liu, D. (1996). The censored Markov chain and best augmentation. J. Appl. Prob. 33, 623–629.

Table 1. The worst-case error bounds in Example 4.2 ($N=50$).