Single-item continuous-review inventory models with random supplies

Kurt L. Helmes; Richard H. Stockbridge; Chao Zhu

doi:10.1017/apr.2024.23

Single-item continuous-review inventory models with random supplies

Part of: Operations research and management science Stochastic systems and control Stochastic analysis

Published online by Cambridge University Press: 27 January 2025

Kurt L. Helmes ,

Richard H. Stockbridge and

Chao Zhu

Show author details

Kurt L. Helmes*: Affiliation:
Humboldt University of Berlin
Richard H. Stockbridge*: Affiliation:
University of Wisconsin–Milwaukee
Chao Zhu*: Affiliation:
University of Wisconsin–Milwaukee
*: *Postal address: Institute for Operations Research, Humboldt University of Berlin, Spandauer Str. 1, 10178, Berlin, Germany. Email address: helmes@wiwi.hu-berlin.de
**Postal address: Department of Mathematical Sciences, University of Wisconsin–Milwaukee, Milwaukee, WI 53201, USA.
**Postal address: Department of Mathematical Sciences, University of Wisconsin–Milwaukee, Milwaukee, WI 53201, USA.

Article contents

Abstract
Introduction
Formulation and existence result
Expected occupation and ordering measures
The auxiliary function ${U}_0$
Policy class $ \mathcal{A}_0$ and optimality
Examples
Funding information
Competing interests
References

Rights & Permissions

Abstract

This paper analyzes single-item continuous-review inventory models with random supplies in which the inventory dynamic between orders is described by a diffusion process, and a long-term average cost criterion is used to evaluate decisions. The models in this class have general drift and diffusion coefficients and boundary points that are consistent with the notion that demand should tend to reduce the inventory level. Random yield is described by a (probability) transition function which depends on the inventory on hand and the nominal amount ordered; it is assumed to be a distribution with support in the interval determined by the order-from and the nominal order-to locations of the stock level. Using weak convergence arguments involving average expected occupation and ordering measures, conditions are given for the optimality of an (s, S) ordering policy in the general class of policies with finite expected cost. The characterization of the cost of an (s, S) policy as a function of two variables naturally leads to a nonlinear optimization problem over the stock levels s and S, and the existence of an optimizing pair $(s^*,S^*)$ is established under weak conditions. Thus, optimal policies of inventory models with random supplies can (easily) be numerically computed. The range of applicability of the optimality result is illustrated on several inventory models with random yields.

Keywords

Inventory models with random supplies impulse control long-term average cost general diffusion models (s, S) policies weak convergence

MSC classification

Primary: 93E20: Optimal stochastic control

Secondary: 90B05: Inventory, storage, reservoirs 60H30: Applications of stochastic analysis (to PDE, etc.)

Type: Original Article
Information: Advances in Applied Probability , First View , pp. 1 - 37

DOI: https://doi.org/10.1017/apr.2024.23 [Opens in a new window]
Copyright: © The Author(s), 2025. Published by Cambridge University Press on behalf of Applied Probability Trust

1. Introduction

This paper analyzes a continuous-review inventory management problem when the stock-level process is a diffusion with deficient supply; a long-term average cost criterion is used. The control over the inventory levels is through the action of ordering additional nominal stock, which then results in a random yield of whatever has been ordered. We identify sufficient conditions for the optimality of an (s, S) ordering policy in the most general class of admissible policies.

We model the inventory processes (in the absence of orders) as solutions to a stochastic differential equation

(1.1)

\begin{equation} dX_0(t) = \mu(X_0(t))\, dt + \sigma(X_0(t))\, dW(t), \qquad X_0(0) = x_0,\end{equation}

taking values in an interval ${\mathcal I} = (a,b)$ ; negative values of $X_0(t)$ represent back-ordered inventory. The detailed discussion in [Reference Chen, Wu and Yao4] validates state-dependent diffusion models for inventory management.

Following the classical approach in inventory theory, an ordering policy $(\tau,O)$ for a model with random supplies is a sequence of pairs $\{(\tau_k,O_k)\;:\; k \in \mathbb N\}$ in which $\tau_k$ denotes the (random) time at which the kth order is placed and $O_k$ denotes its (nominal) size. The random supply is modeled by the random slack $\Theta$ , which is a sequence in which, for each k, $\Theta_k$ gives the deficit of the quantity delivered from the order amount $O_k$ ; it also represents the deficiency between the intended inventory level and the actual level after the order delivery. While the order quantities $\{O_k\}$ are determined by the decision-maker, the corresponding slack variables $\{\Theta_k\}$ arise from factors involving the supplier. The inventory level process X resulting from an ordering policy $(\tau,O)$ and corresponding slack $\Theta$ therefore satisfies the equation

(1.2)

\begin{equation} X(t) = x_0 + \int_0^t \mu(X(s))\, ds + \int_0^t \sigma(X(s))\, dW(s) + \sum_{k=1}^\infty I_{\{\tau_k \leq t\}} (O_k-\Theta_k).\end{equation}

Note that the initial inventory level $X(0-\!)=x_0$ may be such that an order is placed at time 0, resulting in a new inventory level at time 0; this possibility occurs when $\tau_1 = 0$ . Also observe that $X(\tau_k-\!)$ is the inventory level just before the kth order is placed, while $X(\tau_k)=X(\tau_k-\!)+O_k-\Theta_k$ is the level with the new inventory. Thus, this model assumes that orders are filled instantaneously. Section 2 describes the inventory process X more formally as an impulse-controlled diffusion process and adopts a different formulation of a nominal ordering policy $(\tau,Z)$ in which $Z=\{Z_k\}$ denotes the nominal inventory levels following (non-deficient) orders.

For the time being, continuing with the informal description above, let $(\tau,O)$ be an ordering policy, $\Theta$ the corresponding slack, and X the resulting inventory level process satisfying (1.2). Let $c_0$ and $c_1$ denote the holding/back-order cost rate and (nominal) ordering cost functions, respectively. We assume there is some constant $k_1 > 0$ such that $c_1 \geq k_1$ ; this constant represents the fixed cost for placing each order. The long-term average expected holding/back-order plus ordering cost to be analyzed is

(1.3)

\begin{equation} J\;:\!=\; \limsup_{t\rightarrow \infty} t^{-1} \mathbb E\left[\int_0^t c_0(X(s))\, ds + \sum_{k=1}^\infty I_{\{\tau_k \leq t\}} c_1(X(\tau_k-\!),X(\tau_k))\right];\end{equation}

the expectation is with respect to all random factors involved in the model. The goal is to identify an ordering policy so as to minimize the cost. For models with random supplies there are other more exotic cost structures that can be considered. The use of $X(\tau_k)$ in the cost functional (1.3) captures the situation ‘you pay for what you get’; see the paragraph following Condition 3 for further details.

As mentioned earlier, we study a generalization of the problem examined in [Reference Helmes, Stockbridge and Zhu9]. In particular, we refer the reader to that paper and to [Reference Helmes, Stockbridge and Zhu8] for a discussion of the existing literature related to the problem with non-deficient supplies, in which $\Theta_k=0$ for all k; see also [Reference Bensoussan2] and references therein. As far as problems with random yield are concerned, the papers [Reference Tinani and Kandpal20, Reference Yano and Lee21] provide excellent reviews of such single-item continuous-review inventory models. In particular, the survey paper [Reference Yano and Lee21] offers an extensive account of how various yield distributions and cost structures arise in practical applications. The papers [Reference Federgruen and Zipkin6, Reference Zheng and Federgruen23] are more technical in nature. Furthermore, [Reference Federgruen and Zipkin6] explicitly addresses the optimality of (s, S) policies for a special continuous-review model with random supplies. The paper [Reference Zheng and Federgruen23] is most useful since it describes an efficient algorithm for computing optimal (s, S) policies and applies to both periodic-review and continuous-review inventory systems. The paper [Reference Bar-Lev, Parlar and Perry1] considers a continuous-review problem with (proportional) random yield. The authors use renewal theory to analyze their inventory model, which is also used in this paper. Among the many other papers devoted to inventory problems with random yield, we would like to point out the publications [Reference Inderfurth and Transchel11, Reference Inderfurth and Vogelsang12, Reference Song and Wang19], which analyze periodic-review problems and nicely describe the challenges due to the presence of a (uniformly distributed) random supply. The paper [Reference Sato, Yagi and Shimakazi16] analyzes an infinite-horizon discounted cost criterion for a distributor when the supplier has uncertain production. Furthermore, it considers both the supplier’s and the distributor’s problems, showing that coordinated decision-making results in reduced expected costs.

Irrespective of the many different models that have been considered in the literature, a common theme which lurks in the background of all the papers devoted to random yield is the quest to identify either an optimal or at least a nearly optimal order strategy. In some publications, the thrust is to propose and justify a heuristic policy, assuming that an optimal order policy has a particular (simple) structure. By contrast, in the present paper we formulate general conditions on the model under which an (s, S) policy is optimal for the long-term average criterion.

This paper extends to the case with random yield our examination in [Reference Helmes, Stockbridge and Zhu9] of inventory models of diffusion type with non-deficient supplies. Even though the same approach is used in these two papers, the analyses are more technical in the present paper because of the inclusion of random supplies satisfying Condition 2. For example, a minimum delivery guarantee condition is required for the existence of a valid mathematical model, a point that has been overlooked in the literature; see for example [Reference Korn14]. Also, an assured supply commitment condition is essential to our proof of the optimality of an (s, S) policy; see Theorem 2. Furthermore, Condition 4 of this paper removes a monotonicity requirement in Condition 2.3 of [Reference Helmes, Stockbridge and Zhu9], allowing the results to apply to a larger class of models.

This paper is organized as follows. In the next section we formulate the problem; in particular, we state conditions on the family of random yield measures that are key to the existence of a mathematical model for continuous-time inventory management as well as the optimization results. Furthermore, we introduce two important functions and adapt some results from [Reference Helmes, Stockbridge and Zhu9] to the model having random supply. The section culminates with the main existence result in Theorem 1. Sections 3 and 4 briefly discuss the expected occupation and ordering measures, adapted for models with random yield, and an auxiliary function $U_0$ , which are at the heart of the analysis. Section 5 then establishes the optimality of an (s, S) policy within the much larger class of admissible nominal ordering policies. The main optimality result is in Theorem 2; its proof is broken into several parts, which precede it. The paper concludes with a discussion of three examples in Section 6 which indicate the usefulness of Theorems 1 and 2 in obtaining an optimal ordering policy.

2. Formulation and existence result

This section briefly establishes the models under consideration, which generalize those studied in [Reference Helmes, Stockbridge and Zhu9]. While the general approach is very similar to the one taken in that paper, special care must be taken pertaining to the formulation of the random yield, the cost structure, the definition of the (nominal) occupation measure, the particular jump operators, and the proofs of several results. The differences between the two papers will be highlighted in the following sections. For a detailed discussion of the dynamics of the underlying uncontrolled diffusion and its boundary behavior, we refer the reader to [Reference Helmes, Stockbridge and Zhu9] and to Chapter 15 of [Reference Karlin and Taylor13]. The latter reference is particularly useful when checking properties of the scale function and the speed measure; both concepts are used in the definition of the functions in Section 2.2.

2.1. Formulation of the model

Let ${\mathcal I} = (a,b) \subseteq {\mathbb R}$ . In the absence of ordering, the inventory process $X_0$ satisfies (1.1) and is a regular diffusion. Throughout the paper we assume that the functions $\mu$ and $\sigma$ are continuous on ${\mathcal I}$ , and that (1.1) is nondegenerate. The initial position of $X_0$ is taken to be $x_0$ for some $x_0 \in \mathcal{I}$ . We place the following assumptions on the underlying diffusion model.

Condition 1.

(a) Both the speed measure M and the scale function S of the process $X_0$ are absolutely continuous with respect to Lebesgue measure.
(b) The left boundary a is attracting and the right boundary b is non-attracting. Moreover, when b is a natural boundary, $M[y,b) < \infty$ for each $y \in {\mathcal I}$ . The boundaries $a = -\infty$ and $b = \infty$ are required to be natural.

Associated with the scale function S of Condition 1, one can define the scale measure on the Borel sets of ${\mathcal I}$ by $S[y,v] = S(v) - S(y)$ for $[y,v] \subset {\mathcal I}$ . From the modeling point of view, Condition 1(b) is reasonable since it essentially says that, in the absence of ordering, demand tends to reduce the size of the inventory. The boundary point a may be regular, exit, or natural, with a being attainable in the first two cases and unattainable in the third. In the case that a is a regular boundary, its boundary behavior must also be specified as being either reflective or sticky. The boundary point b is either natural or entrance and is unattainable from the interior in both cases. Following the approach in [Reference Helmes, Stockbridge and Zhu8, Reference Helmes, Stockbridge and Zhu9], we define the state space of possible inventory levels to be the interval $ \mathcal{E}$ which excludes any natural boundary point; it includes a when a is attainable, and b when it is entrance. Since orders typically increase the inventory level, define $ \mathcal{R} = \{(y,z) \in \mathcal{E}^2\;:\;\; y < z\}$ , the set of states cross the set of feasible actions (in a particular state), in which y denotes the pre-order inventory level and the control value z is the nominal post-order level. The actual post-order inventory level will be determined by y, z, and the realization of the slack variable of the associated order size; explained differently, the post-order inventory level is given as the realization of a transition function $Q(\!\cdot;\;y,z)$ which depends on (y, z).

Since we are using weak convergence methods for measures on $ \mathcal{E}$ and $ \mathcal{R}$ , we will need the closures of these sets as well. Define $\overline{\mathcal{E}}$ to be the closure in $\mathbb R$ of $\mathcal{E}$ ; thus when a boundary is finite and natural, it is not an element of $\mathcal{E}$ but is in $\overline{\mathcal{E}}$ . Note that $\pm \infty \notin \overline{\mathcal{E}}$ . Also set $\overline{\mathcal{R}} = \{(y,z) \in \mathcal{E}^2\;:\;\; y \leq z\}$ ; in contrast to $ \mathcal{R}$ , the set $\overline{\mathcal{R}}$ includes orders of size 0. Notice the subtle distinction between $\overline{\mathcal{E}}$ , which includes boundaries that are finite and natural, and $\overline{\mathcal{R}}$ , which does not allow either coordinate to be such a point.

The random yields are determined by the family $ \mathcal{Q} = \{Q(\!\cdot;\;y,z) \;:\; (y,z) \in \overline{\mathcal{R}}\}$ of probability measures parametrized by $(y,z)\in \overline{\mathcal{R}}$ such that (i) $Q(\!\cdot;y,z)$ is a probability measure for each $(y,z)\in \overline{\mathcal{R}}$ , and (ii) for each $E\in {\mathcal B}( \mathcal{E})$ , $(y,z) \to Q(E;\;y,z)$ is measurable. We have that Q is a transition function on $ \mathcal{E}\times \overline{\mathcal{R}}$ . The probability measure $Q(\!\cdot;\; y,z)$ is the distribution for the resulting inventory level following an order of size $z-y$ . We further impose support, continuity, and supply requirements on this family.

Condition 2. The collection $ \mathcal{Q}$ satisfies the following:

(a)
1. (i) For each $y \in \mathcal{E}$ , $Q(\!\cdot;\;y,y) = \delta_{\{y\}}(\!\cdot\!)$ .
2. (ii) For each $(y,z) \in \mathcal{R}$ , supp $(Q(\!\cdot;\;y,z)) \subset (y,z]$ .
(b) For each $(y,z) \in \overline{\mathcal{R}}$ , for any sequence $\{(y_n,z_n)\in \overline{\mathcal{R}} \;:\; n\in \mathbb N\}$ with $y_n\rightarrow y$ and $z_n\rightarrow z$ as $n\rightarrow \infty$ , the measures $Q(\!\cdot;\;y_n,z_n)$ converge weakly to $Q(\!\cdot;\;y,z)$ as $n\rightarrow \infty$ . This weak convergence is denoted by $Q(\!\cdot;\;y_n,z_n) \Rightarrow Q(\!\cdot;\;y,z)$ .
(c) When b is a natural boundary, for each $[d_1,d_2]\subset {\mathcal I}$ , there exists a $\delta > 0$ such that for each $\widetilde{z}_1$ with $d_2 < \widetilde{z}_1 < b$ ,
(2.1) \begin{equation} \liminf_{z\to b} \inf_{y\in [d_1,d_2]} Q((\widetilde{z}_1,b);\;y,z) \geq \delta.\end{equation}

Condition 2(a,i) indicates that an active order of nominal size 0 will not change the inventory level. Condition 2(a,ii) implies the existence of a minimal delivery guarantee (MDG) that, with probability 1, assures the delivery of a fixed positive amount (up to the amount ordered) when a positive nominal amount is ordered. This condition is essential for showing that each admissible policy, including (s, S) policies, has a valid mathematical model for random supplies (cf. Definition 2.3 and the subsequent comments in [Reference Helmes, Stockbridge and Zhu10]). The fact that one needs to impose this kind of condition on inventory models with random supply to obtain a proper mathematical model of the controlled process has been overlooked in the literature. Condition 2(b) requires continuity of the mapping Q in the topology of weak convergence. Condition 2(c) is an assured supply commitment (ASC) that can be interpreted to be a ‘very important customer’ condition, in the sense that a customer who nominally orders to very high levels of inventory has a significant likelihood of receiving almost all of his order. This condition is used to establish the existence of an optimizer in Theorem 1 and to establish the optimality of a nominal (s, S) policy in Section 5.

We illustrate how Condition 2 may be satisfied when $b=\infty$ , a natural boundary. For fixed $0 < \Delta < 1$ , let $\widetilde{Q} \in \mathcal{P}[\Delta,1]$ be fixed. For $(y,z) \in \mathcal{R}$ , let $f_{(y,z)}\;:\; [0,1] \rightarrow \mathcal{E}$ be the linear mapping with $f_{(y,z)}(0)=y$ and $f_{(y,z)}(1)=z$ . Then the family $ \mathcal{Q}$ defined for $(y,z) \in \mathcal{R}$ by $Q(\!\cdot;\;y,z) = \widetilde{Q}f_{(y,z)}^{-1}(\!\cdot\!)$ always satisfies Condition 2. A special case of this family occurs when $\widetilde{Q}$ is the uniform distribution on $[\Delta,1]$ , resulting in a continuous-review inventory model with nearly stochastically proportional yields. A second special case having $\widetilde{Q}(\!\cdot\!)=\delta_{\{1\}}(\!\cdot\!)$ corresponds to the slack being 0 and therefore models non-deficient supply. Further examples will be examined in Section 6, for example when b is a finite natural boundary.

It will be important throughout the paper to average functions using transition functions. For a measurable function $\ell$ on $\overline{\mathcal{R}}$ and a transition function Q, we adopt the shorthand notation

(2.2)

\begin{equation} \widehat \ell(y,z)\;:\!=\; \int \ell(y,v) Q(dv;\;\; y,z), \qquad (y,z) \in \overline{\mathcal{R}},\end{equation}

with the understanding that the integral exists in $\overline{\mathbb R}$ .

Turning to the cost functions, we impose the following standing assumptions throughout the paper.

Condition 3.

(a) The holding/back-order cost function $c_0\;:\; {\mathcal I} \rightarrow {\mathbb R}^+$ is continuous. Moreover, at the boundaries,
\begin{align*}\lim_{x\rightarrow a} c_0(x) \;=\!:\; c_0(a) \mbox{ $exists\;in\; \overline{ {\mathbb R}^+}\; and$}\;\lim_{x\rightarrow b} c_0(x) \;=\!:\; c_0(b) \mbox{ $exists\;in\;\overline{ {\mathbb R}^+}$;}\end{align*}
we require $c_0(\!\pm\infty) = \infty$ . Finally, for each $y \in \mathcal{I}$ ,
(2.3) \begin{equation} \int_y^b c_0(v)\, \, dM(v) < \infty.\end{equation}
(b) The function $c_1\;:\;\overline{\mathcal{R}} \rightarrow \overline{ {\mathbb R}^+}$ is in $C(\overline{\mathcal R})$ with $c_1 \geq k_1 > 0$ for some constant $k_1$ .

The function $c_1$ is the building block for more complex cost structures of models with random supplies. For example, in the case when the decision-maker ‘pays for what he orders’, the ordering cost function is $c_1$ itself. When the cost structure is ‘you pay for what you get’, the function $\widehat{c}_1$ is used. For the remainder of the main sections, we analyze the inventory problem using $\widehat{c}_1$ , i.e. we pay for what we get; see also the following subsection.

We adapt to this inventory application the model constructed in [Reference Helmes, Stockbridge and Zhu10] for impulse-controlled processes having processes that are continuous between impulses. The model is built on an augmentation of the space $D_ \mathcal{E}[0,\infty)$ of càdlàg paths from $[0,\infty)$ to $ \mathcal{E}$ using the natural filtration $\{ \mathcal{F}_t\}$ in which X is the coordinate process and $ \mathcal{F}_t = \sigma(X(s)\;:\; 0 \leq s \leq t)$ .

We now define a nominal ordering policy. In order to do so, we need to specify the filtration of information used by the decision-maker to determine the jump-from locations and the nominal jump-to locations of a policy. Let $\{ \mathcal{F}_{t-}\}$ be given by $ \mathcal{F}_{t-} = \sigma(X(s)\;:\; 0 \leq s < t)$ for $t> 0$ , with $ \mathcal{F}_{0-}=\sigma(X(0-\!))$ being the $\sigma$ -algebra generated by the inventory level before any intervention at time 0. It is also important to specify the $\sigma$ -algebra of information available before a stopping time. Let $\eta$ be an $\{ \mathcal{F}_{t-}\}$ -stopping time. The $\sigma$ -algebra $ \mathcal{F}_{\eta-} \;:\!=\; \sigma(\{A\cap \{\eta > t\}\;:\; A\in \mathcal{F}_t, t\geq 0\})$ .

For the inventory management problem with random supply, the class $ \mathcal{A}$ of admissible nominal ordering policies $(\tau,Z) =\{(\tau_{k}, Z_{k}), k\in \mathbb N \}$ is defined as follows:

(i) $\{\tau_k\;:\;k\in \mathbb N\}$ is a strictly increasing sequence of $\{ \mathcal{F}_{t-}\}$ -stopping times with $\tau_k \to \infty$ ;
(ii) for each $k \in \mathbb N$ , $Z_k \in \mathcal{E}$ is $ \mathcal{F}_{\tau_{k}-}$ -measurable with $Z_k > X(\tau_k-\!)$ ; and
(iii) the cost (1.3) is finite and is denoted by $J(\tau,Z)$ ; note the inclusion of the policy in the notation.

The requirement that the sequence $\{\tau_k\}$ be strictly increasing implies that at most one order can be placed at any time, while the use of $\{ \mathcal{F}_{t-}\}$ prevents the ordering decision-makers from knowing the supplied amount when an order is placed. The random variable $Z_k$ in (ii) is the nominal order-to location, so it has the value $X(\tau_k-\!)+O_k$ when $O_k$ denotes the nominal order size. The construction in [Reference Helmes, Stockbridge and Zhu10] uses the measure $Q(\!\cdot;\;X(\tau_k-\!),Z_k)$ to select the actual random supply inventory level $X(\tau_k)$ at time $\tau_k$ . Hence the corresponding random slack is $\Theta_k = Z_k - X(\tau_k)$ .

Thus, given the transition functions $ \mathcal{Q}$ and an admissible policy in the class $ \mathcal{A}$ , the associated inventory process X will be a jump-diffusion process characterized by the generator of the process $X_0$ , the jump operator determined by the decision of ordering up to a nominal level z and the transition function Q.

Looking at the infinitesimal behavior, the generator of the process X between jumps (corresponding to the diffusion $X_0$ ) is $Af = \frac{\sigma^{2}}{2} f'' + \mu f'$ , which is defined for all $f \in C^2({\mathcal I})$ ; equivalently, $Af = \frac{1}{2} \frac{d\;}{dM} \left(\frac{df}{dS}\right)$ . The effects that ordering and random yields have on the inventory process and its expected cost will be defined by the jump operator $B\;:\; C( \mathcal{E}) \rightarrow C(\overline{\mathcal{R}})$ , $Bf(y,z) \;:\!=\; f(z) - f(y)$ for $(y,z) \in \overline{\mathcal{R}}$ for an order with non-deficient supply having transition function $Q(\!\cdot;\;y,z)=\delta_{z}(\!\cdot\!)$ , and for the case of random yield by the $\widehat{\;}$ operation $\widehat{Bf}(y,z) \;:\!=\; \, \int Bf(y,v)\, Q(dv;\;y,z)$ when the order-from location is y and the action z selects a transition function $Q(\!\cdot;\;y,z)$ .

2.2. Important functions

As in [Reference Helmes, Stockbridge and Zhu9], the following two functions play a central role in our search for an optimal ordering policy. Recall that M denotes the speed measure and S represents the scale measure. Using the initial position $x_0\in \mathcal{I}$ , define the functions $g_0$ and $\zeta$ on ${\mathcal I}$ by

(2.4)

\begin{align} g_{0}(x) \;:\!=\; \int_{x_0}^{x} \int_{u}^{b} 2 c_{0}(v)\, dM(v)\, dS(u) \quad \textrm{and} \quad\zeta(x) \;:\!=\; \int_{x_0}^{x} \int_{u}^{b} 2\, dM(v)\, dS(u) ,\end{align}

and extend these functions to $\overline{\mathcal{E}}$ by continuity. Observe that both $g_0$ and $\zeta$ are negative on $(a,x_0)$ and positive on $(x_0,b)$ ; also, $g_0$ may take values $\pm \infty$ at the boundaries, while $\zeta$ is $\pm \infty$ for natural boundaries. Using the second characterization of A, it immediately follows that $g_0$ and $\zeta$ , respectively, are particular solutions on $\mathcal{I}$ of

(2.5)

\begin{equation} \left\{\begin{array}{l}Af = - c_0, \\[5pt] f(x_0) = 0,\end{array} \right. \quad \mbox{ and } \quad \left\{\begin{array}{l}Af = -1, \\[5pt] f(x_0) = 0.\end{array}\right.\end{equation}

Other solutions to these differential equations having value 0 at $x_0$ include summands of the form $K(S(x) - S(x_0))$ , $K \in \mathbb R$ , since the constant function and the scale function S are linearly independent solutions of the homogeneous equation $Af = 0$ . However, such additional terms grow too quickly near the boundary b, so that the transversality condition (4.3) in Proposition 6 below fails (see Remark 4.2 of [Reference Helmes, Stockbridge and Zhu9]), and therefore the definitions of $g_0$ and $\zeta$ in (2.4) exclude these terms.

To gain some intuition for the functions $g_0$ and $\zeta$ , let $y, v \in \mathcal{E}, y < v$ , and let $X_0$ satisfy (1.1) with $X_0(0) = v$ . Define $\tau_{v,y} \;:\!=\; \inf\{t \geq 0\;:\; X_0(t) = y\}$ . Then Proposition 2.6 in [Reference Helmes, Stockbridge and Zhu8] shows that

\begin{align*}\mathbb E_v\left[\int_0^{\tau_{v,y}} c_0(X_0(s))\, ds\right] = {B}g_0(y,v)\qquad \mbox{and}\qquad \mathbb E_v[\tau_{v,y}] = {B}\zeta(y,v),\end{align*}

and a simple extension establishes that if $X_0(0) \sim Q(\!\cdot;\;y,z)$ , for $(y,z) \in \mathcal{R}$ , then

\begin{align*}\int \mathbb E_v\left[\int_0^{\tau_{v,y}} c_0(X_0(s))\, ds\right]Q(dv;\;y,z) = \widehat{Bg_0}(y,z)\end{align*}

and

\begin{align*} \int \mathbb E_v[\tau_{v,y}]Q(dv;\;y,z)= \widehat{B\zeta}(y,z).\end{align*}

The proof of our basic existence result, Theorem 1, relies on the asymptotic behavior of the functions $c_0$ , $g_0$ , and $\zeta$ when the boundaries are natural. The following lemma, whose proof can be found in Lemma 2.1 of [Reference Helmes, Stockbridge and Zhu9], summarizes such asymptotic behavior.

Lemma 1. Assume Condition 1. Suppose a and b are natural boundaries, and let $c_0(a)$ and $c_0(b)$ be as in Condition 3(a). Then the following asymptotic behaviors hold:

(2.6)

\begin{align} & \lim_{y\rightarrow a} \frac{Bg_0(y,v)}{B\zeta(y,v)} = c_0(a), \quad \forall v\in \mathcal{I}; \ & & \lim_{v\rightarrow b} \frac{Bg_0(y,v)}{B\zeta(y,v)} = c_0(b), \quad \forall y\in \mathcal{I}; &\end{align}

(2.7)

\begin{align}& \lim_{(y,v)\rightarrow (a,a)} \frac{Bg_0(y,v)}{B\zeta(y,v)} = c_0(a); \quad & & \lim_{(y,v) \rightarrow (b,b)} \frac{Bg_0(y,v)}{B\zeta(y,v)} = c_0(b); &\end{align}

(2.8)

\begin{align}& \lim_{y\rightarrow a} \frac{g_0(y)}{\zeta(y)} = c_0(a); \quad & & \lim_{v\rightarrow b} \frac{g_0(v)}{\zeta(v)} = c_0(b).&\end{align}

These behaviors imply that $\lim_{y\rightarrow a} g_0(y) = -\infty$ when $c_0(a) > 0$ and $\lim_{v\rightarrow b} g_0(v) = \infty$ when $c_0(b) > 0$ .

Another function of importance to the solution of the problem is $\widehat{c}_1$ , which we remind the reader is defined to be $\widehat{c}_1(y,z)=\int c_1(y,v) Q(dv;\;y,z)$ , where $(y,z) \in \overline{\mathcal{R}}$ . The first proposition indicates a difference between the properties of the ordering cost structure of the random supply model and the model with non-deficient deliveries.

Proposition 1. Assume Conditions 1–3. Then $\widehat{c}_1$ is lower semicontinuous.

Proof. We need to show that for every $(y,z) \in {\overline{\mathcal{R}}}$ and every sequence $\{(y_n,z_n)\;:\; n\in\mathbb N\}$ in ${\overline{\mathcal{R}}}$ which converges to (y, z),

(2.9)

\begin{equation} \widehat{c}_1(y,z) \le \liminf_{n\rightarrow \infty} \widehat{c}_1(y_n,z_n).\end{equation}

We may assume that the function $c_1$ is bounded; the monotone convergence theorem implies the inequality (2.9) for unbounded cost functions once it has been established for a truncated form of $c_1$ . To verify (2.9) we shall rely on the elementary but most useful Lemma 2.1 in [Reference Serfozo17]. In the sequel, we verify the hypothesis of this lemma. To this end, for the given pair (y, z) and the points $y_n, n \in \mathbb N$ , we define nonnegative continuous functions f and $f_n$ on $\mathcal{E}$ as follows. For $v \in \mathcal{E}$ , let

(2.10)

\begin{equation}f(v) \;:\!=\; \left\{\begin{array}{cl}\displaystyle{c_1}(y,v), & \quad v \ge y, \rule[-15pt]{0pt}{15pt}\\[5pt] {c_1}(y,y), & \quad v \le y,\end{array} \right. \textrm{and} \quad f_n(v) \;:\!=\; \left\{\begin{array}{cl}\displaystyle{c_1}(y_n,v), & \quad v \ge y_n, \rule[-15pt]{0pt}{15pt}\\[5pt] {c_1}(y_n,y_n), & \quad v \le y_n.\end{array} \right.\end{equation}

For the remainder of this proof, we simplify notation by setting

(2.11)

\begin{equation}Q(\!\cdot\!) \;:\!=\; Q(\!\cdot;\;y,z) \qquad \textrm{and}\quad Q_n(\!\cdot\!) \;:\!=\; Q(\!\cdot;\;y_n,z_n).\end{equation}

Since f is continuous, for every $t \in {\mathbb R}$ and $\epsilon > 0$ the set $\{v \in {\mathcal{E}}\;:\; f(v) > t+{\epsilon}\}$ is an open set. Moreover, $c_1$ is uniformly continuous on any compact subset in $\overline{\mathcal{R}}$ . Hence, for sufficiently large n, $v\in\{f>t+\epsilon\}$ implies $v\in\{f_n > t\}$ . By Condition 2(b), the measures $Q_n$ converge weakly to Q on $\mathcal{E}$ , and thus by the portmanteau theorem (cf. [Reference Ethier and Kurtz5, Theorem 3.3.1, p. 108]) for the first inequality below and the inclusion $\{f>t+\epsilon\}\subset \{f_n>t\}$ for n sufficiently large for the second inequality,

(2.12)

\begin{equation}Q(\{ f > t+\epsilon \}) \leq \liminf_{n\rightarrow \infty} Q_n(\{ f> t+\epsilon \}) \leq \liminf_{n\rightarrow \infty} Q_n(\{f_n> t \}).\end{equation}

Since $\epsilon$ is arbitrary, the hypothesis of Lemma 2.1 in [Reference Serfozo17] is satisfied and it therefore follows that

\begin{align*}\int{f(v)Q(dv)} \le \liminf_{n\rightarrow \infty}\int{f_n(v)Q_n(dv)}.\end{align*}

By Condition 2(a) and the notation (2.11), $Q(\!\cdot\!)$ has its support in (y, z], and similarly for $Q_n(\!\cdot\!)$ . Therefore

\begin{align*}\widehat{c}_1(y,z) = \int{f(v)Q(dv)} \qquad \mbox{and} \qquad \widehat{c}_1(y_n,z_n) = \int{f_n(v)Q_n(dv)},\end{align*}

implying that (2.9) holds true.

2.3. Analysis of nominal (s, S) ordering policies

Both this paper and [Reference Helmes, Stockbridge and Zhu9] rely on characterizing the long-term average cost for (s, S) ordering policies in the cases of deficient supplies or of full supplies using a renewal reward theorem. For $(y,z) \in \mathcal{R}$ , define the nominal (y, z) ordering policy $(\tau,Z)$ so that $\tau_0=0$ and

(2.13)

\begin{equation} \tau_k = \inf\{t > \tau_{k-1}\;:\; X(t-\!) \leq y\} \quad \mbox{ and } \quad Z_k = z,\quad k \geq 1,\end{equation}

in which X is the inventory level process satisfying (1.2) with this ordering policy. The above definition of $\tau_k$ must be slightly modified when $k=1$ , to $\tau_1 = \inf\{t \geq 0\;:\; X(t-\!) \leq y\}$ , to allow for the first jump to occur at time 0 when $x_0 \leq y$ . Observe that X is a delayed renewal process, since the single distribution $Q(\!\cdot;,y,z)$ is used to determine the random supply for all orders $k \geq 2$ ; it is a renewal process when $y \leq x_0$ . We note that the definition of $\tau_k$ in (2.13) needs to be more precisely stated as in Section 6 of [Reference Helmes, Stockbridge and Zhu10] because of the particular construction of the mathematical model. However, the definition in (2.13) provides the correct intuition, so we rely on this simpler statement of the intervention times.

Theorem 2.1 of [Reference Sigman and Wolff18] provides the existence and uniqueness of the stationary distribution for the process X arising from a nominal (y, z) ordering policy for any $(y,z) \in \mathcal{R}$ , and moreover, the one-dimensional distributions $\mathbb P(X(t) \in \cdot)$ converge weakly to the stationary distribution as t tends to infinity. A straightforward generalization of Proposition 3.1 of [Reference Helmes, Stockbridge and Zhu8] characterizes the density $\pi$ of the stationary distribution for X and the long-run frequency $\widehat\kappa = \frac{1}{\widehat{B\zeta}(y,z)\rule{0pt}{10pt}}$ of orders.

By renewal theory, the long-term average running cost for the nominal (y, z) ordering policy (cf. (2.13)) equals

(2.14)

\begin{equation}\lim_{t\rightarrow \infty} \frac{1}{t} \int_{0}^{t} c_{0}(X(s))ds = \frac{\widehat{Bg_0}(y,z)}{\widehat{B\zeta}(y,z)} \;\;(\mbox{almost surely and in } L^1),\end{equation}

and therefore the long-term average cost $J(\tau,Z)$ of (1.3) is given by

(2.15)

\begin{equation}J(\tau,Z) = \frac{\widehat{c}_{1}(y,z) + \widehat{Bg_{0}}(y,z)}{\widehat{B\zeta}(y,z)}.\end{equation}

Motivated by (2.15), define the function ${H}_0\;:\; \overline{\mathcal{R}} \rightarrow \overline{\mathbb R^+}$ by

(2.16)

\begin{equation} {H}_0(y,z) \;:\!=\; \left\{\begin{array}{cl}\displaystyle\frac{\widehat{c}_1(y,z) + \widehat{Bg_0} (y,z)} {\widehat{B\zeta}(y,z)}, & \quad (y,z) \in {\mathcal R}, \rule[-15pt]{0pt}{15pt}\\[5pt] \infty, & \quad (y,y) \in \overline{\mathcal{R}}.\end{array} \right.\end{equation}

We note that $H_0$ is an adaptation of the function $F_0$ in [Reference Helmes, Stockbridge and Zhu9] to the case of random yields. Recall that $Q(\!\cdot;\;y,z)$ has its support in (y, z] and the collection is weakly convergent. Since $g_0$ and $\zeta$ are continuous, it follows that $\widehat{Bg_0}$ and $\widehat{B\zeta}$ are also continuous, as well as being nonnegative. Therefore ${H}_0$ is lower semicontinuous on $\overline{\mathcal{R}}$ by Proposition 1.

Similarly to the case of non-deficient deliveries, our goal is to minimize ${H}_0$ . Since $c_1 > 0$ , and hence $\widehat{c}_1$ is positive, ${H}_0(y,z) > 0$ for every $(y,z) \in \overline{\mathcal{R}}$ . Thus, $\inf_{(y,z)\in \overline{\mathcal{R}}} {H}_0(y,z) \;=\!:\; {H}_0^* \geq 0$ . The models with a natural boundary allow ${H}_0^* = 0$ as a limit as the appropriate coordinate approaches the boundary point, in which case it immediately follows that there is no minimizing pair $({y_0^*},{z_0^*})$ of ${H}_0$ . The imposition of Condition 4 below eliminates the possibility that $H_0^*=0$ .

It is helpful to define a family $\{\mathfrak{P}(\!\cdot;\;y,z)\;:\; (y,z)\in \mathcal{R}\}$ of probability measures on $ \mathcal{E}$ as follows:

\begin{align*}\mathfrak{P}(\Gamma;\;y,z) = \int_\Gamma B\zeta(y,v) \mbox{$\frac{1}{\widehat{B\zeta}(y,z)\rule{0pt}{10pt}}$}\, Q(dv;\;y,z), \qquad \Gamma \in {\mathcal B}( \mathcal{E}).\end{align*}

Note that the value $\mathfrak{P}(\Gamma;\;y,z)$ gives the proportion of the expected cycle length $\widehat{B\zeta}(y,z)$ due to the random effect distribution $Q(\!\cdot;\;y,z)$ delivering to inventory levels $v\in \Gamma$ following the order. Also observe that $\mathfrak{P}(\!\cdot;\;y,z)$ inherits its support from $Q(\!\cdot;\;y,z)$ .

The next result shows that the infimum $F_0^*$ of the function $F_0$ in [Reference Helmes, Stockbridge and Zhu9] (see (2.17) below) is a lower bound for the value $ H_{0}^{*}$ . The function $F_0$ gives the long-term average cost of a (y, z) policy for non-deficient supply models.

Proposition 2. Assume Conditions 1–3. Define the function

(2.17)

\begin{equation} F_0(y,z) \;:\!=\; \left\{\begin{array}{cl}\displaystyle\frac{c_1(y,z) + Bg_0 (y,z)} {B\zeta(y,z)}, & \quad (y,z) \in {\mathcal R}, \rule[-15pt]{0pt}{15pt}\\[5pt] \infty, & \quad { (y,z) \in \overline{\mathcal{R}} \;\textit{with}\; y=z,}\end{array} \right.\end{equation}

and let $F_0^* = \inf_{(y,z)\in \overline{\mathcal{R}}} F_0(y,z)$ . Then $ H_{0}^{*}\ge F_{0}^{*}$ .

Proof. Observe that the function ${H}_0$ defined by (2.16) can also be written as

(2.18)

\begin{equation} {H}_0(y,z) \;:\!=\; \left\{\begin{array}{cl}\displaystyle\int \frac{c_1(y,v) + Bg_0 (y,v)} {\widehat{B\zeta}(y,z)}\, Q(dv;\;y,z), & \quad (y,z) \in {\mathcal R}, \rule[-15pt]{0pt}{15pt}\\[5pt] \infty, & \quad (y,z) \in \overline{\mathcal{R}} \textrm{ with } y=z. \rule{0pt}{14pt}\end{array} \right.\end{equation}

Using the factor $\frac{B\zeta(y,v)}{B\zeta(y,v)}=1$ , the expression for $H_0$ when $y < z$ yields

\begin{align*}H_0(y,z) = \int \frac{c_1(y,v) + Bg_0(y,v)}{B\zeta(y,v)}\, \mathfrak{P}(dv;\;y,z) = \int F_0(y,v)\, \mathfrak{P}(dv;\;y,z) \geq F_0^*.\end{align*}

Taking the infimum over $(y,z)\in \mathcal{R}$ therefore establishes the result.

Similarly as in [Reference Helmes, Stockbridge and Zhu9], our main optimality result depends on the existence of a minimizing pair $({y_0^*},{z_0^*})\in \mathcal{R}$ of ${H}_0$ . An important subtlety is that properties of the function ${H}_0$ on compact subsets of $ \mathcal{R}$ and close to the boundary of $ \mathcal{R}$ are not simply determined by the properties of the functions $c_1$ , $g_0$ , and $\zeta$ in these regions as they were for non-deficient supply models. In fact, the behavior of the function ${H}_0$ near the boundary depends crucially on properties of the measure-valued transition functions $Q(\!\cdot;\;y,z)$ as functions on $ \mathcal{R}$ , and in particular on the behavior of the function $\widehat{B\zeta}$ near the boundary. As a consequence, a proof of a general optimality result for an (s, S) policy for inventory models with random supply requires additional conditions. Before presenting these conditions, however, we identify an important relationship between Condition 2(c) and the family of measures $\{\mathfrak{P}(\!\cdot;\;y,z)\}$ .

Lemma 2. Let b be a natural boundary for which Condition 2(c) holds. Then for each interval $[d_1,d_2] \subset {\mathcal I}$ and for every $\check{z}$ with $d_2 < \check{z} < b$ ,

(2.19)

\begin{equation} \lim_{z\to b}\, \inf_{y\in [d_1,d_2]} \mathfrak{P}((\check{z},b);\;y,z) = 1.\end{equation}

Proof. Let $[d_1,d_2] $ and $\check{z}$ be given as in the statement of the lemma. Define

\begin{align*} M \;:\!=\; \sup\{B\zeta(y,v)\;:\;\; y \in [d_1,d_2], v\in [y, \check z] \} < \infty.\end{align*}

Furthermore, let $\delta > 0$ be as in Condition 2(c). For any $\varepsilon > 0$ , choose an $N \in \mathbb N$ so that $N > \frac{2M}{\delta\,\varepsilon}$ .

Since b is a natural boundary, $\lim_{v\to b}[\zeta(v) - \zeta(y)] = \infty$ uniformly for $y\in [d_1,d_2]$ . Consequently, for the $N\in \mathbb N$ chosen above, there exists a $z_{N} < b$ (without loss of generality, we can assume that $z_{N} > \check z$ ) such that

\begin{align}\zeta(v) - \zeta(y) \ge N, \quad \textrm{ for all } v \ge z_{N} \textrm{ and }y \in [d_1,d_2].\end{align}

Now, for the chosen $z_{N}$ , Condition 2(c) says that we can find a $z_{\varepsilon} \in (z_{N}, b)$ so that

\begin{align}Q([z_{N}, z];\; y,z) \ge \frac\delta{2}, \quad \textrm{ for all }z > z_{\varepsilon} \textrm{ and } y \in [d_1,d_2].\end{align}

Then for all $y \in [d_1,d_2]$ and $z > z_{\varepsilon}$ , we have

\begin{align*} \widehat{B\zeta}(y,z) & = \int_{y}^{z_{N}} B\zeta(y,v) Q(dv;\; y,z) + \int_{z_{N}}^{z} B\zeta(y,v) Q(dv;\; y,z) \\[5pt] & \ge 0 + N Q([ z_{N}, z];\; y,z) \ge \mbox{$\frac{N \delta}{2}$}.\end{align*}

Consequently, it follows that for any $y \in [d_1,d_2]$ and $z > z_{\varepsilon}$ , we have

\begin{align*} \mathfrak{P}((\check{z},b);\;y,z) & =\int_{\check z}^{z} B\zeta(y,v) \frac{1}{\widehat{B\zeta} (y,z)} Q(dv;\;\; y,z)\\[5pt] & = \frac{\int_{y}^{z} B\zeta(y,v) Q(dv;\; y,z)- \int_{y}^{\check z} B\zeta(y,v) Q(dv;\; y,z)}{\widehat{B\zeta} (y,z)}\\[5pt] & \ge 1 - \frac{M}{\widehat{B\zeta} (y,z) } \ge 1 - \mbox{$\frac{2M}{N\delta}$} > 1-\varepsilon.\end{align*}

This establishes (2.19) and hence completes the proof.

Remark 1. Condition 2(c) is stronger than the conclusion of this lemma. To see this, assume b is a natural boundary, let Condition 1 hold, and let $\zeta$ be given by (2.4). We identify a family $\mathcal{Q}$ for which (2.19) holds but Condition 2(c) fails. We focus on the subset of $ \mathcal{R}$ for which $B\zeta > 1$ . For each such (y, z), let $\breve{y}$ satisfy $\breve{y} > y$ with $B\zeta(y,\breve{y})=\frac{1}{2}$ ; also set $m_1 \;:\!=\; \frac{1}{\sqrt{B\zeta(y,z)}}$ and $m_0\;:\!=\; 1-m_1$ . Now consider the random supply measures for (y, z) with $B\zeta(y,z) > 1$ given by

\begin{align*}Q(\!\cdot;\;y,z) = m_0 \delta_{\breve{y}}(\!\cdot\!) + m_1 \delta_{z}(\!\cdot\!).\end{align*}

Notice that

\begin{align*}\widehat{B\zeta}(y,z) = B\zeta(y,\breve{y}) m_0 + B\zeta(y,z) m_1 = \frac{m_0}{2} + \sqrt{B\zeta(y,z)},\end{align*}

so for fixed y, $\widehat{B\zeta}(y,z) \to \infty$ as $z\to b$ . This convergence then implies (2.19) holds for the fixed y, and a simple argument extends this to a uniform convergence for $y\in [d_1,d_2]$ .

Now for $y\in [d_1,d_2]$ and (y, z) with $B\zeta(y,z) > 1$ , for any $\check{z} > d_2$ , $Q((\check{z},b);\;y,z) = $ $\frac{1}{\sqrt{B\zeta(y,z)}} \to 0$ as $z\to b$ . Hence Condition 2(c) fails.

Now, combined with Conditions 1, 2, and 3, the following set of conditions will be sufficient to guarantee the existence of a minimizer of the function $H_0$ on $\mathcal{R}$ .

Condition 4. The following conditions hold:

(a) Either the boundary a is regular or exit, or a is a natural boundary for which there exists some $ (y_1,z_1) \in \mathcal{R}$ such that $H_0(y_1,z_1) < c_0(a)$ .
(b) Either the boundary b is entrance, or b is natural and there exists some $(y_2,z_2)\in \mathcal{R}$ such that $H_0(y_2,z_2) < c_0(b)$ .

Remark 2. In comparing the random supply model of this paper with the non-deficient supply model of [Reference Helmes, Stockbridge and Zhu9], we observe that Condition 1 is the same in each paper and Condition 3 of this paper is Condition 2.2 of [Reference Helmes, Stockbridge and Zhu9]. The present Condition 2 exists only in this paper. Furthermore, Condition 4 corresponds to Condition 2.3 in [Reference Helmes, Stockbridge and Zhu9]. It uses $H_0$ in place of $F_0$ to account for random supplies and also removes a monotonicity requirement for $F_0$ near natural boundaries.

We now state our main existence result, which when combined with Theorem 2 establishes the optimality of a nominal (s, S) ordering policy within the large class of admissible policies. See Section 6 for examples which illustrate these results.

Theorem 1. Assume Conditions 1–4 are satisfied. Then there exists a pair $({y_0^*},{z_0^*}) \in \mathcal R$ such that

(2.20)

\begin{equation} {H}_0({y_0^*},{z_0^*}) = {H}_0^* = \inf\{{H}_0(y,z)\;:\; (y,z) \in \overline{\mathcal R}\}.\end{equation}

Proof. The proof consists of several parts, corresponding to pieces of the boundary of ${\mathcal R}$ , the type of boundary point, and the values of $c_0$ at a and b. Since much of the analysis is similar in every part, we shall only spell out the details of the case in which a and b are natural boundaries. When a is attainable or b is an entrance boundary, the boundary is included in $ \mathcal{E}$ , so the minimum of $H_0$ may be achieved using a boundary point. The proofs in these cases follow a similar line of argument.

Our method of proof is to show that $H_0$ is strictly greater than its infimum in a neighborhood of the boundary. To begin, recall that

(2.21)

\begin{equation} H_0(y,z) = \int_ \mathcal{E} F_0(y,v)\, \mathfrak{P}(dv;\;y,z).\end{equation}

The challenge is that $\mathfrak{P}(\!\cdot;\;y,z)$ may place mass throughout most of the interval (y, z], so we need to be careful in developing the lower bounds of the integrand near different segments of the boundary; Figure 1 aids in visualizing this analysis. With reference to Figure 1, the bound

(2.22)

\begin{equation} F_0(y,v) = \frac{c_1(y,v)+Bg_0(y,v)}{B\zeta(y,v)} > \frac{Bg_0(y,v)}{B\zeta(y,v)}\end{equation}

will be used in the regions $E_1$ , $E_2$ , $E_3$ , $E_4$ , and $E_5$ , while

(2.23)

\begin{equation} F_0(y,v) = \frac{c_1(y,v)+Bg_0(y,v)}{B\zeta(y,v)} \geq \frac{c_1(y,v)}{B\zeta(y,v)} \geq \frac{k_1}{B\zeta(y,v)}\end{equation}

will be used for the region $E_6$ .

Figure 1. Neighborhoods of the boundary.

The two parts of Condition 4 can be combined to yield a single pair $(y_1,z_1)\in \mathcal{R}$ for which $c_0(a) \wedge c_0(b)> H_0(y_1,z_1)$ . Select $\varepsilon \in (0,1)$ so that

(2.24)

\begin{equation} c_0(a)\wedge c_0(b) > \frac{1+\varepsilon}{1-\varepsilon} H_0(y_1,z_1) + \varepsilon \quad \mbox{and}\quad \varepsilon < \frac{k_1}{H_0(y_1,z_1)}.\end{equation}

• By (2.7) of Lemma 1, there exists some $z_\varepsilon$ such that
\begin{align*}\frac{Bg_0(y,v)}{B\zeta(y,v)} > H_0(y_1,z_1),\quad \forall z_\varepsilon \leq y < v < b.\end{align*}
Define the neighborhood of (b, b) to be $E_1 = \{(y,z)\in \mathcal{R}\;:\; z_\varepsilon \leq y < z < b\}$ .
• Again by (2.7) of Lemma 1, there exists some $y_\varepsilon$ such that
\begin{align*}\frac{Bg_0(y,v)}{B\zeta(y,v)} > H_0(y_1,z_1), \quad \forall a < y < v \leq y_\varepsilon.\end{align*}
Define the neighborhood of (a, a) to be $E_2 = \{(y,z)\in \mathcal{R}\;:\; a < y < z \leq y_\varepsilon\}$ .
• Recall that $x_0$ is the initial position. Using $x_0$ as the fixed value in the two asymptotic results in (2.6) of Lemma 1, we find that there exist $\overline{y}$ and $\overline{z}$ such that for $y \leq \overline{y}$ and $v \geq \overline{z}$ , respectively,
(2.25) \begin{equation} \frac{Bg_0(y,x_0)}{B\zeta(y,x_0)} > H_0(y_1,z_1) \quad \textrm{and}\quad \frac{Bg_0(x_0,v)}{B\zeta(x_0,v)} > H_0(y_1,z_1).\end{equation}
For notational simplicity, we may assume $\overline{y}=y_\varepsilon$ and $\overline{z}=z_\varepsilon$ by using $y_\varepsilon\wedge \overline{y}$ and $z_\varepsilon \vee \overline{z}$ in the two previous parts as well as here. Now define
\begin{align*}M\;:\!=\; \max_{y_\varepsilon \leq v \leq z_\varepsilon} (|g_0(v)| \vee |\zeta(v)|)\end{align*}
and note that $M < \infty$ since $g_0$ and $\zeta$ are continuous. Using the fact that $\lim_{y\to a} \zeta(y) = -\infty$ along with (2.8) of Lemma 1, we have that there exists a $\widetilde{y} \leq y_\varepsilon$ such that for $y \leq \widetilde{y}$ ,
\begin{align*}\frac{M}{\zeta(y)} \leq \varepsilon \quad \textrm{and}\quad \frac{g_0(y)}{\zeta(y)} > \frac{1+\varepsilon}{1-\varepsilon}H_0(y_1,z_1) + \varepsilon.\end{align*}
Define a neighborhood of the left boundary segment between $(a,y_\varepsilon)$ and $(a,z_\varepsilon)$ to be $E_3 = \{(y,z)\in \mathcal{R}\;:\;\; y \leq \widetilde{y} \textrm{ and } y_\varepsilon \leq z \leq z_\varepsilon\}$ . Observe that for all $(y,z) \in E_3$ ,
\begin{align*}\frac{Bg_0(y,v)}{B\zeta(y,v)} & = \frac{g_0(y) - g_0(v)}{\zeta(y)-\zeta(v)} \geq \frac{g_0(y) - M}{\zeta(y) + M} = \frac{\frac{g_0(y)}{\zeta(y)} - \frac{M}{\zeta(y)}}{1 + \frac{M}{\zeta(y)}} \\[5pt] & > \frac{\frac{1+\varepsilon}{1-\varepsilon}H_0(y_1,z_1) + \varepsilon - \varepsilon}{1+\varepsilon} = \frac{H_0(y_1,z_1)}{1-\varepsilon} > H_0(y_1,z_1).\end{align*}
• Again, let $z_\varepsilon$ be as in the definition of $E_1$ , let $y_\varepsilon$ be from $E_2$ , and let $\widetilde{y}$ be as in $E_3$ . A key observation is that the inequalities (2.25) establish that for $a < y \leq y_\varepsilon$ and $z_\varepsilon \leq v < b$ ,
\begin{align*}Bg_0(y,v) = Bg_0(y,x_0) + Bg_0(x_0,v) & > H_0(y_1,z_1) (B\zeta(y,x_0) + B\zeta(x_0,v)) \\[5pt] & = H_0(y_1,z_1) B\zeta(y,v),\end{align*}
and therefore
(2.26) \begin{equation} \frac{Bg_0(y,v)}{B\zeta(y,v)} > H_0(y_1,z_1), \quad \forall a < y \leq y_\varepsilon \textrm{ and } z_\varepsilon \leq v < b.\end{equation}
Since $\widetilde{y} \leq y_\varepsilon$ , this inequality holds in the neighborhood of (a, b) defined by $E_4 \;:\!=\; \{(y,z)\in \mathcal{R}\;:\; a < y \leq \widetilde{y} \textrm{ and } z_\varepsilon \leq z < b\}$ .
• Yet again, let $z_\varepsilon$ be as in the definition of $E_1$ and $\widetilde{y}$ be from $E_3$ . Now set $M_1 = \max_{\widetilde{y}\leq v\leq z_\varepsilon} (g_0(v)\vee \zeta(v))$ , noting that $M_1 \geq M$ since $[\widetilde{y},z_\varepsilon] \supset [y_\varepsilon,z_\varepsilon]$ . Since b is a natural boundary, $\lim_{v\to b} \zeta(v) = \infty$ and the asymptotic relation in (2.8) of Lemma 1 holds. Thus there exists some $\check{z} \geq z_\varepsilon$ such that for $v \geq \check{z}$ ,
\begin{align*}\frac{M_1}{\zeta(v)} \leq \varepsilon \quad \textrm{and}\quad \frac{g_0(v)}{\zeta(v)} > \frac{1+\varepsilon}{1-\varepsilon}H_0(y_1,z_1) + \varepsilon, \end{align*}
and hence
(2.27) \begin{align} \nonumber\frac{Bg_0(y,v)}{B\zeta(y,v)} = \frac{g_0(z) - g_0(y)}{\zeta(z)-\zeta(y)} & \geq \frac{g_0(v) - M_1}{\zeta(v) + M_1} = \frac{\frac{g_0(z)}{\zeta(z)} - \frac{M_1}{\zeta(z)}}{1 + \frac{M_1}{\zeta(z)}} \\[5pt] & > \frac{\frac{1+\varepsilon}{1-\varepsilon}H_0(y_1,z_1) + \varepsilon-\varepsilon}{1+\varepsilon} = \frac{H_0(y_1,z_1)}{1-\varepsilon}.\end{align}
Using this $\check{z}$ in (2.19) of Lemma 2, we have that there is some $\widetilde{z} > \check{z}$ such that for $z > \widetilde{z}$ ,
(2.28) \begin{equation} \inf_{y\in [y_\varepsilon,z_\varepsilon]} \mathfrak{P}((\check{z},b);\;y,z) > 1-\varepsilon.\end{equation}
Define a neighborhood of the top boundary segment between $(y_\varepsilon,b)$ and $(z_\varepsilon,b)$ by $E_5 = \{(y,z)\in \mathcal{R}\;:\; \widetilde{y} \leq y \leq z_\varepsilon \textrm{ and } z \geq \widetilde{z}\}$ .
• Let $y_\varepsilon$ , $z_\varepsilon$ , $E_1$ , and $E_2$ be as in the previous steps. From the first two analyses we know that for all $(y,z)\in E_1\cup E_2$ , $H_0(y,z) \geq H_0(y_1,z_1)$ . We therefore only need to consider a neighborhood of the diagonal segment having $y\in [y_\varepsilon,z_\varepsilon]$ . Pick $\check{y}$ with $a < \check{y} < y_\varepsilon$ to allow a slight overlap with the region $E_2$ .

Since $\zeta$ is continuous, it is uniformly continuous on the interval $[\check{y},z_\varepsilon]$ . Let $\delta$ be such that $\check{y} \leq y \leq z_\varepsilon$ and $y \leq z \leq y+\delta$ implies $B\zeta(y,z) < \varepsilon$ . Define the neighborhood of the cropped diagonal to be $E_6 = \{(y,z)\in \mathcal{R}\;:\; \check{y} \leq y \leq z_\varepsilon, y < z \leq y+\delta \}$ . Recall from (2.24) that $\varepsilon < \frac{k_1}{H_0(y_1,z_1)}$ ; it therefore follows from (2.23) that for all $(y,z)\in E_6$ and $y <v \leq z$ ,
\begin{align*}F_0(y,v) > \frac{k_1}{B\zeta(y,v)} > \frac{k_1}{\varepsilon} > H_0(y_1,z_1).\end{align*}

Returning to (2.21), observe that the integration is with respect to the second variable v, so it is integration over the vertical line segment from the point (y, y) on the diagonal to (y, z). In particular, for $(y,z) \in E_1\cup E_2 \cup E_3 \cup E_4 \cup E_6$ , supp $(\mathfrak{P}(\!\cdot;\;y,z))$ is contained in this union.

Now in the regions $E_1$ to $E_4$ , combine (2.22) with the fact that $\frac{Bg_0(y,v)}{B\zeta(y,v)} > H_0(y_1,z_1)$ to see that $F_0(y,v) > H_0(y_1,z_1)$ . Similarly for the region $E_6$ , use the relation $F_0(y,v) > H_0(y_1,z_1)$ to obtain the same result. It now follows from (2.21) and the fact that $\mathfrak{P}(\!\cdot;\;y,z)$ is a probability measure that on the regions $E_1$ , $E_2$ , $E_3$ , $E_4$ , and $E_6$ , we have $H_0(y,z) > H_0(y_1,z_1)$ , and hence the infimum does not occur in these regions or in the limit at the outer boundaries.

More care must be taken in the region $E_5$ , since for $(y,z) \in E_5$ , supp $(\mathfrak{P}(\!\cdot;\;y,z))$ may not be contained in $\cup_{i=1}^6 E_i$ where $F_0(y,v)$ is larger than $H_0(y,v)$ . Using (2.21), (2.22), (2.27), and (2.28), for $(y,z) \in E_5$ ,

\begin{align*}H_0(y,z) = \int_ \mathcal{E} F_0(y,v)\, \mathfrak{P}(dv;\;y,z) & \geq \int_{(\check{z},b)} F_0(y,v)\, \mathfrak{P}(dv;\;y,z) \\[5pt] & > \frac{H_0(y_1,z_1)}{1-\varepsilon}\, \mathfrak{P}((\check{z},b);\;y,z) > H_0(y_1,z_1).\end{align*}

It thus follows that the infimum $H_0^*$ is not achieved or approached in $\cup_{i=1}^6 E_i$ . Therefore $H_0^*$ is achieved at some $(y_0^*,z_0^*)\in \overline{(\cup_{i=1}^6 E_i)^c}\subsetneq \mathcal{R}$ since $H_0$ is lower semicontinuous on this compact region.

Remark 3. For inventory models with non-deficient supply and specially structured diffusion dynamics under appropriate conditions for the cost functions, the first-order optimality conditions (see (3.17) of [Reference Helmes, Stockbridge and Zhu8]) involving $F_0$ of (2.17) can be utilized to obtain uniqueness of the optimizing policy. The inclusion of the random yield measure adversely affects this analytical approach, and we have been unable to derive general uniqueness results.

Remark 4. Though the statement of Theorem 1 requires Condition 2, a careful examination of the proof reveals that only (2.19) is used, which is implied by Condition 2(c). Thus existence of an optimizer holds when the weaker condition is imposed. In addition, compared with Theorem 2.1 of [Reference Helmes, Stockbridge and Zhu9], our more careful analysis of $H_0$ at the boundaries using (2.24) proves the existence of an optimizing pair without the need of the monotonicity requirement of $F_0$ from Condition 2.3 of [Reference Helmes, Stockbridge and Zhu9].

3. Expected occupation and ordering measures

To establish the general optimality of the $({y_0^*},{z_0^*})$ policy for an inventory problem with random yield, we apply weak convergence arguments with average expected occupation and average expected nominal ordering measures, as well as expected stock-level measures, which we now define. For $(\tau,Z) \in {\mathcal A}$ , let X denote the resulting inventory level process satisfying (1.2). For each $t > 0$ , define the average expected occupation measure ${\mu}_{0,t}$ on $ \mathcal{E}$ , and the average expected nominal ordering measure ${\nu}_{1,t}$ and stock-level measure ${\mu}_{1,t}$ on $\overline{\mathcal{R}}$ , of the inventory process with random yield during the time interval [0, t] by

(3.1)

\begin{equation} \begin{aligned} {\mu}_{0,t}(\Gamma_0) &\;:\!=\; \displaystyle \mbox{$\frac{1}{t}$} \mathbb E\left[\int_0^t I_{\Gamma_0}(X(s))\, ds\right], & \hspace{-0.5cm}\quad \Gamma_0 \in \mathcal{B}( \mathcal{E}), \rule[-15pt]{0pt}{15pt} \\[5pt] {\nu}_{1,t}(\Gamma_1) &\;:\!=\; \displaystyle \mbox{$\frac{1}{t}$} \mathbb E\left[\sum_{k=1}^\infty I_{\{\tau_k \leq t\}} I_{\Gamma_1}(X(\tau_k-\!),Z_k)\right], &\hspace{-0.5cm} \quad \Gamma_1 \in \mathcal{B}(\overline{\mathcal{R}}), \\[5pt] {\mu}_{1,t}(\Gamma_2) &\;:\!=\; \displaystyle \mbox{$\frac{1}{t}$} \mathbb E\left[\sum_{k=1}^\infty I_{\{\tau_k \leq t\}} {I_{\Gamma_2}}(X(\tau_k-\!),X(\tau_k))\right], & \hspace{-0.5cm} \quad \Gamma_2 \in \mathcal{B}(\overline{\mathcal{R}}). \end{aligned}\end{equation}

The distinction between ${\nu}_{1,t}$ and ${\mu}_{1,t}$ is that the former is a measure on the (state, action) space while the latter is a measure on a (state, state) space, both spaces being correctly denoted by $\overline{\mathcal{R}}$ .

Using the construction of the underlying probability model of the inventory process X corresponding to a policy $(\tau,Z) \in {\mathcal A}$ in [Reference Helmes, Stockbridge and Zhu10], we can rewrite the expected stock-level measure (up to time t) as follows:

(3.2)

\begin{equation}\begin{array}{rcll}{\mu}_{1,t}(\Gamma_2) &=& \displaystyle \mbox{$\frac{1}{t}$} \sum_{k=1}^\infty \mathbb E\left[ \mathbb E\left[I_{\{\tau_k \leq t\}} I_{\Gamma_2}(X(\tau_k-\!),X(\tau_k)) | \mathcal{F}_{\tau_k-}\right] \right] & \\[5pt] &=& \displaystyle \mbox{$\frac{1}{t}$} \mathbb E\left[\sum_{k=1}^\infty I_{\{\tau_k \leq t\}} \int I_{\Gamma_2}(X(\tau_k-\!),v)\, Q(dv;\; X(\tau_k-\!),Z_k)\right] & \\[5pt] &=& \displaystyle \mbox{$\frac{1}{t}$} \mathbb E\left[\sum_{k=1}^\infty I_{\{\tau_k \leq t\}} \widehat{I_{\Gamma_2}}(X(\tau_k-\!),Z_k)\right] \rule{0pt}{22pt} &\\[5pt] &=& \displaystyle \int \widehat{I_{\Gamma_2}}(y,z)\, \nu_{1,t}(dy\times dz). & \\[5pt] \end{array}\end{equation}

Consequently, for any bounded, measurable f and $t > 0$ , we have

(3.3)

\begin{eqnarray} \nonumber\mathbb E\Bigg[\sum_{k=1}^{\infty} I_{\{ \tau_{k}\le t\}} Bf (X(\tau_{k}-\!), X(\tau_{k})) \Bigg] &=& \int Bf(y,v) \mu_{1,t} (dy\times d v) \\[5pt] &=& \int \widehat {Bf}(y,z) \nu_{1,t} (dy\times d z).\end{eqnarray}

Furthermore, using the measures $\mu_{0,t}, \mu_{1,t}$ , and $\nu_{1,t}$ , for any $t > 0$ we can write

(3.4)

\begin{align} \nonumber t^{-1} &\mathbb E\bigg[\int_{0}^{t} c_{0}(X(s) ) ds + \sum_{k=1}^{\infty} I_{\{ \tau_{k}\le t\}} c_{1}(X(\tau_{k}-\!), X(\tau_{k})) \bigg] \\[5pt] \nonumber & = \int c_{0}(x) \mu_{0,t}(dx) + \int c_{1}(y,v) \mu_{1,t} (dy\times d v) \\[5pt] & = \int c_{0}(x) \mu_{0,t}(dx) + \int \widehat c_{1}(y,z) \nu_{1,t} (dy\times d z).\end{align}

These observations will be used in Section 5.

Note that for the controlled process X, the expected stock-level measure ${\mu_{1,t}}$ counts the relative number of times the pairs of order-from locations and inventory levels (after the supply has arrived) hit the set $\Gamma_2$ during the time interval [0, t], while the expected nominal ordering measure ${\nu_{1,t}}$ does so for the pairs of order-from locations and control values (hitting the set $\Gamma_1$ ).

Furthermore, if a is a reflecting boundary and if $L_a$ denotes the local time of X at a, define the average expected local time measure $\mu_{2,t}$ for each $t > 0$ to place a point mass on $\{a\}$ given by

(3.5)

\begin{equation} \mu_{2,t}(\{a\}) = \mbox{$\frac{1}{t}$} \mathbb E[L_a(t)].\end{equation}

Remark 5. As in the case of inventory models with non-deficient yield in [Reference Helmes, Stockbridge and Zhu9], the average expected occupation measure ${\mu}_{0,t}$ is a probability measure on $ \mathcal{E}$ for each $t > 0$ . In addition, for each $(\tau,Z) \in {\mathcal A}$ with $J(\tau,Z) < \infty$ , ${\nu}_{1,t}$ has finite mass and $\limsup_{t\rightarrow \infty} {\nu}_{1,t}(\overline{\mathcal{R}}) \leq J(\tau,Z)/k_1$ . Observe that when a is a sticky boundary, ${\mu}_{0,t}$ places a point mass at a for those policies $(\tau,Z)$ that allow the process X to stick at a with positive probability.

Aside from the notation, the next two propositions and their proofs are the same as those in Section 3 of [Reference Helmes, Stockbridge and Zhu9]. The two propositions focus on the relative compactness of the collection of $\mu_{0,t}$ measures and the associated convergence (or not) of the functionals with integrand $c_0$ .

Proposition 3. (Proposition 3.1 of [Reference Helmes, Stockbridge and Zhu9].) Assume Conditions 1–3 are satisfied. For $(\tau,Z) \in {\mathcal A}$ , let X denote the resulting inventory process satisfying (1.2). Let $\{t_i\;:\; i \in \mathbb N\}$ be a sequence such that $\lim_{i\rightarrow \infty} t_i = \infty$ , and for each i, define $\mu_{0,t_i}$ by (3.1). If $J(\tau,Z) < \infty$ , then $\{\mu_{0,t_i}\;:\; i \in \mathbb N\}$ is tight.

Proposition 4. (Proposition 3.2 of [Reference Helmes, Stockbridge and Zhu9].) Assume Conditions 1–3 are satisfied. Let $(\tau,Z) \in \mathcal{A}$ with $J(\tau,Z) < \infty$ , let X satisfy (1.2), and define $\mu_{0,t}$ by (3.1) for each $t > 0$ . Then for each $\mu_0$ attained as a weak limit of some sequence $\{\mu_{0,t_j}\}$ as $t_j \rightarrow \infty$ ,

\begin{align*}\int_{\overline{\mathcal{E}}} c_0(x)\, \mu_0(dx) \leq J(\tau,Z) < \infty.\end{align*}

We note that $c_0$ being infinite at a boundary implies that $\mu_0$ cannot assign any positive mass at this point. In particular, for models in which a is a sticky boundary and $c_0(a) = \infty$ , any policy which allows X to stick at a on a set of positive probability incurs an infinite average expected cost for each t and thus has $J(\tau,Z) = \infty$ . The requirement that $J(\tau,Z) < \infty$ therefore eliminates such $(\tau,Z)$ from consideration.

4. The auxiliary function ${U}_0$

To prove the optimality of an (s, S) policy for inventory models with random yield, we have to further adapt some of the concepts introduced in [Reference Helmes, Stockbridge and Zhu9] to the case under consideration. In particular, we (slightly) modify the function $G_0=g_0 - F_0^*\zeta$ introduced in Section 4 of that paper. To this end, recall that ${H}_0^*$ is the infimum of the function $H_0$ and Condition 3 requires continuity of $c_0$ at the boundary, even for finite, natural boundaries; $c_0$ may take the value $\infty$ at the boundaries. Define the auxiliary function ${U}_0$ on $\mathcal{E}$ by

(4.1)

\begin{equation} U_0 \;:\!=\; g_0 - H_0^* \zeta,\end{equation}

and observe that the function $U_0$ differs from the function $G_0$ only as far as the constant $F_0^*$ is concerned; this constant is replaced by $H_0^*$ . Hence, the (new auxiliary) function ${U}_0$ inherits the essential properties of the function ${G}_0$ . Specifically, it is an element of $C(\mathcal{E})\cap C^2({\mathcal I})$ , and it also extends uniquely to $\overline{\mathcal{E}}$ thanks to the existence of $(y_1,z_1)$ and $(y_2,z_2)$ in Condition 4 or to $c_0$ being infinite at the boundaries. This observation follows immediately when a is attainable and when b is an entrance boundary, since $\zeta$ is finite in these cases. When a or b is a natural boundary, Lemma 1 combined with Condition 4 shows that

(4.2)

\begin{equation} \lim_{x\rightarrow a} {U}_0(x) = \lim_{x\rightarrow a} (g_0(x) - {H}_0^* \zeta(x)) = \lim_{x\rightarrow a} \left(\mbox{$\frac{g_0(x)}{\zeta(x)}$} - {H}_0^*\right) \zeta(x) = -\infty,\end{equation}

and similarly $\lim_{x\rightarrow b} {U}_0(x) = \infty$ .

Remark 6. The function $U_0$ provides the following interpretation of the numerator of the function $H_0$ . Let $(y,z) \in \mathcal{R}$ ; then

\begin{align*}\widehat{c}_1(y,z) + \widehat{BU_0}(y,z) &= \widehat{c}_1(y,z) + \widehat{Bg_0}(y,z) - H_0^* \widehat{B\zeta}(y,z) \\[5pt] &= \left(\frac{\widehat{c}_1(y,z) + \widehat{Bg_0}(y,z)}{\widehat{B\zeta}(y,z)} - H_0^*\right) \widehat{B\zeta}(y,z) \\[5pt] & = (H_0(y,z) - H_0^*) \widehat{B\zeta}(y,z).\end{align*}

Notice that the relation $H_0^* \leq H_0(y,z)$ holds for all $(y,z) \in \mathcal{R}$ . Thus, the function $\widehat{c}_1(y,z) + \widehat{BU_0}(y,z)$ gives the increase in cost over a cycle incurred by using the nominal (y, z) ordering policy rather than an optimal nominal ordering policy.

Like the function $G_0$ , the function $U_0$ also satisfies an (important) system of relations.

Proposition 5. Assume Conditions 1–4 are satisfied. Let $(y_0^*,z_0^*)\in \mathcal{R}$ be given by Theorem 1 and let ${U}_0$ be as in (4.1). Then ${U}_0$ is a solution of the system

\begin{align*}\left\{\begin{array}{c@{\quad}l@{\quad}l@{\quad}l}Af(x) + c_0(x) - {H}_0^* &=& 0, & x\in \mathcal{I}, \\[5pt] \widehat{Bf}(y,z) + \widehat{c}_1(y,z) &\geq& 0, & (y,z) \in \overline{\mathcal{R}}, \\[5pt] f(x_0) &=& 0, & \\[5pt] \widehat{Bf}({y_0^*},{z_0^*}) + \widehat{c}_1({y_0^*},{z_0^*}) &=& 0. &\end{array} \right.\end{align*}

Moreover, the first relation extends by continuity to $\overline{\mathcal{E}}$ .

The proof is straightforward, so it is left to the reader. With the appropriate use of the $\widehat{\;}$ operation in (2.2), the arguments in the proof of the following proposition are identical to those in the proof of Proposition 4.2 in [Reference Helmes, Stockbridge and Zhu9] for models with non-deficient supply. Similarly, Remark 4.2 of [Reference Helmes, Stockbridge and Zhu9] remains valid, explaining why the definitions of $g_0$ and $\zeta$ exclude the solutions to the homogeneous equations in (2.5).

Proposition 6. Assume Conditions 1–4. Let $x_0 \in \mathcal{I}$ be fixed. For $a \leq y < z < b$ , let $(\tau,Z)$ be the (y, z) ordering policy defined by (2.13), and let X satisfy (1.2). Define the process $\widetilde{M}$ by

\begin{align*}\widetilde{M}(t) \;:\!=\; \int_0^t \sigma(X(s)) U'_{\!\!0}(X(s))\, dW(s), \quad t \geq 0.\end{align*}

Then there exists a localizing sequence $\{\beta_n\;:\; n\in \mathbb N\}$ of stopping times such that for each n, $\widetilde{M}(\!\cdot \wedge \beta_n)$ is a martingale, and the following transversality condition holds:

(4.3)

\begin{equation} \lim_{t\rightarrow \infty} \lim_{n\rightarrow \infty} \mbox{$\frac{1}{t}$} \mathbb E[{U}_0(X(t\wedge \beta_n))] = 0.\end{equation}

In addition, for a given (y, z) policy, where z denotes the nominal upper stock level, defining $\mu_0^{(y,z)}$ to be the stationary measure of the controlled state process X and defining $\mu_1^{(y,z)}$ to place point mass $\widehat\kappa=\frac{1}{\widehat{B\zeta}(y,z)\rule{0pt}{10pt}}$ (the long-run frequency of orders) on $\{(y,z)\}$ , we have

\begin{align*} \int_ \mathcal{E} A{U}_0(x)\, \mu_0^{(y,z)}(dx) + \widehat{BU_0}(y,z)\, \widehat\kappa = 0.\end{align*}

5. Policy class $ \mathcal{A}_0$ and optimality

We prove the optimality of an (s, S)-type policy in the class of admissible policies $ \mathcal{A}$ for models with random yield very similarly as in Section 5 of [Reference Helmes, Stockbridge and Zhu9] for models with non-deficient deliveries. However, Proposition 5.3 and Corollary 5.6 of that paper require extensive modifications to apply to models with deficient supply. These results and their proofs are carefully presented in this section.

Again, for models having a reflecting boundary point a, we are only able to prove the optimality of a $({y_0^*},{z_0^*})$ ordering policy within a slightly smaller class of admissible policies than the class $ \mathcal{A}$ . (Note that there is no restriction on the class ${\mathcal A}$ when a is not a reflecting boundary.)

Definition 1. For models in which a is a reflecting boundary point, the class ${\mathcal A}_0 \subset {\mathcal A}$ consists of those policies $(\tau,Z)$ for which the transversality condition on the local-time process $L_a$ of the inventory process X,

(5.1)

\begin{equation} \lim_{t\rightarrow \infty} t^{-1} \mathbb E[L_a(t)] = 0,\end{equation}

holds.

The definition of an appropriate class of test functions $ \mathcal{D}$ is as in [Reference Helmes, Stockbridge and Zhu9].

Definition 2. A function f is in $ \mathcal{D}$ provided it satisfies the following conditions:

(a) $f\in C(\overline{\mathcal{E}}) \cap C^2({\mathcal I})$ , and there exists $L_f < \infty$ such that
1. (i) $|f| \leq L_f$ ,
2. (ii) $(\sigma f')^2 \leq L_f(1+c_0)$ , and
3. (iii) $|Af| \leq L_f$ ;
(b)
1. (i) for all models, at each boundary where $c_0$ is finite, Af extends continuously to the boundary with a finite value;
2. (ii) when a is a reflecting boundary, $|f'(a)| < \infty$ ; and
3. (iii) when a is a sticky boundary and $c_0(a) < \infty$ , $\sigma f'$ extends continuously at a to a finite value.

Using the class $ \mathcal{D}$ , we have the following version of the limiting adjoint equation for inventory models with random supply.

Proposition 7. Assume Conditions 1–3. Let $(\tau,Z) \in \mathcal{A}_0$ with $J(\tau,Z) < \infty$ , and let X satisfy (1.2). For $t > 0$ , define $\mu_{0,t}$ , $\mu_{1,t}$ , and $\nu_{1,t}$ by (3.1), and let $\mu_0$ be such that $\mu_{0,t_j} \Rightarrow \mu_0$ as $j\rightarrow \infty$ for some sequence $\{t_j\;:\; j \in \mathbb N\}$ with $\lim_{j\rightarrow \infty} t_j = \infty$ . Then the following limiting adjoint relation holds:

(5.2)

\begin{equation} \forall f \in \mathcal{D}, \quad \int_{\overline{\mathcal{E}}} Af(x)\, {\mu}_0(dx) + \lim_{j\rightarrow \infty} \int_{\overline{\mathcal{R}}} \widehat{Bf}(y,z)\, {\nu}_{1,t_j}(dy\times dz) = 0.\end{equation}

Proof. Using the same arguments as those in the proof of Proposition 5.1 in [Reference Helmes, Stockbridge and Zhu9], we can derive the following:

\begin{align} \int_{\overline{\mathcal{E}}} Af(x)\, {\mu}_0(dx) + \lim_{j\rightarrow \infty} \int_{\overline{\mathcal{R}}} {Bf}(y,v)\, \mu_{1,t_j}(dy\times dv) = 0.\end{align}

Then (5.2) follows from (3.3).

Using a proof similar to that of Corollary 5.1 in [Reference Helmes, Stockbridge and Zhu9], along with Proposition 7, we obtain the existence of an optimal $({y_0^*},{z_0^*})$ policy when $U_0 \in \mathcal{D}$ .

Corollary 1. Assume Conditions 1–4. Suppose $U_0 \in \mathcal{D}$ . Then for every $(\tau,Z) \in \mathcal{A}_0$ , we have $J(\tau,Z) \geq H_0^*$ , and the $({y_0^*},{z_0^*})$ ordering policy is optimal in the class $ \mathcal{A}_0$ , where $({y_0^*},{z_0^*})$ is given by Theorem 1.

Unfortunately, it is frequently the case that $U_0 \notin \mathcal{D}$ , so it is necessary to approximate $U_0$ by functions in $\mathcal{D}$ and pass to a limit. Recall from (4.2) that when a is a natural boundary, $U_0(a) \;:\!=\; \lim_{x\rightarrow a} U_0(x) = -\infty$ , and similarly $U_0(b)\;:\!=\; \lim_{x\rightarrow b} U_0(x) = \infty$ when b is natural. To proceed, we impose the following set of conditions.

Condition 5. Let $U_0$ be as defined in (4.1).

(a) There exist some $L < \infty$ and some $y_1 > a$ such that the following hold:
1. (i) For models having $c_0(a) = \infty$ ,
  \begin{align*}\frac{c_0(x)}{(1 + |{U}_0(x)|)^2} + \frac{(\sigma(x) U'_{\!\!0}(x))^2}{(1 + |{U}_0(x)|)^3}\leq L, \qquad a < x < y_1.\end{align*}
2. (ii) For models in which $c_0(a) < \infty$ , there is some $\epsilon \in (0,1)$ such that
  \begin{align*}\frac{(\sigma(x) U'_{\!\!0}(x))^2}{(1 + |{U}_0(x)|)^{2+\epsilon}} \leq L, \qquad a \leq x < y_1.\end{align*}
(b) There exist some $L < \infty$ and some $z_1 < b$ such that the following hold:
1. (i) For models having $c_0(b) = \infty$ ,
  \begin{align*}\frac{c_0(x)}{(1 + |{U}_0(x)|)^2}+ \frac{(\sigma(x)U'_{\!\!0}(x))^2}{(1+|{U}_0(x)|)(1+c_0(x))} \leq L, \qquad z_1 < x < b.\end{align*}
2. (ii) For models in which $c_0(b) < \infty$ , there is some $\epsilon \in (0,1)$ such that
  \begin{align*}\frac{(\sigma(x) U'_{\!\!0}(x))^2}{(1 + |{U}_0(x)|)^{2+\epsilon}} + \frac{(\sigma(x)U'_{\!\!0}(x))^2}{(1+|{U}_0(x)|)(1+c_0(x))} \leq L, \qquad z_1 < x \leq b.\end{align*}
(c)
1. (i) When ${U}_0(a) > -\infty$ , or when a is a sticky boundary with $c_0(a) < \infty$ , $\displaystyle\lim_{x\rightarrow a} \sigma(x) U'_{\!\!0}(x)$ exists and is finite.
2. (ii) When a is a reflecting boundary, $U'_{\!\!0}(a)$ exists and is finite.
3. (iii) When ${U}_0(b) < \infty$ , $\displaystyle\lim_{x\rightarrow b} \sigma(x) U'_{\!\!0}(x)$ exists and is finite.

First note that the bound in Condition 5(b,i) at the boundary b is more restrictive than the similar bound at a in Condition 5(a,i), since

(5.3)

\begin{equation} \frac{(\sigma(x) U'_{\!\!0}(x))^2}{(1 + |U_0(x)|)^3} = \frac{(\sigma(x)U'_{\!\!0}(x))^2}{(1+|U_0(x)|)(1+c_0(x))} \cdot \frac{1+c_0(x)}{(1+|U_0(x)|)^2} \leq L(1+L).\end{equation}

The need for tighter restrictions at the boundary b than at a is not unexpected, since there is no way to control the process from diffusing upwards, whereas ordering can prevent the process from diffusing towards a.

The reason for having two different conditions in Condition 5(a,b) based on whether $c_0$ at the boundary is finite or infinite is that any limiting measure $\mu_0$ of the collection $\{\mu_{0,t}\}$ arising from an admissible policy $(\tau,Z)$ having finite cost $J(\tau,Z)$ must place no $\mu_0$ -mass at a boundary where $c_0$ is infinite. A weak limit $\mu_0$ may have positive mass at a boundary when $c_0$ is finite. Also notice the subtle assumption in (a,ii) and (b,ii) of Condition 5 that the bounds extend to the boundary, whereas there is no assumption needed at the boundary in (a,i) and (b,i).

A sequence of functions $U_n \in \mathcal{D}$ which will approximate the auxiliary function $U_0$ will be defined using the function $h(x) = (\!-\!\frac{1}{8} x^4 + \frac{3}{4} x^2 + \frac{3}{8})I_{[-1,1]}(x) + |x|\, I_{[-1,1]^c}(x)$ defined in Section 5 of [Reference Helmes, Stockbridge and Zhu9]. While the formal definitions of $U_n$ and $G_n$ are similar, there are striking differences between these two approximations when we analyze integrals of the form $\int_{\overline{\mathcal{R}}} \widehat{BU_n}(y,z) \, {\nu}_{1,t_j}(dy\times dz)$ and $\int_{\overline{\mathcal{R}}} BG_n(y,z) \, \mu_{1,t_j}(dy\times dz)$ ; see the proof of Proposition 9 below.

In the next lemma, we define the sequence of functions $\{{U_n}\;:\; n\in \mathbb N\} \subset \mathcal{D}$ which approximate $U_0$ , and in the lemma following that one we examine the convergence of $A{U_n}$ and ${B{U_n}}$ .

Lemma 3. Assume Conditions 1–5 with $U_0$ defined by (4.1). For each $n \in \mathbb N$ , define the function $U_n$ by

(5.4)

\begin{equation} U_n = \frac{U_0}{1+\frac{1}{n} h(U_0)}.\end{equation}

Then $U_n \in \mathcal{D}$ and

\begin{align*} & \lim_{n\rightarrow \infty} AU_n(x) = AU_0(x), \quad \forall x \in \mathcal{I}, \\[5pt] & \lim_{n\rightarrow \infty} \widehat{BU_n}(y,z) = \widehat{BU_0}(y,z), \quad \forall (y,z) \in \overline{\mathcal{R}}.\end{align*}

Moreover, at each boundary where $c_0$ is finite, $\lim_{n\rightarrow \infty} AU_n \geq AU_0$ .

Proof. The fact that $U_n \in \mathcal{D}$ and the convergence of $AU_{n}$ can be proven using arguments similar to those in the proofs of Lemmas 5.1 and 5.2 of [Reference Helmes, Stockbridge and Zhu9]. Similarly, we can show that $\lim_{n\rightarrow \infty}{BU_n}(y,v) = {BU_0}(y,v)$ for all $(y, v) \in \overline{\mathcal R}$ . This, together with the bounded convergence theorem, implies the desired convergence of $ \widehat{BU_n}(y,z)$ to $\widehat{B U_{0}}(y,z)$ .

The following proposition gives the first important result involving $AU_n$ and $c_0$ .

Proposition 8. Assume Conditions 1–5. Let $(\tau,Z) \in \mathcal{A}_0$ with $J(\tau,Z) < \infty$ , let X satisfy (1.2), let $ {\mu}_{0,t}$ be defined by (3.1), and let $ {\mu}_0$ be any weak limit of $\{{\mu}_{0,t}\}$ as $t\rightarrow \infty$ . Define $U_n$ by (5.4). Then

\begin{align*}\liminf_{n\rightarrow \infty} \int_{\overline{\mathcal{E}}} (AU_n(x) + c_0(x))\, {\mu}_0(dx) \geq \int_{\overline{\mathcal{E}}} (AU_0(x) + c_0(x))\, {\mu}_0(dx) \geq H_0^*.\end{align*}

The proof uses Condition 5 and is again very similar to the proof of Proposition 5.2 for non-deficient supply models in [Reference Helmes, Stockbridge and Zhu9]. It is therefore left to the reader.

We next establish a similar result involving $\widehat{BU_n}$ and $\widehat{c_1}$ , though the lack of tightness of $\{ {\nu}_{1,t}\}$ means that the result cannot be expressed in terms of a limiting measure.

Proposition 9. Assume Conditions 1–5. Let $(\tau,Z) \in \mathcal{A}_0$ with $J(\tau,Z) < \infty$ , and let X satisfy (1.2). Let $\{t_j\;:\;j\in \mathbb N\}$ be a sequence such that $\lim_{j\rightarrow \infty} t_j= \infty$ and

\begin{align*}J(\tau,Z) = \lim_{j\rightarrow \infty} \mbox{$\frac{1}{t_j}$}\mathbb E\Bigg[\int_0^{t_j} c_0(X(s))\, ds + \sum_{k=1}^\infty I_{\{\tau_k\leq t_j\}} c_1(X(\tau_k-\!),X(\tau_k))\Bigg].\end{align*}

For each j, define $\nu_{1,t_j}$ by (3.1), and with $U_0$ given in (4.1), define $U_n$ by (5.4). Then

(5.5)

\begin{equation} \liminf_{n\rightarrow \infty} \liminf_{j\rightarrow \infty} \int_{\overline{\mathcal{R}}} (\widehat{BU_n}(y,z) + \widehat{c_1}(y,z))\, \nu_{1,t_j}(dy\times dz) \geq 0.\end{equation}

The proof of this proposition is very long and technical. In a nutshell, the desired assertion (5.5) follows from the progression of Lemmas 4, 5, 6, and 7. Let us briefly describe the idea here. First we observe in Lemma 4 that (5.5) holds true if the function $U_{0}$ is uniformly bounded. Consequently, we only need to focus on the case when $U_{0}$ is unbounded, which necessarily implies that either $U_{0}(a) =-\infty$ or $U_{0}(b) =\infty$ . We present only the case when $U_{0}(a) =-\infty$ and $U_{0}(b) =\infty$ ; the other cases (either $U_{0}(a) >-\infty$ and $U_{0}(b) =\infty$ , or $U_{0}(a) =-\infty$ and $U_{0}(b) < \infty$ ) follow from similar arguments and are left to the reader. In Lemma 5 we observe that the integrand $ \widehat{BU_n}(y,z) + \widehat{c_1}(y,z)$ of (5.5) is bounded below by the sum of two terms $\widehat R_{n,1} $ and $\widehat R_{n,2}$ . Then we show in Lemmas 6 and 7 that the double limits inferior involving $\widehat R_{n,2} $ and $\widehat R_{n,1}$ , respectively, are nonnegative, thus establishing (5.5).

The analyses of the two double limits inferior follow similar lines of reasoning, though significantly more effort is required for the term involving $\widehat R_{n,1} $ . First $\overline{\mathcal{R}}$ is partitioned into appropriate subsets in the proofs of Lemmas 6 and 7. Detailed analyses reveal that the inner integrand $\widehat R_{n,1} $ or $\widehat R_{n,2}$ is bounded below over these subsets of $\overline{\mathcal R}$ , and taking limits leads to the desired result. The limiting result for $\widehat R_{n,1}$ requires the ASC condition of Condition 2(c) for the region $\Gamma_4$ in Figure 3. For the subset $\Gamma_5$ of $\overline{\mathcal R}$ in Figure 3, the analysis of the double limit inferior requires subtle weak convergence analysis related to the measures $\{\nu_{1,t_j}\}$ as well.

We now supply the details of the arguments.

Lemma 4. Let $U_0$ be defined by (4.1). If $U_{0}$ is uniformly bounded, then (5.5) holds.

Proof. Suppose $\sup_{x\in {\mathcal I}} |U_{0}(x)| \le K$ for some positive constant $K\geq 1$ . Recall the nonnegativity of $\widehat{BU}_0 + \widehat{c}_1$ from Proposition 5. Then

(5.6)

\begin{align} \nonumber & \int_{\overline{\mathcal{R}}} (\widehat{BU_n}(y,z) + \widehat{c_1}(y,z))\, \nu_{1,t_j}(dy\times dz) \\[5pt] \nonumber & \ \ =\int_{\overline{\mathcal{R}}} (\widehat{BU_0}(y,z) + \widehat{c_1}(y,z))\, \nu_{1,t_j}(dy\times dz) \\[5pt] \nonumber & \ \ \quad + \int_{\overline{\mathcal{R}}} (\widehat{BU_n}(y,z) -\widehat{BU_0}(y,z))\, \nu_{1,t_j}(dy\times dz) \\[5pt] &\ \ \ge \int_{\overline{\mathcal{R}}} (\widehat{BU_n}(y,z) -\widehat{BU_0}(y,z))\, \nu_{1,t_j}(dy\times dz).\end{align}

Now, using the definition of $U_{n}(\!\cdot\!)$ , for any $(y,v) \in \overline{\mathcal R}$ ,

\begin{align*} | B U_{n}(y,v) - BU_{0}(y,v)| & = \bigg|\frac{U_{0}(v)}{1+ \frac1n h(U_{0}(v))} - \frac{U_{0}(y)}{1+ \frac1n h(U_{0}(y))} - U_{0}(v) + U_{0}(y) \bigg| \\[5pt] & = \bigg|\frac{U_{0}(y) h(U_{0}(y))}{n(1+ \frac1n h(U_{0}(y)))}- \frac{U_{0}(v) h(U_{0}(v))}{n(1+ \frac1n h(U_{0}(v)))}\bigg|\\[5pt] & \le \mbox{$\frac{2K^{2}}{n}$}.\end{align*}

As a result, for any $(y,z) \in \overline{\mathcal R}$ we have

\begin{align*} \widehat{BU_n}(y,z) -\widehat{BU_0}(y,z)) & = \int_{y}^{z}[B U_{n}(y,v) - BU_{0}(y,v)] Q(dv;\; y,z) \\[5pt] & \ge - \int_{y}^{z} \mbox{$\frac{2K^{2}}{n}$}\, Q(dv;\; y,z) = - \mbox{$\frac{2K^{2}}{n}$}.\end{align*}

Employing this lower bound in (5.6) gives

\begin{align} \int_{\overline{\mathcal{R}}} (\widehat{BU_n}(y,z) + \widehat{c_1}(y,z))\, \nu_{1,t_j}(dy\times dz) \ge \int_{\overline{\mathcal{R}}} - \mbox{$\frac{2K^{2}}{n}$} \, \nu_{1,t_j}(dy\times dz) = - \mbox{$\frac{2K^{2}}{n}$}\, \nu_{1,t_j}(\overline{\mathcal{R}}).\end{align}

The bound on the asymptotic limit of $\nu_{1,t_j}(\overline{\mathcal{R}})$ as $j\to \infty$ in Remark 5 implies that

\begin{align}\liminf_{j\to\infty} \int_{\overline{\mathcal{R}}} (\widehat{BU_n}(y,z) + \widehat{c_1}(y,z))\, \nu_{1,t_j}(dy\times dz) \geq - \mbox{$\frac{2K^2\, J(\tau,Y)}{n\, k_1}$}.\end{align}

Now letting $n\to \infty$ yields (5.5).

For the remaining lemmas, assume $U_0$ is unbounded with $U_0(a)=-\infty$ and $U_0(b)=\infty$ .

Lemma 5. Let $U_0$ be defined by (4.1) and $U_n$ by (5.4). Then

(5.7)

\begin{equation}\begin{aligned}(\widehat{B U_{n}} + \widehat c_{1})(y,z) & \geq \int_{y}^{z} R_{n,1}(y,v) Q(dv;\; y,z) + \int_{y}^{z} R_{n,2}(y,v) Q(dv;\; y,z)\\[5pt] & = \widehat R_{n,1}(y,z) +\widehat R_{n,2}(y,z),\end{aligned}\end{equation}

where

(5.8)

\begin{align} R_{n,1}(y,v) \;:\!=\; \frac{BU_{0}(y,v)+c_1(y,v)}{[1+\frac{1}{n} h( U_0(v))][1+\frac{1}{n} h(U_0(y))]},\end{align}

(5.9)

\begin{align} R_{n,2}(y,v) \;:\!=\; \frac{U_{0}( v) h(U_{0}(y)) - U_{0}(y) h(U_{0}(v))}{n [1+\frac{1}{n} h(U_0(v))][1+\frac{1}{n} h(U_0(y))]}. \end{align}

Proof. Since $c_1$ is strictly positive, observe that

(5.10)

\begin{align} \nonumber c&_1 (y,v) + BU_{n}(y,v) \\[5pt] \nonumber &= c_1(y,v) + \frac{U_{0}(v)}{1+\frac{1}{n} h(U_{0}(v))} - \frac{U_{0}(y)}{1+\frac{1}{n} h(U_{0}(y))} \\[5pt] \nonumber &=\; \frac{BU_{0}(y,v)+c_1(y,v)}{\left[1+\frac{1}{n} h(U_{0}(v))\right]\left[1+\frac{1}{n} h(U_{0}(y))\right]} + \frac{U_{0}( v) h(U_{0}(y)) - U_{0}(y) h(U_{0}(v))}{n \left[1+\frac{1}{n} h(U_{0}(v))\right]\left[1+\frac{1}{n} h(U_{0}(y))\right]}\\[5pt] \nonumber & \ \ \ +\;c_1(y,v) \left(1 - \frac{1}{\left[1+\frac{1}{n} h(U_{0}(v))\right]\left[1+\frac{1}{n} h(U_{0}(y))\right]}\right) \\[5pt] \nonumber &\geq \frac{BU_{0}(y,v)+c_1(y,v)}{\left[1+\frac{1}{n} h( U_0(v))\right]\left[1+\frac{1}{n} h(U_0(y))\right]} + \frac{U_{0}( v) h(U_{0}(y)) - U_{0}(y) h(U_{0}(v))}{n \left[1+\frac{1}{n} h(U_{0}(v))\right]\left[1+\frac{1}{n} h(U_{0}(y))\right]}\\[5pt] &= R_{n,1}(y,v) + R_{n,2}(y,v).\end{align}

Integrating with respect to $Q(\!\cdot;\;y,z)$ yields (5.7).

We now demonstrate that the double limit inferior of $R_{n,2}$ is nonnegative.

Lemma 6. Let $R_{n,2}$ be defined by (5.9). Then

(5.11)

\begin{equation} \liminf_{n\rightarrow \infty} \liminf_{j\rightarrow \infty} \int_{\overline{\mathcal R}} \widehat R_{n,2}(y,z)\, \nu_{1,t_j}(dy\times dz) \geq 0.\end{equation}

Proof. Since $U_{0}(a) = -\infty$ , there exists some $y_1$ , with $y_1 > a$ , such that $U_{0}(x) < -1$ for all $x < y_1$ . Recall that $h(x) = |x|$ on $(\!-\!\infty,-1)$ and $h(x) \geq |x|$ for all x. Thus it follows that for all (y, v) with $y \leq y_1$ ,

(5.12)

\begin{equation}R_{n,2}(y,v) = \frac{|U_{0}(y)|(U_{0}(v) + h(U_{0}(v)))}{n \left[1+\frac{1}{n} h(U_{0}(v))\right]\left[1+\frac{1}{n} |U_{0}(y)|\right]} \geq 0.\end{equation}

Define $F_{1} \;:\!=\; \{(y,v)\in \overline{\mathcal R}\;:\; a < y \le y_{1}\}$ .

Similarly, the condition $U_{0}(b) = \infty$ implies that there exists some $z_1$ with $ z_1 < b$ such that $U_{0}(v) \geq 1$ for $z_1 < v < b$ . Thus, for (y, v) with $v > z_1$ ,

(5.13)

\begin{equation}R_{n,2}(y,v) = \frac{U_{0}(v)(h(U_{0}(y)) - U_{0}(y))}{n [1+\frac{1}{n} h(U_{0}(v))][1+\frac{1}{n} h(U_{0}(y))]} \geq 0.\end{equation}

Set $F_{2} \;:\!=\; \{(y,v)\in \overline{\mathcal R}\;:\;\; y_{1} < y \le v, z_{1} \le v < b \}$ , and also define the set $F_{3} \;:\!=\; \overline{\mathcal R}\setminus (F_{1}\cup F_{2})$ . These sets are illustrated in Figure 2.

Figure 2. The regions $F_1$ , $F_2$ , and $F_3$ .

For $(y,z) \in F_{1} $ , (5.12) implies that

(5.14)

\begin{align}\widehat R_{n,2}(y,z) & = \int_{y}^{z} R_{n,2}(y,v) Q(dv;\;y,z) \ge 0.\end{align}

We establish the result for the regions $F_2$ and $F_3$ using a common argument. Concerning the region $F_2$ , the nonnegativity from (5.13) implies that for $(y,z)\in F_{2}$ ,

\begin{align*} \widehat R_{n,2}(y,z) & = \int_{y}^{z_{1}} R_{n,2}(y,v) Q(dv;\;y,z) + \int_{z_{1}}^{z} R_{n,2}(y,v) Q(dv;\;y,z) \\[5pt] & \ge \int_{y}^{z_{1}} R_{n,2}(y,v) Q(dv;\;y,z).\end{align*}

For $(y,z)\in F_3$ , $\widehat R_{n,2}(y,z) = \int_{y}^{z} R_{n,2}(y,v) Q(dv;\;y,z)$ . In each of these integrals, the upper limit of integration is bounded by $z_1$ , so for each $(y,z)\in F_2\cup F_3$ we are only considering integrands $R_{n,2}$ on the closure of $F_3$ .

Since the function $ U_{0}( v) h(U_{0}(y)) - U_{0}(y) h(U_{0}(v))$ is continuous, it is uniformly bounded on $\overline{F}_3$ . It follows that there exists some constant $K >0$ such that $|U_{0}(v) h(U_{0}(y)) - U_{0}(y) h(U_{0}(v))| \le K$ and hence $|R_{n,2}(y,v) | \le \frac{K}{n}$ . This, in turn, implies that

(5.15)

\begin{align}\widehat R_{n,2}(y,z) \ge \int_{y}^{z\wedge z_{1}} R_{n,2}(y,v) Q(dv;\;y,z) \ge - \frac{K}{n}\int_{y}^{z\wedge z_{1}} Q(dv;\;y,z) \ge - \frac{K}{n}.\end{align}

The inequalities (5.14) and (5.15) imply that $\widehat R_{n,2}(y,z) \ge - \frac{K}{n}$ for all $(y,z) \in \overline{\mathcal R}$ , and hence the asymptotic bound on the masses $\nu_{1,t_j}(\overline{\mathcal{R}})$ in Remark 5 implies

\begin{align*}\liminf_{n\rightarrow \infty} \liminf_{j\rightarrow \infty} \int_{\overline{\mathcal R}} \widehat R_{n,2}(y,z)\, \nu_{1,t_j}(dy\times dz) \geq 0.\end{align*}

We now turn to $R_{n,1}$ , for which the proof of nonnegativity of the double limit inferior is more challenging.

Figure 3. Partition of $\overline{\mathcal{R}}$ .

Lemma 7. Let $R_{n,1}$ be given by (5.8) and define $\widehat{R}_{n,1}$ by (5.7). Then

(5.16)

\begin{equation} \liminf_{n\rightarrow \infty} \liminf_{j\rightarrow \infty} \int_{\overline{\mathcal R}} \widehat R_{n,1}(y,z)\, \nu_{1,t_j}(dy\times dz) \geq 0.\end{equation}

Proof. We begin with a similar line of reasoning as that used for Lemma 6, by establishing lower bounds on $R_{n,1}$ in various regions of $\overline{\mathcal{R}}$ . Figure 3 indicates the partition of $\overline{\mathcal{R}}$ used in the proof. The sets $\Gamma_1$ and $\Gamma_2$ are defined slightly differently depending on whether a is attainable or natural and whether b is entrance or natural. When a is attainable and b is entrance, the partition can be slightly simplified. In order that the proof apply to all types of boundary points, however, we adopt the same partition for every type of boundary.

• When a is attainable, $\zeta$ is bounded below on $\overline{ \mathcal{E}}$ . As a result,
(5.17) \begin{equation} \frac{BU_0(y,v) + c_1(y,v)}{B\zeta(y,v)} = \frac{Bg_0(y,v) + c_1(y,v)}{B\zeta(y,v)} - H_0^* \geq \frac{k_1}{B\zeta(y,v)} - H_0^*.\end{equation}
Let $z_0$ satisfy $B\zeta(a,z_0) = \frac{k_1}{H_0^*}$ , and define the set $\Gamma_1 = \{(y,z)\in \overline{\mathcal{R}}\;:\; a \leq y \leq z < z_0\}$ . Then the monotonicity of $\zeta$ yields $0 < B\zeta(y,v) \leq \frac{k_1}{H_0^*}$ for $(y,v)\in \Gamma_1$ with $y < v$ , and hence $BU_0(y,v) + c_1(y,v) \geq 0$ , implying that $R_{n,1} \geq 0$ as well. The continuity of $R_{n,1}$ up to the diagonal of $\Gamma_1$ then establishes $R_{n,1} \geq 0$ on $\Gamma_1$ .
• When a is a natural boundary, (2.7) of Lemma 1 with Condition 4(a) implies that there is some $z_{0} > a$ such that $\frac{Bg_{0}(y,v)}{B\zeta(y,v)} \geq H_0(y_1,z_1) \ge H_{0}^{*} $ for all $ y \le v \le z_{0}.$ Define the region
\begin{align*}\Gamma_1 \;:\!=\; \{(y,z)\in \overline{\mathcal{R}}\;:\; a < y \leq z < z_0\}.\end{align*}
As a result of the lower bound on the ratio, for $(y,v) \in \Gamma_1$ ,
(5.18) \begin{equation} 0 \leq Bg_0(y,v) - H_0^* B\zeta(y,v) = BU_0(y,v) < BU_0(y,v) + c_1(y,v).\end{equation}
Therefore, from its definition, $R_{n,1} > 0$ on $\Gamma_1$ .
• When b is an entrance boundary, $\zeta$ is bounded above on $\overline{ \mathcal{E}}$ . Set $y_1$ so that $B\zeta(y_1,b) = \frac{k_1}{H_0^*}$ . Define $\Gamma_2 = \{(y,z)\in \overline{\mathcal{R}}\;:\;\; y_1 < y \leq z \leq b\}$ . Using the estimate in (5.17) and arguing similarly as for the boundary a, we deduce that $R_{n,1} \geq 0$ on $\Gamma_2$ .
• When b is a natural boundary, (2.7) of Lemma 1 with Condition 4(b) implies that there is some $y_{1} < b$ such that $\frac{Bg_{0}(y,v)}{B\zeta(y,v)} \ge H_{0}^{*} $ for all $y_{1}\le y \le v$ . Define the region
\begin{align*}\Gamma_2\;:\!=\;\{y,z)\in \overline{\mathcal{R}}\;:\;\; y_1 < y \leq z < b\}.\end{align*}
Then for $(y,v) \in \Gamma_2$ the relation (5.18) again holds, implying that $R_{n,1}(y,v) > 0$ .
• Let $z_0$ be as in the definition of $\Gamma_1$ . Define $K_1 = \inf\{U_0(v)\;:\; z_0 \leq v < b\}$ and observe that $K_1 > -\infty$ . Since $U_{0}(a) = -\infty$ , the continuity of $U_0$ at a implies that there is some $y_0$ with $a < y_{0} < y_{1}\wedge z_{0}$ such that $U_{0}(y) \le K_{1}$ for all $y < y_{0}$ . Define
\begin{align*}\Gamma_{3}\;:\!=\; \{(y,v)\in \overline{\mathcal R}\;:\; a < y< y_{0}, v \ge z_{0}\}.\end{align*}
Then for all $(y,v) \in \Gamma_{3}$ ,
(5.19) \begin{align} \nonumber R_{n,1}(y,v) &= \frac{BU_{0}(y,v)+c_1(y,v)}{[1+\frac{1}{n} h(U_{0}(v))][1+\frac{1}{n} h(U_{0}(y))]} \\[5pt] &\ge \frac{K_{1}- K_{1} + c_1(y,v)}{[1+\frac{1}{n} h(U_{0}(v))][1+\frac{1}{n} h(U_{0}(y))]} > 0.\end{align}
• Following a similar argument, let $y_0$ and $y_1$ be as chosen above. Define $K_2 = \sup\{|U_0(y)|\;:\;\; y_0 \leq y \leq y_1\}$ . Since $U_0(b) = \infty$ , continuity implies existence of some $\widetilde{z}_1 < b$ for which $U_0(v) \geq K_2$ for all $v \geq \widetilde{z}_1$ . Define the region
\begin{align*}\widetilde\Gamma_4 = \{(y,z)\in \overline{\mathcal{R}}\;:\;\; y_0 \leq y \leq y_1, z \geq \widetilde{z}_1\}.\end{align*}
For all $(y,v) \in \widetilde\Gamma_4$ , the numerator of $R_{n,1}$ has the bound $BU_0(y,v) + c_0(y,v) \geq K_2 - K_2 + c_0(y,v) > 0$ , implying that $R_{n,1} > 0$ on $\widetilde\Gamma_4$ .

Turning briefly to $\widehat{R}_{n,1}(y,v) = \int_y^z R_{n,1}(y,v)\, Q(dv;\;y,z)$ , notice that this is a line integral over the vertical segment (y, y) to (y, z). For $\Gamma_1$ , $\Gamma_2$ , and $\Gamma_3$ , these segments are entirely contained in the regions, so it immediately follows that $\widehat{R}_{n,1} \geq 0$ on these regions. For $(y,z)\in \widetilde\Gamma_4$ , the segment from (y, y) to (y, z) is not contained in $\widetilde\Gamma_4$ , and it is not necessary that $R_{n,1} \geq 0$ on the segment, so a more careful analysis is required.

Let $y_{0}, y_{1}, $ and $\widetilde{z}_{1}$ be the values used to define the subsets $\Gamma_2$ , $\Gamma_3$ , and $\widetilde\Gamma_4$ . Recall $K_{2} = \sup_{y_{0}\le y \le y_{1}}|U_{0}(y)|$ . Now set

\begin{align*}K_{3}\;:\!=\; \sup_{y_{0}\le y \le y_{1}, y\le v \le \widetilde{z}_{1}} |BU_{0}(y,v)+c_1(y,v)|. \end{align*}

Note that $| R_{n,1}(y,v) | \le K_{3} $ for all $n\in \mathbb N$ and $(y,v) \in \overline{\mathcal R}$ with $y_{0}\le y \le y_{1}$ and $y\le v \le \widetilde{z}_{1}$ . In addition, observe that for any $(y,v) \in \widetilde\Gamma_4$ ,

\begin{align*} R_{n,1}(y,v) & = \frac{BU_{0}(y,v)+c_1(y,v)}{[1+\frac{1}{n} h(U_{0}(v))][1+\frac{1}{n} h(U_{0}(y))]} \\[5pt] & \ge \frac{ U_{0}(v) - \sup_{y_{0}\le y \le y_{1}}|U_{0}(y)| }{[1+\frac{1}{n} h(U_{0}(v))][1+\frac{1}{n} \cdot1\vee \sup_{y\in [y_{0}, y_{1}]} |U_{0}(y)|]} \\[5pt] & = \frac{ U_{0}(v) -K_{2}}{[1+\frac{1}{n} h(U_{0}(v))][1+\frac{1}{n} \cdot1\vee K_{2}] } \\[5pt] & \;=\!:\; f_{n}(v).\end{align*}

By the choice of $\widetilde{z}_1$ and the definition of $K_2$ , it is easy to see that for each $v \ge \widetilde{z}_{1}$ fixed, $f_{n}(v)$ is increasing in n. Moreover, since $\lim_{v\to b} U_{0}(v) = \infty$ , we have $\lim_{v\to b} f_{n}(v) = \frac{n}{1+\frac{1\vee K_{2}}{n}\rule{0pt}{10pt}}$ for each n.

Using the interval $[y_0,y_1]$ , let $\delta>0$ be given by Condition 2(c). We first fix an $N >(\frac{ 4K_{3}}{\delta}+1)\vee K_2$ . Since $\lim_{v\to b} f_{N}(v) = \frac{N}{1+\frac{1\vee K_{2}}{N}\rule{0pt}{10pt}}$ , we can find a $z_1$ with $\widetilde{z}_1 < z_1 < b$ such that $f_{N}(v) \ge \frac{N}{2} \ge \frac{2K_{3}}{\delta}$ for all $v \ge z_1.$ Consequently, for all $n \ge N$ and (y, v) with $y_{0} \le y \le y_{1}$ and $v \geq z_1$ , we have

\begin{align}R_{n,1}(y,v) \ge f_{n}(v) \ge f_{N}(v) \ge \frac{2K_{3}}{\delta}.\end{align}

By Condition 2(c), there exists a $z_{2} > z_{1}$ such that

\begin{align*}\inf_{y\in [y_{0}, y_{1}]} Q((z_{1}, b);\;y,z) \ge \frac{\delta}{2},\quad \textrm{ for all }z> z_{2}.\end{align*}

Define $\Gamma_{4}\;:\!=\;\{(y,v) \in \overline{\mathcal R}\;:\;\; y_{0}\le y \le y_{1} \textrm{ and }v > z_2\}$ . Recall that supp $Q(\!\cdot;\;y,z)\subset (y,z]$ , so $Q((z_1,b);\;y,z) = Q((z_1,z];\;y,z)$ . Then for all $n \ge N$ and all $(y,z) \in \Gamma_4$ ,

\begin{align*} \widehat R_{n,1}(y,z) & = \int_{(y,\widetilde{z}_{1}]} R_{n,1}(y,v)\, Q(dv;\;y,z) + \int_{(\widetilde{z}_{1},z_{1}]} R_{n,1}(y,v)\, Q(dv;\;y,z) \\[5pt] & \qquad + \int_{(z_{1},z] } R_{n,1}(y,v)\, Q(dv;\;y,z) \\[5pt] & \ge \int_{(y,\widetilde{z}_{1}]} (\!-\!K_{3})\, Q(dv;\;y,z) + \int_{(\widetilde{z}_{1},z_{1}]} 0\, Q(dv;\;y,z) \\[5pt] & \qquad + \int_{(z_{1},z] } \frac{2K_{3}}{\delta}\, Q(dv;\;y,z)\\[5pt] & \ge -K_{3} + \frac{2K_{3}}{\delta}\cdot \frac{\delta}{2} =0.\end{align*}

Summarizing, on the set $\Gamma = \cup_{i=1}^4 \Gamma_i$ , the function $\widehat{R}_{n,1} \geq 0$ , so

(5.20)

\begin{equation} \liminf_{n\to\infty} \liminf_{j\to\infty} \int_{\Gamma} \widehat{R}_{n,1}(y,z)\, \nu_{1,t_j}(dy\times dz) \geq 0.\end{equation}

Now define the set $\Gamma_5 = \overline{\mathcal R} \backslash (\cup_{i=1}^4 \Gamma_{i})$ ; this compact set is depicted as the closure of the white region in Figure 3. We need to show that

(5.21)

\begin{align}\liminf_{n\to\infty}\liminf_{j\to\infty} & \int_{\Gamma_{5}} \widehat R_{n,1}(y,z) \nu_{1,t_{j}}(dy\times dz) \ge 0.\end{align}

For each n, let $\{t_{j_k}\}\subset \{t_j\}$ be a subsequence such that

\begin{align*}\lim_{k\to\infty} \int_{\Gamma_{5}} \widehat R_{n,1}(y,z) \nu_{1,t_{j_k}}(dy\times dz) = \liminf_{j\to\infty} \int_{\Gamma_{5}} \widehat R_{n,1}(y,z) \nu_{1,t_{j}}(dy\times dz);\end{align*}

the dependence of the subsequence on n is notationally suppressed. Now restrict each $\nu_{1,t_{j_k}}$ to $\Gamma_5$ and observe that, trivially, the collection $\{\nu_{1,t_{j_k}}\}$ is tight; furthermore, $\nu_{1,t_{j_k}}(\Gamma_5) \leq \nu_{1,t_{j_k}}(\overline{\mathcal{R}})$ for each k. It therefore follows from Remark 5 that the masses $\{\nu_{1,t_{j_k}}(\Gamma_5)\}$ are uniformly bounded. The properties of tightness and uniform boundedness imply that there exist some further subsequence $\{t_{j_{k_\ell}}\}$ and a measure $\overline\nu_{1,n}$ on $\Gamma_5$ such that $\nu_{1,t_{j_{k_\ell}}} \Rightarrow \overline\nu_{1,n}$ (see Theorem 8.6.2 of [Reference Bogachev3]); the dependence of the limiting measure on n is now explicitly represented. Note that since the measures are restricted to $\Gamma_5$ , the weak convergence $\nu_{1,t_{j_{k_\ell}}} \Rightarrow \overline\nu_{1,n}$ implies that

\begin{align*}\lim_{\ell \to\infty} \nu_{1,t_{j_{k_\ell}}}(\Gamma_5) = \lim_{\ell \to\infty} \int_{\Gamma_5} 1\, d\nu_{1,t_{j_{k_\ell}}} = \int_{\Gamma_5} 1\, d\overline\nu_{1,n}(\Gamma_5) = \overline\nu_{1,n}(\Gamma_5).\end{align*}

For each n, the function $\widehat R_{n,1}(y,z)$ can be shown to be lower semicontinuous by an argument similar to that used in the proof of Proposition 1. In addition, $\widehat{R}_{n,1}$ inherits boundedness from the function $R_{n,1}$ , which is continuous and uniformly bounded on the compact region $\Gamma_{5}$ . This bound is also uniform for all n by the definition of $R_{n,1}$ . Then, applying Corollary 8.2.5 of [Reference Bogachev3], we have

\begin{align*}\liminf_{\ell\to\infty} \int_{\Gamma_5} \widehat{R}_{n,1}(y,z)\, \nu_{1,t_{j_{k_\ell}}}(dy\times dz) \geq \int_{\Gamma_5} \widehat{R}_{n,1}(y,z)\, \overline\nu_{1,n}(dy\times dz).\end{align*}

The challenge in analyzing the right-hand side is the dependence on n of both $\widehat{R}_{n,1}$ and $\overline\nu_{1,n}$ . We will apply Lemma 2.1 in [Reference Serfozo17], which concerns nonnegative functions. Since $\widehat R_{n,1}$ is uniformly bounded on $\Gamma_{5}$ and over $n\in \mathbb N$ , there is a positive constant R such that $\widehat R_{n,1}(y,z) + R \ge 0$ for all $(y,z) \in \Gamma_{5}$ and $n \in \mathbb N$ .

Now let $\{n_m\}\subset \mathbb N$ be a subsequence for which

\begin{align}\lim_{m\to\infty} \int_{\Gamma_{5}} \widehat R_{n_m,1}(y,z)\, \overline \nu_{1,n_m} (dy\times dz) = \liminf_{n\to\infty}\int_{\Gamma_{5}} \widehat R_{n,1}(y,z)\, \overline \nu_{1,n} (dy\times dz).\end{align}

The collection $\{\overline \nu_{1,n_m}\}$ , as measures on the compact set $\Gamma_{5}$ , is tight, and $\overline\nu_{1,n_m}(\Gamma_5)$ inherits the uniform bound of Remark 5. Theorem 8.6.2 of [Reference Bogachev3] implies the existence of a further subsequence $\{\overline \nu_{1,n_{m_i}}\}$ and a measure $\overline\nu$ such that $\overline\nu_{1,n_{m_i}} \Rightarrow \overline \nu$ .

We now verify the hypothesis of Lemma 2.1 of [Reference Serfozo17]. Observe that Fatou’s lemma implies that for each $(y,z)\in \Gamma_5$ ,

(5.22)

\begin{align} \nonumber \liminf_{n\to\infty} \widehat R_{n,1}(y,z) & = \liminf_{n\to\infty} \int_{y}^{z} R_{n,1}(y,v) Q(dv;\; y, z)\\[5pt] \nonumber& \ge \int_{y}^{z} \liminf_{n\to\infty} R_{n,1}(y,v) Q(dv;\; y, z) \\[5pt] \nonumber & = \int_{y}^{z} (BU_{0} (y,v) + c_{1}(y,v)) Q(dv;\; y, z) \\[5pt] & = \widehat{BU_{0}} (y, z) + \widehat c_{1}(y,z) \ge 0,\end{align}

where the last inequality follows from Proposition 5. Now briefly simplify notation by setting $f\;:\!=\;\widehat{BU_{0}} + \widehat c_{1}$ . Note that f is nonnegative and lower semicontinuous on $ {\Gamma_{5}}$ by Proposition 1. Moreover, (5.22) implies that

\begin{align*}\liminf_{i\to\infty} \widehat R_{n_{m_i},1}(y,z) \ge \liminf_{n\to\infty} \widehat R_{n,1}(y,z) \ge f(y,z).\end{align*}

Thus it follows that for any $t \in {\mathbb R}^{+}$ , $\varepsilon > 0$ , and all sufficiently large $i\in \mathbb N$ , we have $\{f +R > t + \varepsilon\} \subset \{ \widehat R_{n_{m_i},1} + R > t\}$ . Hence the weak convergence of $\overline\nu_{1,n_{m_i}}$ to $\overline\nu$ and this inclusion for i sufficiently large yield

\begin{align*}\overline\nu\{f + R> t+\varepsilon\} & \le \liminf_{i\to\infty} \overline \nu_{1,n_{m_i}}\{f + R > t+\varepsilon\} \\[5pt] & \le \liminf_{i\to\infty} \overline \nu_{1,n_{m_i}}\{\widehat R_{n_{m_i},1} + R> t\};\end{align*}

thus the conditions of Lemma 2.1 of [Reference Serfozo17] are satisfied. From that lemma and Proposition 1, it follows that

\begin{align*}& \liminf_{n\to\infty}\int_{\Gamma_{5}} (\widehat R_{n,1}(y,z) + R)\, \overline\nu_{1,n} (dy\times dz) \\[5pt] &\ \ = \lim_{i\to\infty}\int_{\Gamma_{5}} (\widehat R_{n_{m_i},1}(y,z) + R)\, \overline\nu_{1,n_{m_i}} (dy\times dz)\ge \int_{\Gamma_{5}} (f(y, z) + R)\, \overline\nu(dy\times dz).\end{align*}

Recalling that $f = \widehat{BU}_0 + \widehat{c}_1 \geq 0$ and that $\overline\nu_{1,n_{m_i}} \Rightarrow \overline\nu$ implies convergence of the masses $\overline\nu_{1,n_{m_i}}(\Gamma_5)$ to $\overline\nu(\Gamma_5)$ , we obtain

\begin{align*}\liminf_{n\to\infty}\int_{\Gamma_{5}} \widehat R_{n,1}(y,z)\, \overline\nu_{1,n}(dy\times dz) \ge \int_{\Gamma_{5}} f(y, z)\, \overline\nu(dy\times dz) \ge0.\end{align*}

Therefore (5.21) is established, which combined with (5.20) completes the proof.

Pulling all these results together, we obtain our main theorem.

Theorem 2. Assume Conditions 1–5. Let $(\tau,Z) \in \mathcal{A}_0$ with $J(\tau,Z) < \infty$ . Then

\begin{align*}J(\tau,Z) \geq H_0^* = H_0({y_0^*},{z_0^*}) = J(\tau^*,Z^*),\end{align*}

where $(\tau^*,Z^*)$ is the ordering policy (2.13) using an optimizing pair $({y_0^*},{z_0^*}) \in \mathcal{R}$ .

Proof. Let $(\tau,Z)\in \mathcal{A}_0$ satisfy $J(\tau,Z) < \infty$ . Let X satisfy (1.2), and let $\mu_{0,t}$ and $\nu_{1,t}$ be defined by (3.1) for each $t > 0$ . Let $\{t_j\}$ be a sequence with $t_j \rightarrow \infty$ and

(5.23)

\begin{eqnarray} \nonumber J(\tau,Z) &=& \lim_{j\rightarrow \infty} \mbox{$\frac{1}{t_j}$} \mathbb E\left[\int_0^{t_j} c_0(X(s))\, ds + \sum_{k=1}^\infty I_{\{\tau_k \leq t_j\}} c_1(X(\tau_k-\!),X(\tau_k))\right] \\[5pt] &=& \lim_{j\rightarrow \infty} \left(\int_{\overline{\mathcal{E}}} {c_0}(x)\, {\mu}_{0,t_j}(dx) + \int_{\overline{\mathcal{R}}} \widehat{c_1}(y,z)\, {\nu}_{1,t_j}(dy\times dz)\right).\end{eqnarray}

The tightness of $\{{\mu}_{0,t_j}\}$ implies the existence of a weak limit ${\mu}_0$ ; without loss of generality, assume ${\mu}_{0,t_j} \Rightarrow {\mu}_0$ as $j\rightarrow \infty$ . Proposition 4 and its proof establish that

\begin{align*}\int_{\overline{\mathcal{E}}} {c_0}\, d {\mu}_0 \leq \liminf_{j\rightarrow \infty} \int_{\overline{\mathcal{E}}} {c_0}\, d {\mu}_{0,t_j} \leq J(\tau,Z) < \infty.\end{align*}

Since $U_n \in \mathcal{D}$ , $\displaystyle\lim_{j\rightarrow \infty} \int_{\overline{\mathcal{E}}} AU_n\, d {\mu}_{0,t_j} = \int_{\overline{\mathcal{E}}} AU_n\, d {\mu}_0$ . Proposition 7 implies that for each n,

(5.24)

\begin{equation} \lim_{j\rightarrow \infty} \left(\int_{\overline{\mathcal{E}}} AU_n(x)\, {\mu}_0(dx) + \int_{\overline{\mathcal{R}}} \widehat{BU_n}(y,z)\, {\nu}_{1,t_j}(dy\times dz)\right) = 0,\end{equation}

so adding (5.23) and (5.24) and taking the limit inferior as $n\rightarrow \infty$ yields

\begin{align*}& J(\tau,Z) \\[5pt] &= \liminf_{n\rightarrow \infty} \lim_{j\rightarrow \infty} \bigg(\int_{\overline{\mathcal{E}}} (AU_n(x) + c_0(x))\, {\mu}_{0,t_j}(dx) \\[5pt] & \qquad+ \int_{\overline{\mathcal{R}}} (\widehat{BU_n}(y,z) + \widehat{c_1}(y,z))\, {\nu}_{1,t_j}(dy\times dz)\bigg) \\[5pt] &\geq \liminf_{n\rightarrow \infty} \liminf_{j\rightarrow \infty} \int_{\overline{\mathcal{E}}} (AU_n(x) + c_0(x))\, {\mu}_{0,t_j}(dx) \\[5pt] & \qquad+\; \liminf_{n\rightarrow \infty} \liminf_{j\rightarrow \infty} \int_{\overline{\mathcal{R}}} (\widehat{BU_n}(y,z) + \widehat{c_1}(y,z))\, {\nu}_{1,t_j}(dy \times dz) \\[5pt] &\geq \liminf_{n\rightarrow \infty} \int_{\overline{\mathcal{E}}} (AU_n(x) + c_0(x))\, {\mu}_0(dx) \\[5pt] & \qquad+\; \liminf_{n\rightarrow \infty} \liminf_{j\rightarrow \infty} \int_{\overline{\mathcal{R}}} (\widehat{BU_n}(y,z) + \widehat{c_1}(y,z))\, {\nu}_{1,t_j}(dy \times dz) \\[5pt] &\geq H_0^*;\end{align*}

Propositions 8 and 9 establish the last inequality.

6. Examples

We begin by briefly discussing the inventory management models in [Reference Helmes, Stockbridge and Zhu9]. The present paper shows that the optimality of a $({y_0^*},{z_0^*})$ policy extends to models having deficient supply. The main example (in Section 6.3) demonstrates the efficacy of this optimization approach for a more complicated stochastic logistic inventory model having nearly proportional yields.

6.1. Drifted Brownian motion inventory models

The first inventory problem concerns the classical fundamental process of a drifted Brownian motion $X_0$ satisfying the stochastic differential equation

(6.1)

\begin{equation} dX_0(t) = - \mu\, dt + \sigma\, dW(t), \qquad X_0(0)=x_0,\end{equation}

where $\mu, \sigma > 0$ and W is a standard Brownian motion, under the cost structure

\begin{align*} c_0(x) = \begin{cases}-c_b\, x, & x < 0, \\[5pt] c_h\, x, & x\geq 0,\end{cases} \ \quad \mbox{and} \ c_1(y,z) = k_1 + k_2(z-y), \ -\infty < y \leq z < \infty,\end{align*}

with $c_b, c_h, k_1, k_2 > 0$ .

A modification of the problem has reflection at 0, so that no back-ordering is allowed, with the cost structure

\begin{align*}c_0(x) = k_3 x + k_4 e^{-x}, \ \ x\geq 0, \quad \mbox{ and } \ \ c_1(y,z) = k_1 + k_2 \sqrt{z-y}, \ \ 0 \leq y \leq z < \infty,\end{align*}

again with $k_1, k_2, k_3, k_4 > 0$ .

As mentioned previously, Condition 1 is the same in both papers, and Condition 2.2 of [Reference Helmes, Stockbridge and Zhu9] is the same as Condition 3 in this paper. Further, Condition 2.3 of the previous paper is more restrictive than Condition 4 here. Thus, Conditions 1, 3, and 4 are satisfied by both of these models, as established in [Reference Helmes, Stockbridge and Zhu9]. Consequently, for any family $ \mathcal{Q}$ satisfying Condition 2, the conditions of Theorem 1 are satisfied and there exists an optimizing pair $({y_0^*},{z_0^*}) \in \mathcal{R}$ of $H_0$ .

Turning to Theorem 2 to establish the optimality of the $({y_0^*},{z_0^*})$ policy, we note that Condition 5 of this paper differs from Condition 5.1 of [Reference Helmes, Stockbridge and Zhu9] only in the use of $U_0 = g_0 - H_0^* \zeta$ in place of $G_0 = g_0 - F_0^* \zeta$ . The verification of Condition 5.1 of the previous paper does not rely on $F_0^*$ . Thus the same argument using $U_0$ in place of $G_0$ demonstrates that Condition 5 holds for both problems involving the drifted Brownian motion model. Theorem 2 therefore establishes that the $({y_0^*},{z_0^*})$ ordering policy is optimal.

6.2. Geometric Brownian motion storage models

In the second model examined in [Reference Helmes, Stockbridge and Zhu9], we take the fundamental dynamics to be a geometric Brownian motion process satisfying the stochastic differential equation

\begin{align*}dX_0(t) = -\mu X_0(t)\, dt + \sigma X_0(t)\, dW(t), \qquad X_0(t) = x_0 \in (0,\infty),\end{align*}

where $\mu, \sigma > 0$ . Two cost structures are analyzed:

\begin{align*}\begin{aligned}c_0(x)& = k_3 x + k_4 x^\beta \ \quad \mbox{ for } 0 < x < \infty, \\[5pt] c_1(y,z) & = k_1 + k_2\sqrt{z-y}\ \quad \mbox{ for } 0 < y \leq z < \infty, \end{aligned}\end{align*}

and

\begin{eqnarray*}c_0(x)\; &=& \;\left\{\begin{array}{cl}k_3 (1-x) & \qquad \qquad \qquad \qquad \mbox{for } 0 < x < 1, \\[5pt] k_4(x-1) & \qquad \qquad \qquad \qquad \mbox{for } 1 \leq x < \infty,\end{array}\right. \\[5pt] c_1(y,z)\; &=& \; k_1 + \mbox{$\frac{1}{2}$}\left(y^{-\frac{1}{2}} - z^{-\frac{1}{2}}\right) + \mbox{$\frac{1}{2}$} (z-y) \quad \; \; \mbox{for } 0 < y \leq z < \infty,\end{eqnarray*}

with the parameters $k_1, k_2, k_3, k_4 > 0$ and $\beta < 0$ .

For the geometric Brownian motion model, Conditions 1, 3, and 4 are shown to be satisfied in [Reference Helmes, Stockbridge and Zhu9]. Thus, for any family $ \mathcal{Q}$ satisfying Condition 2, Theorem 1 establishes the existence of an optimizing pair $({y_0^*},{z_0^*}) \in \mathcal{R}$ for $H_0$ . Furthermore, as in the case of the drifted Brownian motion model, Condition 5 follows from the same analysis as in the proof of Theorem 6.4 of [Reference Helmes, Stockbridge and Zhu9], with $U_0$ replacing $G_0$ . Therefore Theorem 2 shows that the $({y_0^*},{z_0^*})$ ordering policy is optimal for deficient supply models.

6.3. Logistic storage model

Our third example is a logistic inventory model in a random environment with a special family of random supplies. The process is an adaptation to an inventory set-up of a population model analyzed in [Reference Lungu and Øksendal15] in the context of a particular harvesting study.

For this model, the inventory level of a product (in the absence of orders) satisfies the stochastic differential equation

(6.2)

\begin{equation} dX_0(t) = - \mu X_0(t) (k-X_0(t))\, dt + \sigma X_0(t) (k-X_0(t))\, dW(t), \quad X_0(0) = x_0,\end{equation}

where k, $\mu$ and $\sigma$ are positive constants. Set $\beta \;:\!=\; - \frac{2 \mu }{k \sigma^2}$ and require $\beta < -1$ . The process $X_0$ evolves on the bounded state space ${\mathcal I} = (0,k)$ . With reference to Chapter 15 of [Reference Karlin and Taylor13], straightforward calculations verify that this model satisfies Condition 1. In particular, both endpoints are natural, 0 is attracting, and k is non-attracting; see also [Reference Helland7]. In comparison with geometric Brownian motion, both boundary points are finite. We identify the scale function and speed measure in (6.3) and (6.4) for a particular scaling of the logistic model.

A common yield structure when there are deficient supplies is provided by the uniform distribution on (y, z), which represents proportional yields. When ${\mathcal I}$ is unbounded above, this family of uniform distributions on (y, z) for $y,z \in {\mathcal I}$ is easily seen to satisfy Condition 2(c), since the mass escapes to $\infty$ as $z\to\infty$ . Unfortunately this condition is no longer true for a uniform distribution with y fixed, and $z \to k$ for this example since the right boundary is a finite value. Thus, we adopt the family of ‘z-skewed uniform distributions’ as a surrogate, resulting in a model with nearly proportional yields.

To be precise, choose a large integer j, and for each $(y, z) \in {\mathcal R}$ let $Q(\!\cdot \,;\;y,z)$ be the uniform distribution on the interval having left endpoint $(1-(z/k)^j) y + (z/k)^j z$ and right endpoint z. In this choice, the left endpoint is a convex combination of y and z with a weight factor $(z/k)^j$ that more heavily favors z as z approaches the upper boundary k. Clearly, this family of distributions satisfies the ASC condition as well as the MDG condition in Condition 2(a,ii). Furthermore, depending on the choice of j, the measure $Q(\!\cdot;\; y,z)$ is a ‘reasonable’ approximation to the uniform distribution on (y, z) when z is not too close to k. Therefore this family of random effects distributions results in a model having nearly proportional yields. Finally, we take $Q(\!\cdot\,,y,y) = \delta_y(\!\cdot\!)$ so that Condition 2(a,i) holds and it is easy to verify the weak convergence of the measures in Condition 2(b).

For this example, we choose the bounded holding cost function $c_0(x) \;:\!=\; k_0 (x-{\bar x})^2$ for $0 < x < k$ , where $k_0$ is a positive constant and the number ${\bar x} \in (0,k)$ characterizes a ‘preferred’ inventory level. Furthermore, we choose the order cost function $c_1(y,z)$ in (6.1). Again, straightforward analysis verifies (2.3) and hence Condition 3 is satisfied.

Scaling the inventory process by the factor k and adjusting the parameters appropriately, we can set $k = 1$ without loss of generality. The scale function S and the speed measure M associated with $X_0$ can be determined as follows. Let $C_1 = ({x_0/(1-x_0)})^{\beta}$ , $C_2 = 1/(\sigma^2C_1) = ((1-x_0)/x_0)^{\beta}/\sigma ^2$ , and let ${_2F_1}$ denote the (Gaussian) hypergeometric function. Let $\tilde S(x) = C_1 x^{(1-\beta)}/(1-\beta)\, {_2F_1}(1-\beta,-\beta;\;2-\beta;\;x)$ . Then

(6.3)

\begin{equation} S(x) = C_1 \int_{x_0}^{x} {((1-u)/u)}^{\beta} \, du = \, \,{\tilde S(x)} - {\tilde S(x_0)},\qquad 0 < x < 1,\end{equation}

while $M[a,b] = \int_{a}^{b}m(v) dv$ for any $[a, b] \subset (0,1)$ , where the speed density m is given by

(6.4)

\begin{equation} m(v) = C_2 (1-v)^{-(\beta+2)}{v}^{\beta - 2}, \qquad 0 < v < 1.\end{equation}

For later reference, we note that $S'(x) = C_1 (\frac{1-x}{x})^\beta$ for $x\in (0,1)$ .

Now turning to Condition 4, since each boundary is natural, we need to check that there is some $(y,z)\in {\mathbb R}$ for which $H_0(y,z)=(\widehat{Bc_1}(y,z)+\widehat{Bg_0}(y,z))/\widehat{B\zeta}(y,z)$ is smaller than the holding cost rates at the boundaries. To this end, note that the expressions for $\zeta$ and $g_0$ simplify considerably when we set $x_0 = {\bar x} = 1/2$ , so for this example we make this selection. These functions are then

(6.5)

\begin{align} \zeta(x) & = -\,\frac{2 \left(1-2 x+2 \beta \ln(2-2 x)+\beta (1+\beta ) \ln\left(\frac{x}{1-x}\right)\right)}{\sigma ^2 \beta \left(-1+\beta^2\right)},\end{align}

(6.6)

\begin{align}g_0(x) &= \frac{k_0 \left((-1+2 x) \left(-1+2 \beta ^2\right)-2 \beta \ln(2-2 x)-\beta (1+\beta ) \ln\left(\frac{x}{1-x}\right)\right)}{2\sigma ^2 \beta \left(-1+\beta ^2\right)}\ .\end{align}

The functions $\widehat{Bc_1}$ , $\widehat{B\zeta}$ , and $\widehat{Bg_0}$ are then obtained by integrating the functions given above with respect to the measures Q. Usually this integration is best accomplished using software packages such as Maple or Mathematica, since the formulas become messy. Then, by elementary but rather lengthy calculations, one verifies Condition 4.

For more general parameters in this model, (6.5) and (6.6) become more involved and even analytically intractable for some families of random effects measures. An alternative approach to verifying Condition 4 is to simply optimize $H_0$ and then compare the optimal value $H_0^*$ with the cost rates $c_0(0) = k_0/4 = c_0(1)$ . An optimizing pair $(y^*,z^*)$ in the interior would then satisfy Condition 4 for this model when $H_0^* < k_0/4$ . Minimizing $H_0$ is a two-dimensional optimization problem. Since Condition 4 only requires the existence of a pair $(y,z)\in \mathcal{R}$ , other alternatives for verifying this condition would be (i) to fix one of the variables or a relation between the variables, perform a one-dimensional optimization, and compare this value of $H_0$ against $k_0/4$ , or (ii) to compare the values of $H_0$ from a random search of $\mathcal{R}$ . Each of these approaches is numerical, rather than analytic.

Finally, to see that an (s, S) policy is optimal, we need to verify parts (a,ii) and (b,ii) of Condition 5. To this end, recall that $U_0(x) = g_0(x) - H_{0}^{*} \zeta(x)$ and observe that $c_{0}(x) - H_{0}^{*}$ is uniformly bounded on the unit interval. With $\zeta$ and $g_0$ given in (2.4) and using the expressions for the scale density (6.3) and the speed density (6.4), we have

(6.7)

\begin{align} \nonumber |\sigma x (1 - x) U'_{\!\!0}(x)| & \le |\sigma x (1 - x) S'(x)| \int_{x}^{1} |c_0(v) - H_{0}^{*}| dM(v) \\[5pt] & \le K x^{1-\beta}(1-x)^{1+\beta}\int_{x}^{1} v^{\beta-2} (1-v)^{-\beta-2} dv,\end{align}

where K is a positive constant independent of x or $x_{0}$ . To see that the left-hand side of (6.7) is uniformly bounded on [0, 1] which, in turn, implies (a,ii) and (b,ii) of Condition 5, it is clearly sufficient to find bounds in some neighborhoods of the two endpoints. The simple idea is to verify the following: (i) for x close to 1, the integral on the right-hand side of the inequalities decreases at the same rate as the factor $(1-x)^{1+\beta}$ increases; (ii) when x is close to zero, the integral increases at a rate no faster than the rate at which the factor $x^{1-\beta}$ decreases.

(i) For $x\in (\frac12, 1)$ the integral in (6.7) is dominated by

\begin{align*}\int_{x}^{1} v^{\beta-2} (1-v)^{-\beta-2} dv \quad \le \quad 2^{2-\beta} \int_{x}^{1}(1-v)^{-\beta-2} dv \quad \le \quad \mbox{$\frac{2^{2-\beta}}{-\beta-1}$} (1-x)^{-\beta-1},\end{align*}

and hence the left-hand side of (6.7) is bounded by some $K_1$ for $x\in (\frac12,1)$ . (ii) Similarly, for $x\in (0, \frac12)$ , a dominating function for the integral in (6.7) is determined as follows:

\begin{eqnarray*}\int_{x}^{1} v^{\beta-2} (1-v)^{-\beta-2} dv\; &=& \;\int_{x}^{1/2} v^{\beta-2} (1-v)^{-\beta-2} dv + \int_{1/2}^{1} v^{\beta-2} (1-v)^{-\beta-2} dv \\[5pt]\; &\leq&\;(2^{2+\beta}\vee 1) \int_{x}^{1/2} v^{\beta-2} dv + K_2 \\[5pt] \; &=& \;\mbox{$\frac{2^{2+\beta}\vee 1}{1-\beta}$}\, x^{\beta-1} + K_3,\end{eqnarray*}

where $K_2$ is the value of the integral over $\big[\frac12,1\big]$ and $K_3$ then adjusts this value by the contribution of the first integral at the boundary $1/2$ . Thus, taking into account the factor $x^{1-\beta}$ on the right-hand side of (6.7), we see that the left-hand side of (6.7) is bounded for $x\in (0,1/2)$ .

Using both estimates in (6.7), together with the fact that $\lim_{x\to 0} \sigma x (1-x) U'_{\!\!0}(x)$ exists and is finite, we have thus shown that $|\sigma x (1-x) U'_{\!\!0}(x)|$ is uniformly bounded on [0, 1]. Since the denominators are bounded below by 1, Condition 5 holds.

In summary, the model satisfies Conditions 1, 2, 3, and 4. Therefore Theorem 1 establishes the existence of an optimizing pair $({y_0^*},{z_0^*}) \in \mathcal{R}$ of $H_0$ . Furthermore, since Condition 5 holds, Theorem 2 shows that the $({y_0^*},{z_0^*})$ ordering policy is optimal for this particular logistic inventory model.

Finally, we numerically illustrate the effect of using the optimization results in this paper for a particular set of parameters. For comparison purposes, three models based on the logistic dynamics in (6.2) are examined. Model 1 assumes no noise by setting $\sigma=0$ so that the dynamics are deterministic, and it uses the non-deficient supply measures $Q(\!\cdot;\;y,z) = \delta_{\{z\}}(\!\cdot\!)$ for all $(y,z) \in \mathcal{R}$ . Model 2 has $\sigma=1/10$ , resulting in random fluctuations in the inventory level, but also uses $Q(\!\cdot;\;y,z) = \delta_{\{z\}}(\!\cdot\!)$ for all $(y,z) \in \mathcal{R}$ , so that the amount ordered is the amount delivered. Model 3 takes $\sigma=1/10$ and uses the nearly proportional yield transition functions Q defined earlier in this subsection, with $j=10$ . The other parameters in this illustration are $k=1$ , $\mu=1/20$ , $k_0=100$ , $k_1=9$ , $k_2=4$ , and $x_0 = {\bar x} = 1/2$ .

Table 1 illustrates the impact a random environment and/or random supplies have on the optimal characteristics of the logistic inventory model. Specifically, the following characteristics of the optimal solutions have been computed:

• the order ‘from’ level ${y_0^*}$ and the deterministic order ‘to’ or nominal order ‘to’ level ${z_0^*}$
• the ‘mean supply’, a deterministic quantity in Models 1 and 2
• the optimal expected long-run average ‘Cost’
• the ‘mean cycle length’ (the cycle length is again deterministic for Model 1)

Observe that the optimal value of $H_0^* = 1.33092 = H_0(0.384973,0.6575) < 25 = k_0/4$ , so Condition 4 is satisfied.

Table 1. Comparison of three logistic inventory models.

From a management point of view, the following observations are important. The nearly proportional yield model’s having random fluctuations in inventory results in cost increases of 42% and 33% over Models 1 and 2, respectively. Also, the uncertainty of the environment and the fluctuating deliveries typically shorten the mean cycle length, even though the nominal order interval increases in length as randomness is added to the process and to the delivered amounts. Thus, ordering tends to occur more frequently for the stochastic models.

Additional insights into the characteristics of the optimal nominal policy and optimal inventory process can be obtained by more extensive sensitivity analysis. For instance, for modifications of this example, various statistics of the aforementioned quantities, such as the mean cycle time, can be computed or derived from simulation studies.

As indicated earlier, uniqueness of the optimal policy is not analytically guaranteed. However, one may obtain contour plots of $H_0$ numerically and thereby determine the uniqueness of the optimal policy for this particular model and for more general stochastic differential equations and Q distributions.

Funding information

This research was supported in part by the Simons Foundation (grant award numbers 246271 and 523736) and a DIG award from the University of Wisconsin–Milwaukee.

Competing interests

There were no competing interests to declare which arose during the preparation or publication process of this article.

References

Bar-Lev, S. K., Parlar, M. and Perry, D. (1994). On the EOQ model with inventory-level-dependent demand rate and random yield. Operat. Res. Lett. 16, 167–176.CrossRef Google Scholar

Bensoussan, A. (2011). Dynamic Programming and Inventory Control. IOS Press, Amsterdam.Google Scholar

Bogachev, V. I. (2007). Measure Theory, Vol. 2. Springer, Berlin.CrossRef Google Scholar

Chen, H., Wu, O. Q. and Yao, D. D. (2010). On the benefit of inventory-based dynamic pricing strategies. Prod. Operat. Manag. 19, 249–260.CrossRef Google Scholar

Ethier, S. N. and Kurtz, T. G. (1986). Markov Processes: Characterization and Convergence. John Wiley, New York.CrossRef Google Scholar

Federgruen, A. and Zipkin, P. (1986). An inventory model with limited production capacity and uncertain demands I. The average-cost criterion. Math. Operat. Res. 11, 193–207.Google Scholar

Helland, I. (1996). One-dimensional diffusion processes and their boundaries. Tech. Rep., University of Oslo. Available at https://core.ac.uk/download/pdf/30815803.pdf.Google Scholar

Helmes, K. L., Stockbridge, R. H. and Zhu, C. (2017). Continuous inventory models of diffusion type: long-term average cost criterion. Ann. Appl. Prob. 27, 1831–1885.CrossRef Google Scholar

Helmes, K. L., Stockbridge, R. H. and Zhu, C. (2018). A weak convergence approach to inventory control using a long-term average criterion. Adv. Appl. Prob. 50, 1032–1074.CrossRef Google Scholar

Helmes, K. L., Stockbridge, R. H. and Zhu, C. (2024). On the modelling of uncertain impulse control for continuous Markov processes. SIAM J. Control Optimization. 62, 699–723.CrossRef Google Scholar

Inderfurth, K. and Transchel, S. (2007). Note on myopic heuristics for the random yield problem. Operat. Res. 55, 1183–1186.CrossRef Google Scholar

Inderfurth, K. and Vogelsang, S. (2013). Periodic review inventory systems with fixed order cost and uniform random yield. Europ. J. Operat. Res. 224, 293–301.CrossRef Google Scholar

Karlin, S. and Taylor, H. M. (1981). A Second Course in Stochastic Processes. Academic Press, New York.Google Scholar

Korn, R. (1997). Optimal impulse control when control actions have random consequences. Math. Operat. Res. 22, 639–667.CrossRef Google Scholar

Lungu, E. and Øksendal, B. (1997). Optimal harvesting from a population in a stochastic crowded environment. Math. Biosci. 145, 47–75.CrossRef Google Scholar

Sato, K., Yagi, K. and Shimakazi, M. (2018). A stochastic inventory model for a random yield supply chain with wholesale-price and shortage penalty contracts. Asia-Pacific J. Operat. Res. 35, article no. 1850040.CrossRef Google Scholar

Serfozo, R. (1982). Convergence of Lebesgue integrals with varying measures. Sankhya A 44, 380–402.Google Scholar

Sigman, K. and Wolff, R. W. (1993). A review of regenerative processes. SIAM Rev. 35, 269–288.CrossRef Google Scholar

Song, Y. and Wang, Y. (2017). Periodic review inventory systems with fixed order cost and uniform random yield. Europ. J. Operat. Res. 257, 106–117.CrossRef Google Scholar

Tinani, K. S. and Kandpal, D. H. (2017). Literature review on supply uncertainty problems: yield uncertainty and supply disruption. J. Indian Soc. Prob. Statist. 18, 89–109.CrossRef Google Scholar

Yano, C. A. and Lee, H. L. (1995). Lot sizing with random yields: a review. Math. Operat. Res. 43, 311–334.CrossRef Google Scholar

Yao, D., Chao, X. and Wu, J. (2015). Optimal control policy for a Brownian inventory system with concave ordering cost. J. Appl. Prob. 52, 909–925.CrossRef Google Scholar

Zheng, Y. S. and Federgruen, A. (1991). Finding optimal (s, S) policies is about as simple as evaluating a single policy. Operat. Res. 39, 654–665.Google Scholar