OPTIMALITY OF FOUR-THRESHOLD POLICIES IN INVENTORY SYSTEMS WITH CUSTOMER RETURNS AND BORROWING/STORAGE OPTIONS

Eugene A. Feinberg; Mark E. Lewis

doi:10.1017/S0269964805050047

OPTIMALITY OF FOUR-THRESHOLD POLICIES IN INVENTORY SYSTEMS WITH CUSTOMER RETURNS AND BORROWING/STORAGE OPTIONS

Published online by Cambridge University Press: 01 January 2005

Eugene A. Feinberg and

Mark E. Lewis

Show author details

Eugene A. Feinberg: Affiliation:
Department of Applied Mathematics and Statistics, State University of New York at Stony Brook, Stony Brook, NY 11794-3600, E-mail: Eugene.Feinberg@sunysb.edu
Mark E. Lewis: Affiliation:
Department of Industrial and Operations Engineering, University of Michigan, Ann Arbor, MI 48109-2117, E-mail: melewis@engin.umich.edu

Article contents

Abstract
1. INTRODUCTION
2. PROBLEM DESCRIPTION
3. FINITE HORIZON DISCOUNTED COST OPTIMAL POLICIES
4. INFINITE HORIZON DISCOUNTED COST OPTIMAL POLICIES
5. AVERAGE COST PER UNIT TIME OPTIMAL POLICIES
6. THE AVERAGE COST OPTIMALITY EQUATIONS
7. CONCLUSIONS
Acknowledgments
References

Rights & Permissions

Abstract

Consider a single-commodity inventory system in which the demand is modeled by a sequence of independent and identically distributed random variables that can take negative values. Such problems have been studied in the literature under the name cash management and relate to the variations of the on-hand cash balances of financial institutions. The possibility of a negative demand also models product returns in inventory systems. This article studies a model in which, in addition to standard ordering and scrapping decisions seen in the cash management models, the decision-maker can borrow and store some inventory for one period of time. For problems with back orders, zero setup costs, and linear ordering, scrapping, borrowing, and storage costs, we show that an optimal policy has a simple four-threshold structure. These thresholds, in a nondecreasing order, are order-up-to, borrow-up-to, store-down-to, and scrap-down-to levels; that is, if the inventory position is too low, an optimal policy is to order up to a certain level and then borrow up to a higher level. Analogously, if the inventory position is too high, the optimal decision is to reduce the inventory to a certain point, after which one should store some of the inventory down to a lower threshold. This structure holds for the finite and infinite horizon discounted expected cost criteria and for the average cost per unit time criterion. We also provide sufficient conditions when the borrowing and storage options should not be used. In order to prove our results for average costs per unit time, we establish sufficient conditions when the optimality equations hold for a Markov decision process with an uncountable state space, noncompact action sets, and unbounded costs.

Type: Research Article
Information: Probability in the Engineering and Informational Sciences , Volume 19 , Issue 1 , January 2005 , pp. 45 - 71

DOI: https://doi.org/10.1017/S0269964805050047 [Opens in a new window]
Copyright: © 2005 Cambridge University Press

1. INTRODUCTION

Consider a single-commodity inventory system for which we do not assume that the demand is nonnegative. Such problems have been studied in the literature under the name cash management and relate to the variations of the on-hand cash flows of financial institutions. For the cash management problem without fixed ordering costs, an optimal policy is defined by two thresholds; if the inventory level is above the higher threshold, it should be reduced to this threshold, and, similarly, if the inventory level is below the lower threshold, it should be increased up to this threshold (see Eppen and Fama [16] or Heyman and Sobel [27, Sect.8.4]). This is an extension of S-policies (also called “order-up-to” or “basestock” policies) for models with nonnegative demand.

In cash management problems, ordering and scrapping the inventory is associated with financial transactions. In reality, a number of financial instruments are available to a company. In particular, a firm may be able to borrow money for a short period of time. A natural question to investigate is how should short-term and long-term transactions be linked. A similar dilemma also takes place for production/inventory operations; is it better to buy or lease even when leasing is more expensive per unit time? In fact, in inventory systems with product returns, a company can benefit from temporary increases to its inventory (leasing) for a fixed period of time since, in view of possible returns, after this period, ordering may not be needed. In this article, we model short-term transactions by one-step borrowing and storage decisions.

We consider the problem when the manager can borrow or store the inventory at any period of time. The borrowed inventory should be returned at the beginning of the next period and the stored inventory is automatically added to the main inventory in the beginning of the next period. We consider the model with back orders. For example, it is possible that the borrowing option is executed but the inventory level becomes negative in the beginning of the next period. In this case, the borrowed amount is returned anyway and the back order increases. The manager, though, has an option to borrow again. In the model considered in this article, the purchasing, scrapping, borrowing, and storing costs are linear. There are no fixed costs associated with ordering for any of these operations. All capacities are unconstrained and all lead times are zero.

We show that when the option to borrow or store for one period is available to the decision-maker, an optimal policy is defined by four thresholds. If the inventory position is too high, the optimal decision is to reduce the inventory via scrapping, but only to a certain point, after which one should also store some of the commodity to meet future demand. Analogously, if the inventory position is too low, the decision-maker should order up to a certain level and then borrow from the secondary source to meet potential demand. We also provide sufficient conditions when optimal policies do not use the borrowing and storage options; that is, the production threshold is equal to the borrowing threshold and the similar equality holds for scrapping and storage. In particular, we show that this holds if borrowing is more expensive than backordering in problems with linear holding and backordering costs or if the demand is nonnegative and borrowing is not less expensive than ordering.

The study of inventory control dates back at least to the work of Arrow, Harris, and Marshcak [2] and some would say (in the deterministic case) to the work of Harris [23]. With this in mind, we will not attempt a complete review here except to point the reader to the excellent survey article of Porteus [32] that touches many of the high points in the area until 1990. Cash management has also received a considerable amount of attention, although much less in the operations research literature than the inventory models with nonnegative demand. Eppen and Fama [16] considered a model with independent and identically distributed (i.i.d.) discrete demands with finite support and showed the existence of order-up-to and down-to levels in the finite horizon case for models without setup costs. Girgis [21] and Neave [30] considered both fixed and variable costs for each transaction. The former shows that when there are fixed costs for increasing or decreasing demand (but not both), an optimal policy analogous to (s,S)-policies in inventory control results. One important difference is that between the order-up-to level (S) and the lower limit (s), where no action is usually taken, the optimal policy in the cash management model may order. Furthermore, it is not immediately obvious what the ordering decision in this case might be. The latter article shows that when both transactions have fixed costs, this analogy need not hold in both directions, and it provides conditions under which it does hold. All of these results were collected and simplified in [15]. Other generalizations of the cash management problem appeared in [12,28]. Other models of product returns were studied in [20,26,41]. The most recent results on the cash management problem with fixed costs can be found in Chen and Simchi-Levi [9].

Early models that allow for demand to be met by emergency orders, possibly at the expense of higher costs, include those of Daniel [13], Neuts [31], and Barankin [4]. Aneji and Noori [1] discussed a problem in which unmet demand may be met by a secondary source and showed that the ordering policy is an (s,S)-policy. Tagaras and Vlachos [40] discussed a periodic review system with the possibility of emergency replenishments with various lead times. Recently, Huggins and Olsen [29] showed that when overtime production is available, an (s,S)-policy is still optimal for regular production and various policies are optimal for overtime. Other related models include those of Chiang and Gutierrez [10,11] and Arslan, Ayhan, and Olsen [3].

Section 2 of this article provides a formal description of the problem and the optimality criteria considered. Section 3 discusses optimal order-up-to and down-to policies in the finite horizon case and provides conditions under which we need not consider the borrowing or storage options. This is continued in Sections 4 and 5 for the infinite horizon discounted and average cost cases, respectively. Section 6 proves the existence of a solution to the average cost optimality equations (ACOEs) for our problem. Since our search of the literature did not yield sufficient conditions for the validity of the ACOEs for our problem, we provide sufficient conditions for a more general Markov decision process (MDP) and show that they hold in our case. We conclude by discussing avenues of future research in Section 7. For the remainder of this section, we explain what sufficient conditions are available in the literature and what is required for our problem.

For MDPs with Borel state spaces and bounded rewards, the ACOEs were introduced by Ross [35] and studied by Gubenko and Statland [22], Dynkin and Yushkevich [14], and Fernández-Gaucherand [18]. For problems with unbounded rewards, according to Schäl [37, Prop. 1.3] a stationary policy that satisfies the average cost optimality inequalities is optimal. Of course, if the optimality equations hold, a stationary policy that satisfies them is optimal as well. Schäl [37] described two groups of conditions, (W) and (S), and proved that each of these conditions imply the validity of the optimality inequalities. Conditions (W) require weak continuity of transition probabilities. Conditions (S) require setwise continuity of transition probabilities; discussed further later. We remark that if the action sets are finite, as was assumed in Ross [35] and Ritt and Sennott [34], then the setwise convergence conditions (S) from Schäl [37] hold and no specific continuity assumption is needed on transition probabilities.

An important feature of many inventory control models considered in the literature is that the control sets may not be compact and are, in fact, assumed to be unbounded. However, Schäl [37] assumed that the action sets are compact. This assumption prevents direct applications of the results from [37] to classic problems with unlimited ordering/scrapping capacities.

We remark that for inventory control problems, conditions (W) are natural, whereas conditions (S) are too strong. Consider a typical inventory control equation

where x_n is the inventory at the end of period n, a_n is the decision on how much should be ordered, and D_n is the demand during period n. Let q(dy|x,a) be the transition probability for the control problem (1.1); that is, q(B|x,a) represents the probability that a subset B of the state space is visited at the next step, given that action a is chosen in state x. Weak continuity of q in Schäl's [37] condition (W) means that E_{x_n^k,a_n^k} f (x_n+1) → E_{x_n,a_n} f (x_n+1) for any sequence {(x_n^k,a_n^k),k ≥ 0} of state–action pairs such that (x_n^k,a_n^k) → (x_n,a_n) for each bounded, continuous function f. This is true in light of (1.1) and Lebesgue's dominated convergence theorem. On the other hand, recall that setwise continuity in Schäl's [37] condition (S) means that q(B|x_n,a_n^k) → q(B|x_n,a_n) as a_n^k → a_n for any Borel subset B of the state space. Suppose that the demand is deterministic, D_n = 1, and a_n^k = a_n + (1/k), x_n^k = x_n. Then, q(B|x_n,a_n) = 1 for B = (−∞,x_n + a_n − 1] and q(B|x_n,a_n^k) = 0 for all k = 1,2,… .

As discussed earlier, Schäl [37] established the optimality inequalities under both weak and setwise continuity conditions but assumed the compactness of action sets. Hernández-Lerma and Lasserre [25, Chap.5], and Fernández-Gaucherand, Arapostathis, and Marcus [19] presented results for noncompact action sets but assumed setwise convergence. Moreover, Section 5.7 in Hernández-Lerma and Lasserre [25] provides conditions for the existence of stationary optimal policies for an MDP with weakly continuous transition probabilities, but the derivation is done directly—without deriving the optimality equations or inequalities. In this article, we need not only the existence of optimal policies but also the validity of the ACOEs. Hartley [24] established the validity of the ACOEs for the class of problems whose dynamics are described by equations that include (1.1). However, he assumed that the demand is a uniformly bounded random variable (i.e., |D_n| ≤ C for a finite constant C). Thus, although several sufficient conditions for the validity of the ACOEs have been described in the literature, none of them covers our problem. In Section 6, we provide the sufficient conditions that do this.

2. PROBLEM DESCRIPTION

Recall that we consider a single-commodity system with positive or negative demand. Assume that items that are returned are immediately available for resale. All lead times are assumed to be zero, the production/scrapping and borrowing/storage costs are assumed to be linear, and the unmet demand is backlogged. The cost of inventory held or backlogged (negative inventory) is modeled as a convex function. The sequence of events is as follows. At the beginning of the period, a decision-maker decides how much of the product to order or to scrap (at a loss) to meet demand. In addition, the decision-maker simultaneously decides how much inventory to borrow from or store to a (potentially external) secondary source. During the period, demand is realized and holding costs are accrued on the surplus or backlogged inventory. At the end of the period, borrowed or stored material is returned and the process continues. Note that since borrowed or stored inventory is returned the next period, holding costs are accrued but the inventory level of the next period is not affected. The objective is to minimize the total expected discounted cost over a finite horizon or the discounted or average cost over an infinite horizon. Let the following hold:

β ∈ (0,1] is the discount factor.
Positive numbers c₊ and c₋ are the per unit ordering and scrapping costs, respectively.
Positive numbers e₊ and e₋ are the per unit borrowing and storage costs, respectively.
h(·) denotes the holding/backordering cost per period; convex, nonnegative function with finite values, and h(x) → ∞ as |x| → ∞.
{D_n,n ≥ 0} is a sequence of i.i.d. random variables, where D_n represents demand in the nth period. We assume that
for all
, where D is a random variable with the same distribution as D_n. We note that this assumption and the assumed properties of the function h imply that
.

For

, let a⁺ and a⁻ be the positive and negative parts of a, respectively; a⁺ = max{a,0} and a⁻ = max{−a,0}. Let X_n be the inventory position at period n and let the ordered pair (Y_n,Z_n) be the amount ordered/scrapped and the amount borrowed/stored in this period. Denote the one-step cost function by C(x,(a,b)):

We model the decision scenario as a Markov decision process (see, e.g., Bertsekas [5,6], Dynkin and Yushkevich [14], Feinberg and Shwartz [17], Puterman [33], or Ross [36]). A general policy can be randomized and depend on the history of the inventory levels and decisions. A policy is called stationary if decisions are nonrandomized and depend only on the current inventory level; that is, a stationary policy is defined by a measurable function that maps the inventory position (the state space,

) to the set of potential actions (the action space). A Markov policy is defined as a sequence d₀,d₁,…, where d_n(x) is the decision that should be selected if the inventory level is x at step n. In our model, if an action (a,b) is chosen in state x, the cost C(x,(a,b)) is accrued, the system moves to state x + a − D, and this process continues. As was alluded to earlier, since the amount that is borrowed or stored is returned the next period, it has no effect on the subsequent inventory position. For a policy π and for an initial inventory level x, we define

Equations (2.2), (2.3), and (2.4) define the N-stage expected discounted cost, the infinite horizon expected discounted cost, and the long-run average expected cost, respectively. In the finite horizon problem, only the portion of the policy required for the time horizon is used. In each case, we define the optimal values

where Π is the set of all policies. A policy φ is called optimal for the respective criterion if its value v_N,β^φ(x), v_β^φ(x), or w^φ(x) corresponds to the value on the right-hand side of (2.5), (2.6), or (2.7), respectively, for all

We remark that our assumptions also imply that v_β(x) < ∞ for all

. Indeed, the assumptions on the holding cost h imply that there is a point x* such that

. Without loss of generality, we assume that x* = 0 and h(0) = 0. Consider a policy φ that never borrows/stores and always orders/scraps in a way that the inventory level before the demand is known is zero. Then

3. FINITE HORIZON DISCOUNTED COST OPTIMAL POLICIES

In this section, we study the finite horizon problem. Since it is fixed throughout the section, we suppress β whenever possible. It is well known that if a solution to the following finite horizon optimality equations (FHOEs) exist, that solution is equal to v_n (componentwise) as defined in (2.5). Let v₀ ≡ 0 and for n = 1,2,…,

where

We observe that an equivalent system is

where

This observation leads to an algorithm for solving the proposed problem. First, solve (3.4) for g*. Using this function, find the optimal ordering/scrapping policy by solving (3.3). Finally, using g* evaluated at y + a, find the optimal borrowing/storage decision b. The following lemma provides preliminary results on v_n and g*.

Lemma 3.1:

(i) The functions g*(y) and v_n(y) are convex in y for all n ≥ 0.

(ii) g*(y) → ∞ and v_n(y) → ∞ as |y| → ∞ for all n ≥ 1.

(iii) The inf can be replaced with min in (3.3) and (3.4).

Proof: For any convex function k(y) and random variable X such that

is also convex in y (cf. Bertsekas [6, Lemma4.2.1]). Since h is convex, the function inside the infimum in (3.4) is jointly convex in y and b. Thus, applying Proposition B-4 of Heyman and Sobel [27], we have that g*(y) is convex in y. The fact that g*(y) → ∞ as |y| → ∞ follows from the assumption that h(y) → ∞ as |y| → ∞.

We prove parts (i) and (ii) by induction. By assumption, v₀ is convex. Assume that convexity holds for n − 1. Since we have just shown that g* is convex, the function inside the infimum in (3.3) is jointly convex in y and a. Again applying Proposition B-4 of Heyman and Sobel [27] yields that v_n(y) is convex in y. Moreover, note that v_n(y) → ∞ as |y| → ∞ is implied by the fact that g*(y) → ∞ as |y| → ∞ and the result is proven.

To verify (iii), we observe that for each y the function of b on the right-hand side of (3.4) is convex in

and, therefore, continuous. In addition, this function tends to ∞ as |b| → ∞. This is also true for (3.3) with the variable a instead of b. █

For a function f (y), let d⁻f (y)/dy denote the left-hand derivative of f. This derivative exists if f is convex. In light of the results of Lemma 3.1, the infimums in (3.3) and (3.4) can be replaced by minimums. The minimum can be computed by finding the place at which the left-hand derivative changes sign. We, thus, can define the following quantities for n = 1,2,…:

where the supremum (infimum) of the empty set is taken to be −∞ (∞). Although the values defined in (3.5)–(3.8) depend on β, to keep the notation simple, we continue to suppress this dependence in the current section. For example, the full notation for L_n^p is L_n,β^p.

We remark that Eqs. (8-58a) and (8-58b) of Heyman and Sobel [27] are similar to our (3.5)–(3.12), but only one point where the appropriate convex functions achieve their minimums was considered in [27]. Here, we consider the intervals of all possible solutions. Obviously, L_n^p ≤ L_n^p, U_n^p ≤ U_n^p, L^b ≤ L^b, and U^b ≤ U^b, where equalities are possible in each case. We say that L_n^p is a lower actual inventory threshold if

U_n^p is an upper actual inventory threshold if

L^b is a lower total (actual plus borrowed) inventory threshold if

and U^b is an upper total inventory threshold if

The following lemma clarifies these definitions.

Lemma 3.2: For any n = 1,2,…, consider four threshold levels L_n^p, U_n^p, L^b, and U^b satisfying (3.13)–(3.16). Define the following decisions: order up/scrap down to the level

where y is the current inventory level, and borrow up/store down to the level

Then the actions a_n(y) = t_n′ − y and b_n(y) = z_n′ − a_n(y) minimize the right-hand sides of the optimality equations (3.3) and (3.4), respectively. Therefore, for any N = 1,2,…, the policy φ = {d_N,d_N−1,…,d₁} is optimal for the N-step problem, where d_n(x) = (a_n(x),b_n(x)), n = 1,…,N, and −∞ < x < ∞.

Proof: In light of Lemma 3.1, the problem of finding optimal policies via (3.3) and (3.4) is simply the single-period cash balance models discussed in [15] and [27, Sect.8-4]. Thus, similar to [27, Sect.8-4], we have optimal order-up-to and order-down-to levels defined by the minimums of the convex functions defined in (3.3) and (3.4) when the changes of variables z = y + a and z = y + b are applied. For any L_n^p ∈ [L_n^p,L_n^p] , an optimal action is to order up to level L_n^p if the current inventory y < L_n^p. If the inventory level is above any level U_n^p ∈ [U_n^p,U_n^p] , an optimal action is to reduce the inventory level to U_n^p. If the inventory level is between L_n^p and U_n^p, an optimal action is not to use the ordering/scrapping option. Similarly, the optimal decision obtained from (3.4) is to increase y to L^b ∈ [L^b,L^b] if y < L^b and to decrease y to U^b ∈ [U^b,U^b] if y > U^b. No borrowing/storage action is required if L^b < y < U^b. █

The following lemma simplifies the structure of the optimal policy in the sense that it displays that we need not produce (scrap) and store (borrow) in the same period.

Lemma 3.3: The following inequalities hold: L_n^p ≤ U^b and L^b ≤ U_n^p, for n = 1,2,….

Proof: We prove the first inequality. The proof of the second inequality is similar. Suppose L_n^p > U^b and set L_n^p = L_n^p and U^b = U^b. For n = 1 and y < L₁^p, (3.1) and Lemma 3.2 imply that for a = L₁^p − y > 0 and b = L^b − L₁^p < 0,

where the last expression corresponds to the policy that orders (a⁺ − b⁻) units and borrows nothing. This violation of the optimality equation implies the contradiction. Thus, the case n = 1 is proven.

For n ≥ 2, the inequality L_n^p > U^b is not possible by similar arguments, but we must consider two-step decisions. Again, suppose that this inequality holds. According to Lemma 3.2, when the initial state y < L_n^p, there is an optimal policy φ that prescribes to order up to the level L_n^p (a = L_n^p − y) and then to store down to the level U^b (b = U^b − L_n^p < 0). Consider now a policy ψ (when the initial state is y) that prescribes to order up to the level U^b at the first step and borrow nothing. At the second step, it orders/scraps the amount that the policy φ would order/scrap at the second step if the inventory level were L_n^p − D₁ plus it orders −b = L_n^p − U^b to make up the difference. It also borrows the same amount as the policy φ given that their actual inventory levels are the same. At the following steps, the policies coincide so that the inventory position seen by ψ coincides with φ; the processes couple.

Denote by c(y,d₁(y)) the ordering/scrapping costs for policy φ at the second step. The total ordering costs at the first two steps plus the borrowing/storage cost at stage 2 for policy φ are

Similarly for the policy ψ, we have

All other costs for these two policies coincide. Since C^ψ(y) < C^φ(y) (almost surely), we have v_n^ψ(y) < v_n^ψ(y). Thus, φ is not an optimal policy. This contradiction completes the proof. █

For L_n^p, U_n^p, L^b, and U^b satisfying (3.13)–(3.16), define

and

Lemma 3.3 implies

Combining Lemmas 3.2 and 3.3 and (3.24), we arrive at the major result of this section.

Theorem 3.4: Consider four threshold levels L_n^p, U_n^p, L^b, and U^b satisfying (3.13)–(3.16) and consider L_n^b and U_n^b defined in (3.22) and (3.23). Moreover, for the current inventory level y, let t_n′ be the order up/scrap down to action defined in (3.17) and let z_n′ be the borrow up/store down to action defined by

Then, (3.24) holds and the actions a_n(y) = t_n′ − y and b_n(y) = z_n′ − a_n(y) minimize the right-hand sides of the optimality equations (3.3) and (3.4), respectively. Therefore, for any N = 1,2,…, the policy φ = {d_N,d_N−1,…,d₁} is optimal for the N-step problem, where d_n(x) = (a_n(x),b_n(x)), n = 1,…,N, and −∞ < x < ∞.

Theorem 3.4 implies the existence of an optimal policy that in each period either produces (scraps) up (down) to the level L_n^p (U_n^p) and then potentially borrows (scraps) up (down) to the level L^b (U^b). These ideas are illustrated in the following example.

Example 3.5: Consider a finite horizon discounted cost problem with the following parameters:

The holding cost is assumed quadratic, and when x > 0 units are held in inventory, the holding cost is 0.002x², and when x ≤ 0, the backordering cost is 0.008x². The probability mass function p(·) of demand per period is

After a finite state approximation, the optimal policy can be defined by t_n′ in (3.17), where the optimal production/scrapping levels are

and L_n^b = 0, U_n^b = 2 for all n = 1,2,… in (3.25).

Aside from the observation that there exists several order-up-to or down-to levels, one should also note that the borrowing and storage options are used as secondary options to meet demand. Thus, the borrow-up-to level is higher than the order-up-to level, and the store-down-to level is lower than the scrap-down-to level.

Example 3.5 illustrates that it is possible that an optimal policy uses all four options: producing, scrapping, borrowing, and storing. The following two propositions give sufficient conditions when only ordering/scrapping options should be used and managers should not borrow or store. Proposition 3.6 states that borrowing and storage should not be used when they are relatively expensive. Proposition 3.7 indicates that borrowing and storage should not be used when the demand is either nonnegative or nonpositive.

Proposition 3.6: Suppose the holding and backordering costs are linear,

If e₊ > h₋, then the optimal policy presented in Theorem 3.4 does not borrow. Similarly, if h₊ < e₋, this optimal policy does not store. In addition, if e₊ = h₋, then L^b = −∞ and the optimal policy defined in Theorem 3.4 with L^b = −∞ does not borrow. Similarly, if e₋ = h₊, then U^b = ∞ and the optimal policy defined in Theorem 3.4 with U^b = ∞ does not store.

The case when e₊ = h₋ (e₋ = h₊) has the potential that one could borrow (store) and still be optimal but that this option need not be exercised. This is a consequence of the possibility of multiple optimal borrowing (storage) levels.

Proof: Note that

Thus, (3.9) implies that L^b = −∞ and yields the first result. Now, note that

so that the second result follows by assumption since h₊ < e₋ implies U^b = ∞. The cases e₊ = h₋ and h₊ = e₋ follow from similar considerations. █

Theorem 3.4 implies that

Similarly, for n ≥ 1,

For standard inventory problems with nonnegative demands and zero setup costs, so-called S-policies (also called “order-up-to” or “basestock” policies) are optimal: Always order up to the level S when the inventory level is smaller than S. For finite horizon problems, these order-up-to levels may depend on the stage number. The following statement demonstrates that for inventory problems with nonnegative demands, borrowing should not be used unless borrowing costs per unit are less expensive than ordering costs. Unlike Proposition 3.6, we do not assume that the holding costs are linear.

Proposition 3.7: Suppose c₊ ≤ e₊ and P(D ≥ 0) = 1. Then L^b ≤ L_n^p, and by selecting L^b = L^b in Theorem 3.4, we have that the optimal policy defined by (3.17) and (3.25) never borrows.

Proof: If L^b ≤ L_n^p and L^b = L^b, then L_n^b = L_n^p and the policy defined by (3.17) and (3.25) never borrows. So, we need only prove that L^b ≤ L_n^p. According to (3.5), this inequality is equivalent to

for all z < L^b. We fix z < L^b. Note that from (3.27)

and (3.29) holds for n = 0 as a nonstrict inequality. We consider any n = 1,2,… and make the induction assumption that the left-hand side of (3.29) is nonpositive for n − 1. Differentiating (3.28) yields (recall L^b ≤ L_n^b ≤ U_n^p)

Adding c₊ + (d⁻g*(z)/dz) = c₊ − e₊ to both sides of (3.30) and applying the inductive hypothesis (since D ≥ 0 almost surely) yields that (3.29) holds and the result is proven. █

Since the cases when the demand is nonnegative and nonpositive are symmetric, Proposition 3.7 implies the following corollary.

Corollary 3.8: Suppose c₋ ≤ e₋ and P(D ≤ 0) = 1. Then, U^b ≥ U_n^p, and by selecting U^b = U^b in Theorem 3.4, we have the optimal policy defined by (3.17) and (3.25) never stores.

Finally, we remark that the assumption that v₀ = 0 is simply for convenience; the results of this section hold when v₀ is an arbitrary nonnegative convex function.

4. INFINITE HORIZON DISCOUNTED COST OPTIMAL POLICIES

In order to obtain results analogous to Theorem 3.4 for the infinite horizon discounted problem, it is sufficient to justify taking limits as n approaches infinity on each side of the finite horizon optimality equations (3.3). This result is alluded to for the cash balance problem in [21] and [27], but apparently not shown.

Assume β < 1. Unlike the previous section, we do not suppress β. Since the optimality equations hold for problems with nonnegative costs (see, e.g., Theorem 8.2 in Strauch [39]), we may write the discounted cost optimality equations (DCOEs),

where g(y,b) is defined in (3.2). The system (4.1) is equivalent to

where g* is as defined in (3.4).

The model satisfies the following two conditions: All costs are nonnegative, and for all

and all n = 1,2…, the sets

are compact. Therefore, in view of [5, Prop.1.7, p.148] or [7, Prop. 9.17], v_n,β ↑ v_β as n → ∞. Thus, v_β is a convex function and Lemma 3.1 holds for the objective function v_β and for each of the optimality equations (4.1) and (4.2). Similar to the finite horizon case, we can rewrite the optimality equation (4.2) in the form

We define the numbers L_β^p, L_β^p, U_β^p, and U_β^p defined by (3.5)–(3.8) with v_n−1 replaced by v_β. As in the finite horizon case, note that L_β^p ≤ L_β^p and U_β^p ≤ U_β^p and define lower and upper actual inventory thresholds L_β^p and L_β^p satisfying the inequalities

The following lemma is similar to Lemma 3.2 and has a virtually identical proof, with the only difference being that (4.2) should be considered instead of (3.3).

Lemma 4.1: Consider four threshold levels L_β^p, U_β^p, L^b, and U^b satisfying (4.5), (4.6), (3.15), and (3.16), respectively. The stationary policy that orders up/scraps down to the level

where y is the current inventory level, and borrows up/stores down to the level

is optimal.

Similar to Lemma 3.3, we have that

The proofs of these inequalities coincide with the proof of Lemma 3.3 for n ≥ 2. We also define

. Thus, similar to Theorem 3.4, we have the main result for infinite horizon discounted cost problems.

Theorem 4.2: Consider four threshold levels L_β^p, U_β^p, L^b, and U^b satisfying (4.5), (4.6), (3.15), and (3.16), respectively. The stationary policy defined by the ordering/scrapping decision (4.7) for a current inventory level y and by the borrowing/storage decision to borrow up/store down to the level

is optimal.

Example 4.3: Consider the infinite horizon analog of Example 3.5. An optimal policy is defined by

. This is simply the four-threshold policy from Example 3.5 for n ≥ 6.

For infinite horizon problems, Propositions 3.6 and 3.7 hold for the stationary policy defined in Theorem 4.2. The proofs are virtually unchanged. However, since Theorem 4.2 describes a stationary policy, a stronger version of Proposition 3.7 holds.

Proposition 4.4: Suppose c₊ ≤ e₊ and P(D ≥ 0) = 1. Then, L^b ≤ L_β^p, and by selecting L^b = L^b in Theorem 4.2, we have that the L_β^p policy that prescribes to order at each step up to the level L_β^p, never borrows, never scraps, and never stores is optimal when the initial inventory level y ≤ L_β^p.

Similar to Corollary 3.8, we have the following statement.

Corollary 4.5: Suppose c₋ ≤ e₋ and P(D ≤ 0) = 1. Then, U^b ≥ U_β^p, and by selecting U^b = U^b in Theorem 4.2, we have that the policy that prescribes to scrap at each step down to the level L_β^p, never borrows, never orders, and never stores is optimal when the initial inventory level y ≥ U_β^p.

We remark that in addition to the convergence of the values v_n,β ↑ v_β, the convergence of the optimal policies takes place. Indeed, if

are limit points of sequences of the optimal ordering/scrapping thresholds for the objective function v_n,β, then for any L^b and U^b satisfying (3.15) and (3.16), we have that the four-threshold policy with

defined in Theorem 4.2 is optimal for the infinite horizon problem. This follows, for example, from the remark in [5, p.149].

5. AVERAGE COST PER UNIT TIME OPTIMAL POLICIES

In this section, we extend the previous results for the average cost case. We define the constant

where

As is shown in the next section, for our problem, w < ∞ and there exists a function u(x) with nonnegative finite values such that

where g(y,b) is as defined in (3.2). In addition, there exists a sequence β_n ↑ 1 such that lim_{β_n↑1}(1 − β_n)m_{β_n} = w and u(x) = lim_n→∞{v_{β_n}(x) − m_{β_n}} ≥ 0 for all

. Thus, the function u is nonnegative and convex and u(x) → ∞ as |x| → ∞.

Analogous to the discounted case, an equivalent system is

where g* is defined as in (3.4).

Similar to (3.5)–(3.8), define

and consider L^b, L^b, U^b, and U^b defined by (3.9), (3.10), (3.11), and (3.12), respectively. Consider L^b and U^b satisfying (3.15) and (3.16), respectively. As analogs to (3.13) and (3.14), consider L^p and U^p satisfying the inequalities

Lemma 5.1: Consider four threshold levels L^p, U^p, L^b, and U^b satisfying (5.8), (5.9), (3.15), and (3.16), respectively. The stationary policy that prescribes to order up/scrap down to

where y is the current inventory level, and to borrow up/store down to the level

defines the values a = t′ − y and b = z′ − t′ that minimize the right-hand side of (5.3) and (3.4), respectively. In addition, the infimum of the average costs per unit time, w(y) defined in (2.7), equals the constant w defined in (5.1), and the thresholds t′ and z′ define a stationary optimal policy.

Proof: Similar to Lemma 3.2, the convexity of u and h implies that the policy described minimizes the right-hand sides of (5.3) and (3.4). We recall that a policy for an MDP is called stationary if it is defined by a measurable mapping of the state space into an action space. The interpretation of this mapping is that if the system is at some state, the value of this mapping is the selected action (independent of time). For our problem, a stationary policy φ is a measurable mapping from

, where the first coordinate is how much to order/scrap and the second coordinate is how much to borrow/store. We remark that t′ = t′(y) in (5.10) and z′ = z′(t′) = z′(t′(y)) in (5.11) for an inventory level y. We consider the mapping φ(y) = (a(y),b(y)) = (t′(y),z′(t′(y))). Since the functions t′ and z′ have simple threshold forms, φ is measurable. Thus, we have

According to Schäl [37, Prop. 1.3], if the left-hand side of (5.12) is greater than or equal to the right-hand side, then w^φ(y) = w(y) = w for all

. █

Lemma 5.2: L^p ≤ U^b and L^b ≤ U^p.

Proof: Let φ be the (stationary) policy defined by (5.10) and (5.11). Since φ defines actions that minimize the right-hand sides of (5.3) and (3.4), the policy φ is canonical; see [25, Sect.5.2]; that is, for any n ≥ 1, φ minimizes the criterion

. The rest of the proof follows from the coupling arguments in the proof of Lemma 3.3 for n ≥ 2. █

Corresponding to (3.22) and (3.23), define

Lemma 5.2 implies

, which together with Lemma 5.1 imply our main result for undiscounted costs.

Theorem 5.3: For four thresholds L^p, U^p, L^b, and U^b satisfying (5.8), (5.9), (3.15), and (3.16), respectively, consider the thresholds

defined in (5.13) and (5.14). The stationary policy that prescribes for a current inventory level y to order up/scrap down to the level t′ defined in (5.10) and to borrow up/store down to the level

where y is the current inventory level and

have been defined in (5.13) and (5.14), is optimal.

Example 5.4: Suppose we use the same parameters as Example 3.5, but consider the average cost criterion. An optimal policy is defined by

The following proposition is similar to Proposition 3.6. Its proof is identical to that of Proposition 3.6, with the major difference being that (5.3) should be considered instead of (3.3).

Proposition 5.5: Suppose the holding costs are linear (i.e., (3.26) holds). If e₊ > h₋, then the optimal policy presented in Theorem 5.3 does not borrow. Similarly, if h₊ < e₋, this policy does not store. In addition, if e₊ = h₋, then L^b = −∞ and the optimal policy defined in Theorem 5.3 with L^b = −∞ does not borrow. Similarly, if e₋ = h₊, then U^b = ∞ and the optimal policy defined in Theorem 5.3 with U^b = ∞ does not store.

Analogous to Proposition 4.4 and Corollary 4.5, we have the following two results.

Proposition 5.6: Suppose c₊ ≤ e₊ and P(D ≥ 0) = 1. Then L^b ≤ L^p, and by selecting L^b = L^b in Theorem 5.3 we have that the L^p policy that prescribes to order at each step up to the level L^p, never borrows, never scraps, and never stores, is optimal when the initial inventory level y ≤ L^p.

Corollary 5.7: Suppose c₋ ≤ e₋ and P(D ≤ 0) = 1. Then U^b ≥ U^p, and by selecting U^b = U^b in Theorem 5.3 we have that the policy that prescribes to scrap at each step down to the level U^p, never borrows, never orders, and never stores, is optimal when the initial inventory level y ≥ U^p.

At the end of Section 4, we established that discounted cost finite horizon optimal thresholds converge to discounted cost infinite horizon optimal thresholds. Similarly, discounted cost infinite horizon optimal thresholds converge in some sense to optimal thresholds for the average cost criterion. Indeed, for ordering and scrapping decisions, let

be limit points of a sequence of optimal ordering/scrapping thresholds L_{β_n}^p and U_{β_n}^p as n → ∞, where the sequence {β_n,n ≥ 0} is chosen as discussed in the text following 5.2. Appealing to the results in the the next section implies that

are respectively optimal ordering and scrapping thresholds for the average cost per unit time criterion.

6. THE AVERAGE COST OPTIMALITY EQUATIONS

In this section, we prove the existence of a solution to the ACOEs (5.2) and (3.2). Instead of restricting attention solely to the problem at hand, we provide sufficient conditions for the existence of a solution to the ACOEs for a more general MDP, and then show that these conditions are satisfied for our problem.

Consider an MDP with a standard Borel state space

. Suppose ρ is a metric on

such that ρ(x,y) < ∞ for all

is complete and separable. Assume that the one-step costs c(x,a) are nonnegative and that the MDP satisfies the standard Borel measurability conditions (see, e.g., [37]). The optimal values in the infinite horizon discounted cost case, v_β(x), are defined for discount factors β ∈ [0,1). Let A(x) be the set of available actions at state x and q be the transition probability. Consider the following set of assumptions

Assumptions C:

1. There exists a nondecreasing, continuous function η on R₊ = [0,∞) such that η(0) = 0, η(r) < ∞ for all

for all

, and

for all

and a ∈ A(x).

2. There exists a finite constant w such that

where

3. For

, there exists a compact subset

such that c(x) ≥ N for all

, where c(x) = inf_a∈A(x) c(x,a).

Note that Assumption C1 implies that v_β(x) is a continuous function in x for any β ∈ [0,1). Since v_β(x) ≥ c(x), there exist x_β such that v_β(x_β) = m_β. Let M(β) = {x ∈ X|v_β(x) = m_β}. In particular, M(β) ⊆ K_N for N > m_β. The following lemma is similar to Lemma 4.6 in [37] and to Lemma 4 in [8].

Lemma 6.1: Let Assumptions C2 and C3 hold. Suppose {β(n),n ≥ 1} is a subsequence such that β(n) ↑ 1 and w = lim_n→∞(1 − β(n))m_β(n). Then, there exists a compact subset

and a finite integer [ell ] such that M(β(n)) ⊆ K for n ≥ [ell ].

Proof: Consider N > w + 1. Then, N > lim_n→∞(1 − β(n))m_β(n) + 1 and there exists an integer [ell ] such that for n ≥ [ell ]

Set K = K_N as defined in Assumption C3. We prove by contradiction that K is the set whose existence is postulated by the lemma. Let β satisfy (6.2) and suppose that this is not the case. Then, there exists

; that is, c(x) ≥ N and v_β(x) = m_β. Let T be the first hitting time of K; T = inf{n ≥ 0|X_n ∈ K}. Then, for any stationary policy π,

where the strict inequality follows from (6.2) and the last inequality follows from T ≥ 1 when x ∉ K. Since π is arbitrary, v_β(x) > m_β and the inclusion x ∈ M(β) is not possible. █

We recall that a real-valued function f defined on a metric space Y is called inf-compact if the set

is compact for any

. Note that since compact sets are closed, inf-compactness implies lower-semicontinuity. Consider the following two additional conditions:

Assumptions C (continued):

4. For each fixed

, the function c(x,a) is inf-compact in a.

5. Transition probabilities q(·|x,a) are weakly continuous in a for each

Theorem 6.2: Suppose C hold. Then the following hold:

1. There exists a continuous nonnegative function u on

such that for all

2. There exists a sequence β(n) → 1 such that u(x) = lim_n→∞ u_β(n)(x), where u_β(x) = v_β(x) − m_β,

3. If

is a linear space and the functions v_β are convex for all β ∈ [0,1), then u is convex.

4. For fixed

, let a* be a limit point of a sequence {a_n,n ≥ 1}, where v_β(n)(x) = c(x,a_n) + β(n)∫v_β(n)(y)q(dy|x,a_n) (the point a* exists in view of Lemma 6.1). Then

Proof: Consider the discount cost optimality equations (DCOEs),

In particular, the minimum is achieved in the right-hand side of (6.4) since the expression minimized is lower semicontinuous (the sum of two lower semicontinuous functions). The first function is lower semicontinuous because of Assumption C4 and the second function is lower semicontinuous because of Assumptions C1 and C5. The former implies that v_β(x) is continuous. In addition, v_β(x) ≥ 0.

A little algebra in (6.4) reveals

Utilizing Assumption C2, there exists a sequence β(n) ↑ 1 as n → ∞ such that lim_n→∞ β(n)m_β(n) = w. Lemma 6.1 implies that we can select this sequence such that M(β(n)) ⊆ K, where K is a compact subset of

. Let diam(K) denote the maximal distance between two points in K (the diameter of K). Since K is compact, diam(K) exists and is finite. We fix any point z₀ ∈ K. Then, according to Assumption C1,

where z ∈ K is such that v_β(n)(z) = m_β(n).

Since |u_β(x) − u_β(y)| ≤ ε when η(ρ(x,y)) < ε, the family of functions u_β(n) is equicontinuous. By the Ascoli theorem [25, p.96], there exists a subsequence {β(n_k),k ≥ 1} of the sequence {β(n),n ≥ 1} such that u_{β(n_k)} converges pointwise to a continuous function u and this convergence is uniform on each compact subset of

. In particular, for each

Fix

. We first show that

where the minimum in (6.8) is attained by the same reasons as in (6.3). For each β(n_k), consider a(n_k) such that

Without loss of generality, select the sequence n_k such that |(1 − β(n_k))m_{β(n_k)} + u_{β(n_k)}(x) − w − u(x)| ≤ 1 for all n_k. Assumption C4 implies that all a(n_k) belong to the compact set K_N(x), where N = w + u(x) + 1. Thus, there exists a* ∈ A(x) and a subsequence {m_k} of {n_k} such that a(m_k) → a*. The result obtained by Serfozo's [38] extension of Fatou's lemma (see also Lemma 2.3 in [37]) implies that

Since c is lower semicontinuous,

Thus, (6.8) is proved.

From (6.5), we have that for any a ∈ A(x),

Consider (6.11) for β = β(n) defined above and apply sequentially the Ascoli theorem [25, p.96] and Lebesgue's dominated convergence theorem. The latter is applicable in view of Assumption C1 and (6.6). Indeed, (6.6) yields

and (6.1) implies that

We, thus, have for any a ∈ A(x),

which is equivalent to

The next result shows that the ACOEs (5.2) and (3.2) hold for the inventory problem considered. We remark that Theorem 6.2 also implies the validity of the ACOEs for problems without borrowing and storage (let

in (5.2)).

Proposition 6.3: In the inventory problem considered, there exists a solution to the ACOEs (5.2) and (3.2), (w,u(y)), such that the relative value function u(y) is nonnegative, convex, and equal to the limit of functions u_β(n) described in Theorem 6.2.

Proof: We verify that Assumptions C1–C5 hold so that Theorem 6.2 applies. First, in our case

, ρ(x,y) = |x − y|, and η(r) = r max{c₊,c₋}. Consider two inventory levels x and z. In state x, suppose the manager orders or scraps inventory to bring the level to z, then implements an optimal policy thereafter. Since this policy may not be optimal, v_β(x) ≤ v_β(z) + max{c₊,c₋}|z − x|. In other words,

Since

for any

, h is convex, and h(x) → ∞ as x → ∞, we have

and Assumption C1 is verified.

Fix

. Consider the policy φ that always orders up to level x. The renewal reward theorem implies that w^φ(x) = lim_n→∞(1/n)v_n,1^φ(x) < ∞. Thus, lim inf_β↑1(1 − β)m_β ≤ lim inf_β↑1(1 − β)v_β(x) ≤ lim inf_β↑1(1 − β)v_β^φ(x) = w^φ(x) < ∞ and Assumption C2 holds.

To verify Assumption C3, we must prove that c(x) → ∞ as |x| → ∞. Let x → ∞ and let d = min{c₊,c₋,e₊,e₋}. Note that d > 0 and for

where the first inequality follows by setting y = a + b and applying Jensen's inequality. To verify the second inequality, consider the two possibilities (a)

and (b)

. The case x → −∞ is similar.

Assumption C4 holds since for

, {(a,b)|C(x,(a,b)) ≤ λ} is closed and bounded. The fact that it is bounded follows from the fact that C(x,(a,b)) → ∞, as either a or b → ∞. It is closed since the cost function is convex and, therefore, continuous. Assumption C5 is equivalent to

as a_n → a, for any bounded, continuous f, which follows from Lebesgue's dominated convergence theorem. █

7. CONCLUSIONS

We have studied an extension of the classic inventory control/cash management models to include one-period borrowing and storage. Instead of an optimal policy requiring two thresholds, as has been shown for the cash management problem, we have four thresholds. We expect that in the discounted models (both finite and infinite horizon), a fixed cost could be added for either ordering or scrapping (but not both) and our results could be extended without difficulties to results analogous to those shown in [16,21]. This follows from the fact that the (convex) holding cost in each model would be replaced with the function g*. On the other hand, we do not believe that allowing for fixed costs to be associated with borrowing/storage and ordering/scrapping would lead to such simple policies. Although the borrowing and storage policy would most likely be analogous to (s,S)-policies, it is not immediately clear that the K-convexity would carry through. This is left as a potential future research direction. Other potential research directions are to study problems with lost sales, lead times, and problems with the borrowing/storage time intervals longer than one period.

Acknowledgments

The authors thank Emmanuel Fernandez for the comments regarding the existence of stationary optimal policies for the average cost criterion, and Woonghee Tim Huh and Ganesh Janakiraman, who brought to our attention that condition η(0) = 0 was accidentally missing in Assumption C1 of a preliminary version of this paper. We would also like to thank the anonymous referees for several comments that improved the readability of this article. Research by the first author was partially supported by NSF grant DMI-0300121.

References

REFERENCES

Aneja, Y. & Noori, A.H. (1987). The optimality of (s, s) policies for a stochastic inventory problem with proportional and lump-sum penalty cost. Management Science 33(6): 750–755.Google Scholar

Arrow, K.J., Harris, T., & Marshcak, J. (1951). Optimal inventory policy, Econometrica 53(6): 250–272.Google Scholar

Arslan, H., Ayhan, H., & Olsen, T.L. (2001). Analytic models for when and how to expedite in make-to-order systems. IIE Transactions 33(11): 1019–1029.Google Scholar

Barankin, E. (1961). A delivery-lag inventory model with an emergency provision. Naval Research Logistics Quarterly 8: 285–311.Google Scholar

Bertsekas, D.P. (1995). Dynamic programming and optimal control, Vol. 2. Belmont, MA: Athena Scientific.

Bertsekas, D.P. (2000). Dynamic programming and optimal control, Vol. 1, 2nd ed. Belmont, MA: Athena Scientific.

Bertsekas, D.P. & Shreve, S.E. (1996). Stochastic optimal control: The discrete-time case. Belmont, MA: Athena Scientific.

Cavazos-Cadena, R. & Sennott, L.I. (1992). Comparing recent assumptions for the existence of average optimal stationary policies. Operations Research Letters 11: 33–37.Google Scholar

Chen, X. & Simchi-Levi, D. (2004) A new approach for the stochastic cash balance problem with fixed costs. Preprint.

Chiang, C. & Gutierrez, G.J. (1996). A periodic review inventory system with two supply modes. European Journal of Operational Research 94(3): 527–547.Google Scholar

Chiang, C. & Gutierrez, G.J. (1998). Optimal control policies for a periodic review inventory system with emergency orders. Naval Research Logistics Quarterly 45: 187–204.Google Scholar

Constantinides, G.M. & Richard, S.F. (1978). Existence of optimal simple policies for discounted cost inventory and cash management in continuous time. Operations Research 26(4): 620–636.Google Scholar

Daniel, K. (1963). A delivery-lag inventory model with emergency. In H.E. Scarf, D.M. Gilford, & M.W. Shelly (eds.), Multistage Inventory Models and Techniques. Stanford, CA: Stanford University Press, pp. 32–46.

Dynkin, E. & Yushkevich, A.A. (1979). Controlled Markov processes. New York: Springer-Verlag.CrossRef

Elton, E.J. & Gruber, M.J. (1974). On the cash balance problem. Operational Research Quarterly 25(4): 553–572.Google Scholar

Eppen, G.D. & Fama, E.F. (1969). Cash balance and simple dynamic portfolio problems with proportional costs. International Economic Review 10(2): 119–133.Google Scholar

Feinberg, E.A. & Shwartz, A. (eds.). (2002). Handbook of Markov decision processes: Methods and applications. Boston: Kluwer.

Fernández-Gaucherand, E. (1994). A note on the Ross–Taylor theorem. Applied Mathematics and Computation 64(2–3): 207–212.Google Scholar

Fernández-Gaucherand, E., Arapostathis, A., & Marcus, S.I. (1992). Convex stochastic control problems. In Proceedings of the 31st Conference on Decision and Control, pp. 2179–2180.CrossRef

Fleischmann, M., Kuik, R., & Dekker, R. (2002). Controlling inventories with stochastic item returns: A basic model. European Journal of Operational Research 138: 63–75.Google Scholar

Girgis, N.M. (1968). Optimal cash balance levels. Management Science 15(3): 130–140.Google Scholar

Gubenko, L. & Statland, E. (1975). On controlled discrete-time Markov decision processes. Theory Probability and Mathematical Statistics 7: 47–61.Google Scholar

Harris, T. (1913). How many parts to make at once. Factory, the Magazine of Management 10: 135–136, 152.Google Scholar

Hartley, R. (1980). Dynamic programming and an undiscounted, infinite horizon, convex stochastic control problem. In R. Hartley, L. Thomas, & D. White (eds.), Recent developments in Markov decision processes. London: Academic Press, pp. 277–300.

Hernández-Lerma, O. & Lasserre, J.B. (1996). Discrete-time Markov control processes: Basic optimality criteria. New York: Springer-Verlag.

Heyman, D.P. (1977). Optimal disposal policies for a single-item inventory system with returns. Naval Research Logistics Quarterly 24: 385–405.Google Scholar

Heyman, D.P. & Sobel, M.J. (1984). Stochastic models in operations research, Vol. II. New York: McGraw-Hill.

Hinderer, K. & Waldmann, K-H. (2001). Cash management in a randomly varying environment. European Journal of Operational Research 130: 468–485.Google Scholar

Huggins, E.L. & Olsen, T.L. (2003). Inventory control with overtime and premium freight. Preprint.

Neave, E.H. (1970). The stochastic cash balance problem with fixed costs for increases and decreases. Management Science 16(7): 472–490.Google Scholar

Neuts, M. (1964). An inventory model with optional lag time. SIAM Journal of Applied Mathematics 12: 179–185.Google Scholar

Porteus, E. (1990). Stochastic inventory theory. In D. Heyman & M. Sobel (eds.), Handbooks in operations research and management science, Vol. 2. Amsterdam: Elsevier Science Publishers, pp. 605–652.

Puterman, M.L. (1994). Markov decision processes: Discrete stochastic dynamic programming. New York: Wiley.CrossRef

Ritt, R. & Sennott, L. (1992). Optimal stationary policies in general state Markov decision chains with finite action sets. Mathematics of Operations Research 17: 901–909.Google Scholar

Ross, S. (1968). Arbitrary state Markovian decision processes. Annals of Mathematical Statistics 39: 2118–2122.Google Scholar

Ross, S. (1983). Introduction to stochastic dynamic programming. New York: Academic Press.

Schäl, M. (1993). Average optimality in dynamic programming with general state space. Mathematics of Operations Research 18: 163–172.Google Scholar

Serfozo, R. (1982). Convergence of Lebesgue integrals with varying measures. Sankhya Series A 44: 380–402.Google Scholar

Strauch, R.E. (1966). Negative dynamic programming. Annals of Mathematical Statistics 37: 871–890.Google Scholar

Tagaras, G. & Vlachos, D. (2001). A periodic review inventory system with emergency replenishments. Management Science 47(3): 415–429.Google Scholar

van der Laan, E. & Salomon, M. (1997). Production planning and inventory control with remanufacturing and disposal. European Journal of Operational Research 102: 264–278.Google Scholar

Article contents

OPTIMALITY OF FOUR-THRESHOLD POLICIES IN INVENTORY SYSTEMS WITH CUSTOMER RETURNS AND BORROWING/STORAGE OPTIONS

Abstract

1. INTRODUCTION

2. PROBLEM DESCRIPTION

3. FINITE HORIZON DISCOUNTED COST OPTIMAL POLICIES

4. INFINITE HORIZON DISCOUNTED COST OPTIMAL POLICIES

5. AVERAGE COST PER UNIT TIME OPTIMAL POLICIES

6. THE AVERAGE COST OPTIMALITY EQUATIONS

7. CONCLUSIONS

Acknowledgments

References

REFERENCES

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests