OPTIMALITY OF RANDOMIZED TRUNK RESERVATION FOR A PROBLEM WITH MULTIPLE CONSTRAINTS

Xiaofei Fan-Orzechowski; Eugene A. Feinberg

doi:10.1017/S026996480707012X

OPTIMALITY OF RANDOMIZED TRUNK RESERVATION FOR A PROBLEM WITH MULTIPLE CONSTRAINTS

Published online by Cambridge University Press: 27 February 2007

Xiaofei Fan-Orzechowski and

Eugene A. Feinberg

Show author details

Xiaofei Fan-Orzechowski: Affiliation:
Department of Applied Mathematics and Statistics, State University of New York at Stony Brook, Stony Brook, NY 11794-3600, E-mail: xfan@ams.sunysb.edu; Eugene.Feinberg@sunysb.edu
Eugene A. Feinberg: Affiliation:
Department of Applied Mathematics and Statistics, State University of New York at Stony Brook, Stony Brook, NY 11794-3600, E-mail: xfan@ams.sunysb.edu; Eugene.Feinberg@sunysb.edu

Article contents

Abstract
1. INTRODUCTION AND PROBLEM FORMULATION
2. SEMI-MARKOV DECISION MODEL AND MAIN RESULTS
3. PROOFS
Acknowledgment
References

Rights & Permissions

Abstract

We study the optimal admission of arriving customers to a Markovian finite-capacity queue (e.g., M/M/c/N queue) with several customer types. The system managers are paid for serving customers and penalized for rejecting them. The rewards and penalties depend on customer types. The penalties are modeled by a K-dimensional cost vector, K ≥ 1. The goal is to maximize the average rewards per unit time subject to the K constraints on the average costs per unit time. Let Km denote min{K,m − 1}, where m is the number of customer types. For a feasible problem, we show the existence of a Km-randomized trunk reservation optimal policy, where the acceptance thresholds for different customer types are ordered according to a linear combination of the service rewards and rejection costs. Additionally, we prove that any Km-randomized stationary optimal policy has this structure.

Type: Research Article
Information: Probability in the Engineering and Informational Sciences , Volume 21 , Issue 2 , April 2007 , pp. 189 - 200

DOI: https://doi.org/10.1017/S026996480707012X [Opens in a new window]
Copyright: © 2007 Cambridge University Press

1. INTRODUCTION AND PROBLEM FORMULATION

We consider a controlled finite-capacity Markovian queue with m types of customer, where m = 1,2,… . Type i customers arrive according to a Poisson process with the intensity λ_i, i = 1,…, m, and these m arrival Poisson processes are independent. When a customer arrives, its type becomes known. When there are N customers in the system, the system is full and new arrivals are lost. If the system is not full, upon an arrival of a new customer, a decision to accept or reject this customer is made. A positive reward r_i is collected for serving an accepted type i customer. A nonnegative cost vector C_i = (C_1,i,C_2,i,…,C_K,i)^τ is incurred due to the rejection or loss of an arriving type i customer, where K is the number of constraints in this problem. We assume that the service time of a customer does not depend on the customer type. When there are n customers in the queue, the departure rate is μ_n, n = 1,…, N. The numbers μ_n, n = 1,…, N, satisfy the condition μ_n−1 ≤ μ_n, where μ₀ = 0 and μ₁ > 0. For example, for an M/M/c/N queue, for some μ > 0,

Note that, unless otherwise specified, we do not assume that r₁ ≥ r₂ ≥ ··· ≥ r_m.

Our goal is to maximize the average rewards per unit time, subject to multiple constraints on the average costs per unit time. This research is motivated by the following question: What is the structure of optimal policies for the problem when the blocking probabilities for some of the customer types are not allowed to exceed given numbers? The answer to this question is given in Corollary 2.4. Additionally, Corollary 2.5 provides an answer to a more general problem when different types of customer can be pooled into groups with common blocking probability constraints on each group. Previously, Fan-Orzechowski and Feinberg [3] solved such a problem with a single constraint on the blocking probability of one type of customer or one group. Feinberg and Reiman [7] solved a more particular problem when all of the rewards are different and the constraint on the blocking probability is applied to the most profitable type of customer.

Consider [Kscr ] = 0,1,…, m − 1. A [Kscr ]-randomized trunk reservation policy φ is defined by m numbers M_i^φ, 0 ≤ M_i^φ ≤ N − 1, i = 1,…, m, called the thresholds. Of these thresholds, at most [Kscr ] numbers are noninteger and at least one number equals N − 1. For a number M we denote the integer part of M by [lfloor ]M[rfloor ]. If the system is controlled by the policy φ, a type i arrival will be admitted with probability 1 if it sees no more than [lfloor ]M_i^φ[rfloor ] customers in the system, it will be rejected if the number of customers it sees in the system exceeds [lfloor ]M_i^φ[rfloor ] + 1, and it will be accepted with the probability (M_i^φ − [lfloor ]M_i^φ[rfloor ]) and rejected with the probability 1 − (M_i^φ − [lfloor ]M_i^φ[rfloor ]) if it sees exactly [lfloor ]M_i^φ[rfloor ] + 1 customers in the system at the arrival epoch. In particular, if the number M_i^φ is an integer, a type i arrival will be admitted if and only if it sees no more that M_i^φ customers in the system. Thus, M_i^φ = N − 1 means that a type i arrival is admitted whenever the system is not full. A randomized trunk reservation policy φ is called consistent with a function r′ defined on the set {1,…, m} if r_i′ > r_j′ implies M_i^φ ≥ M_j^φ for i,j = 1,…, m. If all of the thresholds are integers, the randomized trunk reservation policy is called a trunk reservation policy. We sometimes write M_i instead of M_i^φ for the thresholds when there is only one policy in the context and no confusion will occur.

In this article we allow the possibility that the number of constraints K is not necessarily less than m and we introduce K_m = min{K,m − 1}. We prove that if the problem is feasible, there exists a K_m-randomized trunk reservation policy consistent with the reward function

where u_k ≥ 0 is the Lagrange multiplier with respect to the kth constraint of the linear programming problem formulated in this article. Additionally, Theorem 2.2 shows that any K_m-randomized stationary optimal policy is a K_m-randomized trunk reservation policy consistent with r′.

In Feinberg and Reiman [7, Sects.6 and 7], several other types of optimal policy and optimal nonrandomized strategy were constructed by transforming an optimal randomized trunk reservation policy. Similar results can be obtained for the more general problem considered in this article. In fact, these constructions hold as long as the optimality of randomized trunk reservation policies is established.

Miller [13] studied a one-criterion problem for an M/M/c/loss queue when r₁ > r₂ > ··· > r_m. In this case, there exists an optimal nonrandomized trunk reservation policy consistent with r. In other words, all of the thresholds M_i are integers and N − 1 = M₁ ≥ M₂ ≥ ··· ≥ M_m. Feinberg and Reiman [7] studied a constrained problem with r₁ > r₂ > ··· > r_m, where the goal was to maximize average rewards per unit time subject to the constraint that the blocking probability for type 1 customers does not exceed a given level. Feinberg and Reiman [7] proved the existence of an optimal 1-randomized trunk reservation policy with N − 1 = M₁ ≥ M₂ ≥ ··· ≥ M_m. Fan-Orzechowski and Feinberg [3] considered a problem with a single constraint on the average costs. The goal was to maximize the average rewards per unit time subject to this constraint. They proved the existence of a 1-randomized trunk reservation policy consistent with the reward function

where u₁ ≥ 0 is the Lagrange multiplier with respect to the first constraint of the linear programming problem formulated in that article. In particular, Fan-Orzechowski and Feinberg [3] solved the problem with one constraint on the blocking probability for type k customers, k = 1,…, m.

In addition to Miller's [13] classical problem formulation, various versions and generalizations of the admission problem have been studied in the literature. See the references found in Fan-Orzechowski and Feinberg [3]. Recent research in this area includes Lewis, Ayhan, and Foley [9,10], Lewis [8], Lin and Ross [11,12], Altman, Jimenez, and Koole [1], and Altman [2]. If the service times depend on customer types or different types of customer require different numbers of servers, the problem becomes NP-hard and trunk reservation might not be optimal; see Ross [14, p.137] and Altman et al. [1].

The rest of this article is organized as follows. Section 2 contains the problem formulation, introduces a linear program (LP) that identifies an optimal policy, and states the main result—the optimality of randomized trunk reservation policies and its corollaries. Section 3 introduces the Lagrangian relaxation, which reduces the number of constraints, and provides the proofs.

2. SEMI-MARKOV DECISION MODEL AND MAIN RESULTS

Following Feinberg and Reiman [7] and Fan-Orzechowski and Feinberg [3], we model the problem via a semi-Markov decision process (SMDP). Since the sojourn times between actions are exponentially distributed, this problem is an exponential semi-Markov decision process (ESMDP); see Feinberg [6] for details. Notice that this problem can also be formulated as a continuous-time Markov decision process (CMDP). The extra technical difficulty in using CMDP is to prove that the controlled process has no absorbing states; see Feinberg [5]. We can then reach the same preliminary result as by using the SMDP model in this article; when the problem is feasible, there exists a randomized stationary optimal policy that uses a randomization procedure in at most K_m states.

In the framework of an SMDP model, we define the state space I = {0,1,…,N − 1} ∪ ({0,1,…, N} × {1,…, m}), which represents the system state at the departure and arrival epochs. If the state of the system is n = 0,…, N − 1, this means that a departing customer leaves n customers in the system. The state (n,i) means that an arrival of type i sees n customers in the system.

The action set A = {0,1}. For n = 0,…, N − 1 and i = 1,…, m, we set A(n,i) = A = {0,1} and A(N,i) = {0}, where the action 0 means that the type i arrival should be rejected or is lost and the action 1 means that it should be accepted. In any state n = 0…,N − 1, we set A(n) = {0}.

Let τ(s,a) denote the average time that the system spends in a state s ∈ I if action a ∈ A(s) is chosen in this state. Let p(s,s′,a) be the transition probability from the state s to s′ if action a ∈ A(s) is chosen. Note that p(s,s′,a) does not depend on the sojourn time in state s under action a and this sojourn time has an exponential distribution with the average τ(s,a) . For notational convenience, we write τ(n) and p(n,s′) instead of τ(n,0) and p(n,s′,0), respectively, for n = 0,…, N − 1, s′ ∈ I. Denote

We have τ(n) = (μ_n + Λ)⁻¹, where n = 0,…, N − 1. Also, for i = 1,…, m,

For n = 0,…, N − 1 and i = 1,…, m,

and

Suppose that the reward is accrued upon acceptance of a customer and costs are incurred upon rejection. The reward function is

and for k = 1,…, K, the kth cost function is

In addition, r(n,0) = c_k(n,0) = 0 for all n = 0,…, N − 1 and for all k = 1,…, K.

We define the long-run average rewards earned by the system under strategy π as

and the long-run average costs of the system under strategy π as

where z is the initial state, x_n is the state at epoch t_n,

is the expectation operator for the initial state z and strategy π, and N(t) = max{n : t_n ≤ t} is the number of jumps by time t.

A strategy is called a randomized stationary policy if assigned actions a_n depend only on the current state x_n. Additionally, if a_n is a deterministic function of x_n, the corresponding strategy is called a stationary policy. Since the action sets are nonsingletons only at the arrival epochs, a randomized stationary policy φ for our problem is defined by φ(n,i), n = 0,…, N − 1, i = 1,…, m, the probability of accepting an arrival of type i when the arrival sees n customers in the system. A randomized stationary policy φ is called k-randomized stationary, k = 0,1,…, if the number of states (n,i) such that 0 < φ(n,i) < 1 is less than or equal to k. The notions of stationary and 0-randomized stationary policies coincide.

Notice that the unichain condition (any stationary policy defines a Markov chain with one recurrent class) holds for this model. If an SMDP satisfies the unichain condition, then, according to Feinberg [4, Thm.9.2], for a feasible problem under the average rewards per unit time criterion with K constraints, there exists an optimal K-randomized stationary policy. Therefore, similar to Fan-Orzechowski and Feinberg [3], we model our problem as follows:

where a randomized stationary policy φ is the variable and G_k are real numbers.

Consider the following LP with variables (x,P), where x = {x(n,i) : n = 0,…, N − 1,i = 1,…, m} and P = (P₀,…, P_N).

The variables x and P have the following meaning: x(n,i) is the joint probability that a type i arrival sees n customers in the system and this arrival is accepted; P_n is the limiting probability that there are n customers in the system. For the queue controlled by any randomized stationary policy, its associated quantities (x,P) satisfy

, since Poisson arrivals see time averages (PASTA).

Constraints (2.4) and (2.5) correspond to the birth-and-death balance equations and constraints (2.3) correspond to K cost constraints. Constraints (2.5) and (2.6) imply that the set of feasible solutions of the LP (2.2)–(2.6) is bounded and, therefore, if a feasible solution exists, the LP has an optimal solution.

For a vector (x,P) satisfying (2.4)–(2.6), consider a randomized stationary policy φ such that

The following theorem links problem (2.1) and LP (2.2)–(2.6). We recall that K_m = min{K,m − 1}.

Theorem 2.1:

(i) A randomized stationary policy φ is feasible for problem (2.1) if and only if (2.7) holds for a feasible vector (x,P) of LP (2.2)–(2.6).

(ii) If (x,P) is an optimal solution of LP (2.2)–(2.6), then P_n > 0 for all n = 0,1,…, N.

(iii) A randomized stationary policy φ is optimal for problem (2.1) if and only if the equation

holds for an optimal solution (x,P) of LP (2.2)–(2.6). Additionally, if (x,P) is a basic optimal solution of LP (2.2)–(2.6), then the policy φ defined by (2.8) is K_m-randomized stationary optimal.

The proof of Theorem 2.1 is presented in Section 3. In particular, Theorem 2.1(iii) implies that if problem (2.1) is feasible, then there exists an optimal K_m-randomized stationary policy. Let (u,v), where u = (u₁,…,u_K,u_K+1,…,u_K+2Nm) and v = (v₀,…,v_N), be an optimal solution of the LP dual to (2.2)–(2.6). The dual vector u^K = (u₁,…,u_K) corresponds to constraints (2.3), the dual vector (u_K+1,…,u_K+2Nm) corresponds to the inequality constraints (2.6), the dual vector (v₀,…,v_N−1) corresponds to constraints (2.4), and the dual variable v_N corresponds to the constraint (2.5). We call u and v the Lagrange multipliers for LP (2.2)–(2.6) and note that u_k ≥ 0, k = 1,…, K; see [3, Appendix B] for additional details.

The following theorem, which is the main result of this work, implies the existence of optimal K_m-randomized trunk reservation policies stated in its Corollary 2.3.

Theorem 2.2: Any K_m-randomized stationary optimal policy for problem (2.1) is a K_m-randomized trunk reservation policy, which is consistent with the reward function r_i′ = r_i + u^KC_i, i = 1,…, m, where u^K = (u₁,u₂,…,u_K) ≥ 0 is a vector of Lagrange multipliers with respect to constraints (2.3) in LP (2.2)–(2.6).

Corollary 2.3: If problem (2.1) is feasible, then there exists an optimal K_m-randomized trunk reservation policy consistent with the reward function r′ defined in Theorem 2.2.

To conclude this section, consider two important corollaries of Theorem 2.2. Corollary 2.4 deals with the situation when there are constraints on the blocking probabilities of certain customer types. Corollary 2.5 deals with the more general situation when there are blocking probability constraints on several customer types pooled together.

According to [7, p.471], for the costs vector C_i, i = 1,…, m, with the coordinates C_k,i, k = 1,…, K,

the average cost W_k(z,π) is the blocking probability for type k customers.

Corollary 2.4: Consider a special case of problem (2.1) when there is a fixed subset J = {i₁,…, i_K} of customer types and the goal is to maximize the average rewards per unit time subject to K constraints on the blocking probabilities for each of these customer types. For this problem, introduce the costs C_k,i defined by (2.9) when i = i_k and C_k,i = 0 when i ∉ J, k = 1,…, K, i = 1,…, m. If this problem is feasible, then any K_m-randomized stationary optimal policy is K_m-randomized trunk reservation policy consistent with the reward function r′ defined as

where u^K = (u₁,u₂,…,u_K) ≥ 0 is the vector of Lagrange multipliers with respect to constraints (2.3) of LP (2.2)–(2.6). Therefore, there exists a K_m-randomized trunk reservation optimal policy consistent with the reward r′ if the problem is feasible.

Consider a more general problem when there are K subsets of the customer types J₁,…, J_K and the goal is to maximize the average rewards per unit time subject to the constraints on the blocking probabilities for the customer types from the class J_k pooled together, k = 1,…, K. For K = 1, this problem was solved in [3, Cor. 2.5]. When each subset J_k, k = 1,…, K is a singleton, the solution is described in Corollary 2.4.

Let

. Consider the cost functions C_k, k = 1,…, K, defined as

Corollary 2.5: Consider a special case of problem (2.1) when there are K fixed groups J₁,…, J_K of customer types and the goal is to maximize the average rewards per unit time subject to K constraints on the blocking probabilities for the customers from each of these customer groups pooled together within a group. If this problem is feasible, then any K_m-randomized stationary optimal policy is a K_m-randomized trunk reservation policy consistent with the rewards

, where i = 1,…, m, and u^K = (u₁,u₂,…,u_K) ≥ 0 is the vector of Lagrange multipliers with respect to constraints (2.3) of LP (2.2)–(2.6) with the costs C_K,i defined in (2.10). Therefore, there exists a K_m-randomized trunk reservation optimal policy consistent with the reward r′ if the problem is feasible.

3. PROOFS

In view of (2.5) and (2.6), the feasible region of LP (2.2)–(2.6) is bounded. Therefore, this LP has an optimal solution if it is feasible. If LP (2.2)–(2.6) is feasible, consider the vector of Lagrange multipliers u^K = (u₁,…,u_K) with respect to constraints (2.3) in LP (2.2)–(2.6). If LP (2.2)–(2.6) is feasible, we introduce the following LP:

Lemma B.1 in [3] implies the following result.

Lemma 3.1: If LP (2.2)–(2.6) is feasible, then (i) any optimal solution of LP (2.2)–(2.6) is an optimal solution of LP (3.1) and (ii) the optimal values of objective functions for these two LPs are equal.

This lemma plays an important role in the proof of the main result of this article: Theorem 2.2. We refer to this technique as the Lagrangian relaxation in this article. In general, the Lagrangian relaxation refers to using a weak duality theorem to obtain lower bounds for a nonlinear programming problem. We use the term “relaxation” here in the sense that, although the optimal set is enlarged after the transformation, the optimal value remains the same.

Consider the unconstrained problem

Similar to [3], we are interested in this problem when the rewards for served customers are equal to r_i′, i = 1,…, m, where the function r′ was defined in Theorem 2.2. Since the second term in the objection function of LP (3.1) is constant, in view of Theorem 2.1 an optimal solution (x,P) of LP (3.1) defines via (2.8) an optimal randomized stationary policy φ for problem (3.2) with the reward functions r′.

To prove Theorem 2.2, we first formulate certain generic properties of optimal solutions of problem (3.2) with the rewards r and then apply them to the specific case of the rewards r′. Note that we do not assume distinct rewards among different customer types. This extension might not seem significant at first glance, but it is important. In fact, even if we assume that all of the rewards r_i are distinct, after the Lagrangian relaxation it is possible that r_i′ = r_j′ for some i,j = 1,…, m in a new unconstrained problem.

Lemma 3.2 [3, Cor. 2.1]:

(i) If (x,P) is an optimal solution of LP (2.2), (2.4)–(2.6), then P_n > 0 for all n = 0,1,…, N.

(ii) A randomized stationary policy φ is optimal for problem (3.2) if and only if (2.8) holds for an optimal solution (x,P) of LP (2.2), (2.4)–(2.6). In addition, if (x,P) is a basic optimal solution of LP (2.2), (2.4)–(2.6), then the policy φ defined in (2.8) is (nonrandomized) stationary optimal.

The following lemma is used in the proof of Theorems 2.1 and 2.2.

Lemma 3.3 [3, Lemma 3.3]: Consider any randomized stationary optimal policy φ for the unconstrained problem (3.2).

(i)For any i and j, such that r_i > r_j,

(ii) For each n = 0,…, N − 1, if there exist two customer types j₁ and j₂ such that 0 < φ(n,j) < 1, j = j₁,j₂, then r_j₁ = r_j₂. In particular, if all of the rewards r₁,…, r_m are different, then for each n = 0,…, N − 1, all of the probabilities φ(n,j), j = 1,…, m, except at most one, are equal to either zero or one.

(iii) There exists at least one customer type, say type [ell ], such that

In particular, if r_j = max{r_i|i = 1,…, m}, then (3.4) holds with [ell ] = j.

(iv)

and for each j = 1,…, m, all of the probabilities φ(n,j), n = 0,…, N − 1, except at most one, are equal to either zero or one.

Corollary 3.4: Any randomized stationary optimal policy φ for the unconstrained problem (3.2) is an (m − 1)-randomized trunk reservation policy consistent with the rewards r_i. Additionally, if all of the rewards r_i are distinct, such a policy is s-randomized, where s = min{m − 1,N}.

Corollary 3.5: Any stationary optimal policy φ for the unconstrained problem (3.2) is a trunk reservation policy consistent with the rewards r_i.

Proof of Theorem 2.1: The proof of (i) and (ii) and the first statement of (iii) are identical to the proof of Theorem 2.1 in [3]. We will prove the second statement of (iii). First, consider the case when K < m. We represent LP (2.2)–(2.6) in a standard LP form, where nonnegative variables S_k, k = 1,…, K, are introduced to replace (2.3) with

and nonnegative variables y(n,i), n = 0,…, N − 1, i = 1,…, m, are introduced to replace (2.6) with x(n,i) + y(n,i) = P_n. There are K + N + 1 + N × m constraints and K + 1 + N + 2(N × m) variables for this new LP. Therefore, any basic optimal solution of this new LP has at most K + N + 1 + N × m basic variables. Since P_n, n = 0,…, N, are positive, there are at most K + N × m basic variables among x(n,i) and y(n,i). Because x(n,i) + y(n,i) = P_n > 0, x(n,i) and y(n,i) cannot be equal to zero simultaneously. Therefore, for each pair (n,i), either x(n,i) = 0 or y(n,i) = 0, except at most K pairs where both x(n,i) and y(n,i) are not equal to zero. Since φ(n,i) = x(n,i)/P_n, we have that for all pairs (n,i), except at most K, φ(n,i) equals either zero or one. Therefore, the policy φ is K-randomized stationary optimal. For K ≥ m, we note that the matrix C = (C_k,i) is K × m for constraints in (2.3) and, thus, its rank is at most m. After removing redundant constraints in (2.3), if there are l constraints left in (2.3) and l < m, we are back to the previous case and the proof is complete. If l = m, the only extra piece we need to add is to show that the policy φ is (m − 1)-randomized. Lemmas 3.1 and 3.2 imply that φ is optimal for an unconstrained problem. The rest follows from Lemma 3.3(iii). █

We remark that, in the case of K ≥ m, it is also intuitive that the randomized trunk reservation policy actually has at most m − 1 randomized entries. Indeed, for the most profitable customer type, who has the highest reward r′, the system will always accept it when it is not full. Otherwise the system idles and waits to serve the next potentially less profitable customer. Thus, such a policy is suboptimal. This is also consistent with the definition of a [Kscr ]-randomized trunk reservation, in which at least one threshold equals N − 1.

Proof of Theorem 2.2: Consider any K_m-randomized stationary optimal policy φ for problem (2.1). Lemma 3.2(ii), Theorem 2.1(iii), and Lemma 3.1 imply that φ is optimal for an unconstrained problem with the rewards

. Lemma 3.3 implies that φ is a randomized trunk reservation policy consistent with

. █

Acknowledgment

This research was partially supported by NSF grants DMI-0300121 and DMI-0600538.

References

REFERENCES

Altman, E. (2002). Applications of Markov decision processes in telecommunications: A survey. In E. Feinberg & A. Shwartz (eds.), Handbook on Markov decision processes. New York: Kluwer, pp. 489–536.CrossRef

Altman, E., Jimenez, T., & Koole, G. (2001). On optimal call admission control in a resource-sharing system. IEEE Transactions on Communications 49: 1659–1668.Google Scholar

Fan-Orzechowski, X. & Feinberg, E. A. (2006). Optimality of randomized trunk reservation for a problem with a single constraint. Advances in Applied Probability 38: 199–220.Google Scholar

Feinberg, E.A. (1994). Constrained semi-Markov decision processes with average rewards. Mathematical Methods of Operations Research 39: 257–288.Google Scholar

Feinberg, E.A. (2002). Constrained finite continuous-time Markov decision processes with average rewards. In Proceedings of IEEE 2002 Conference on Decisions and Control, December 10–13, 2002, Las Vegas, pp. 3805–3810.

Feinberg, E.A. (2004). Continuous time discounted jump Markov decision processes: A discrete-event approach. Mathematics of Operations Research 29: 492–524.Google Scholar

Feinberg, E.A. & Reiman, M.I. (1994). Optimality of randomized trunk reservation. Probability in the Engineering and Informational Sciences 8: 463–489.Google Scholar

Lewis, M.E. (2001). Average optimal policies in a controlled queueing system with dual admission control. Journal of Applied Probability 38: 369–385.Google Scholar

Lewis, M.E., Ayhan, H., & Foley, R.D. (1999). Bias optimality in a queue with admission control. Probability in the Engineering and Informational Sciences 13: 309–327.Google Scholar

Lewis, M.E., Ayhan, H., & Foley, R.D. (2002). Bias optimal admission policies for a nonstationary multi-class queueing system. Journal of Applied Probability 39: 20–37.Google Scholar

Lin, K.Y. & Ross, S.M. (2003). Admission control with incomplete information of a queueing system. Operations Research 51: 645–654.Google Scholar

Lin, K.Y. & Ross, S.M. (2004). Optimal admission control for a single-server loss queue. Journal of Applied Probability 41: 535–546.Google Scholar

Miller, B.L. (1969). A queueing reward system with several customer classes. Management Science 16: 235–245.Google Scholar

Ross, K.W. (1995). Multiservice loss models for broadband telecommunication networks. London: Springer-Verlag.CrossRef

Article contents

OPTIMALITY OF RANDOMIZED TRUNK RESERVATION FOR A PROBLEM WITH MULTIPLE CONSTRAINTS

Abstract

1. INTRODUCTION AND PROBLEM FORMULATION

2. SEMI-MARKOV DECISION MODEL AND MAIN RESULTS

3. PROOFS

Acknowledgment

References

REFERENCES

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests