DYNAMIC ASSIGNMENT OF DEDICATED AND FLEXIBLE SERVERS IN TANDEM LINES

Sigrún Andradóttir; Hayriye Ayhan; Douglas G. Down

doi:10.1017/S0269964807000290

Published online by Cambridge University Press: 22 October 2007

Sigrún Andradóttir: Affiliation:
H. Milton Stewart School of Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta, GA 30332-0205 E-mail: hayhan@isye.gatech.edu
Hayriye Ayhan: Affiliation:
H. Milton Stewart School of Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta, GA 30332-0205 E-mail: hayhan@isye.gatech.edu
Douglas G. Down: Affiliation:
Department of Computing and Software, McMaster University, Hamilton, Ontario L8S 4L7, Canada

Abstract

Consider a system of queuing stations in tandem having both flexible servers (who are capable of working at multiple stations) and dedicated servers (who can only work at the station to which they are dedicated). We study the dynamic assignment of servers to stations in such systems with the goal of maximizing the long-run average throughput. We also investigate how the number of flexible servers influences the throughput and compare the improvement that is obtained by cross-training another server (i.e., increasing flexibility) with the improvement obtained by adding a resource (i.e., a new server or a buffer space). Finally, we show that having only one flexible server is sufficient for achieving near-optimal throughput in certain systems with moderate to large buffer sizes (the optimal throughput is attained by having all servers flexible). Our focus is on systems with generalist servers who are equally skilled at all tasks, but we also consider systems with arbitrary service rates.

Type: Research Article
Information: Probability in the Engineering and Informational Sciences, Volume 21, Issue 4, October 2007, pp. 497-538

DOI: https://doi.org/10.1017/S0269964807000290
Copyright: Copyright © Cambridge University Press 2007

1. INTRODUCTION

We consider a tandem queuing network with N stations and M servers. There is an infinite supply of jobs in front of station 1 and infinite room for completed jobs after station N. At any given time, there can be at most one job in service at each station and each server can work on at most one job. We assume that each server i ∈ {1, 2, … , M} works at a deterministic rate μ_ij ∈ [0,∞) at each station j ∈ {1, 2, … , N}. Hence, server i is trained to work at station j if μ_ij > 0. Assume throughout that ∑_{j=1}^N μ_ij > 0 for i = 1, … , M (because otherwise we can reduce M). It is possible for several servers to work together on a job, in which case the service rates are assumed to be additive. The service requirements of the different jobs at station j ∈ {1, … , N} are independent and identically distributed random variables with rate μ(j) and the service requirements at different stations are independent of each other. Without loss of generality, we assume that μ(j) = 1 for all j ∈ {1, … , N}. We focus on the case when the buffers between the stations are finite but also consider systems with infinite buffers. We assume that the network operates under the manufacturing blocking mechanism with respect to placing jobs in finite buffers. For the majority of the article, our focus is on systems with exponential service requirements, N = 2 stations, and 1 ≤ M ≤ 3 servers. However, some of our results (including numerical experiments) are for more general systems, and the insights obtained from the results for N = 2 stations and 1 ≤ M ≤ 3 servers might help with developing effective server assignment policies for systems with more than two stations in which the number of available servers is not radically different from the number of stations.

Under the assumption that l ≥ 0 of the M servers are flexible and M − l of them are dedicated to particular stations, our objective is to determine the dynamic server assignment policy that maximizes the long-run average throughput. More specifically, we would like to determine which servers should be dedicated (flexible), to which stations the dedicated servers should be assigned, and the dynamic allocation of the flexible servers to stations. For simplicity, we assume that the travel and setup times associated with a server moving between stations are negligible. Even though we consider systems with arbitrary service rates μ_ij, where i ∈ {1, … , M} and j ∈ {1, … , N}, our focus in this article is on systems with generalist servers. In this case, the service rate of each server at each station can be expressed as the product of two constants: one representing the server's speed at every task and the other representing the intrinsic difficulty of the task at the station (i.e., μ_ij = μ_iγ_j for all i ∈ {1, … , M} and j ∈ {1, … , N}). We refer to this case as generalist servers because each server is equally skilled at all tasks and, hence, the same set of servers is better at all tasks in the network. Note that this generalizes the notion of the generalist servers provided in Andradóttir, Ayhan, and Down [Reference Andradóttir, Ayhan and Down6] (they only consider the cases where either γ_j = 1 for all j ∈ {1, … , N} or μ_i = 1 for all i ∈ {1, … , M}).

Let Π be the set of server assignment policies with l flexible and M − l dedicated servers, and for all π ∈ Π and t ≥ 0, let D_π(t) be the number of departures under policy π by time t. For j = 1, … , N − 1, let B_j denote the size of the buffer between stations j and j + 1, and let

$T_{\pi}\lpar B_1, B_2, \ldots, B_{N - 1}\rpar = \limsup\limits_{t \rightarrow \infty}{{{\open E}[D_{\pi}\lpar t\rpar ]}\over{t}}$

be the long-run average throughput corresponding to the server assignment policy π ∈ Π. Thus, we are interested in solving the optimization problem

(1)

$\max\limits_{\pi \,\in \Pi}T_{\pi}\lpar B_1,\,B_2, \ldots, B_{N - 1}\rpar .$

For simplicity, we restrict our attention to Markovian stationary deterministic policies. Thus, for the rest of the article, Π denotes the set of Markovian stationary deterministic server assignment policies with l flexible and M − l dedicated servers.

In recent years, there has been a growing interest in queues with flexible servers. We now provide a brief overview of the literature in this area. A more complete review is given by Hopp and van Oyen [Reference Hopp and van Oyen18].

Several articles focus on flexible servers in parallel queues. For a two-class queuing system with one dedicated server, one flexible server, and no exogenous arrivals, Ahn, Duenyas, and Zhang [Reference Ahn, Duenyas and Zhang3] characterized the server assignment policy that minimizes the expected total holding cost incurred until all jobs initially present in the system have departed. Under heavy-traffic assumptions, Harrison and López [Reference Harrison and López16], Bell and Williams [Reference Bell and Williams11, Reference Bell and Williams12], and Mandelbaum and Stolyar [Reference Mandelbaum and Stolyar21] developed asymptotically optimal server assignment policies that minimize the discounted infinite-horizon holding cost for parallel queuing systems with flexible servers and external arrivals. Moreover, Squillante, Xia, Yao, and Zhang [Reference Squillante, Xia, Yao and Zhang28] used simulation to study the performance of threshold-type policies for systems that consist of parallel queues.

On the other hand, most of the articles that have considered the optimal assignment of servers to multiple interconnected queues focus on minimizing holding costs. For systems with two queues in tandem and no arrivals, Farrar [Reference Farrar13], Pandelis and Teneketzis [Reference Pandelis and Teneketsiz24], and Ahn, Duenyas, and Zhang [Reference Ahn, Duenyas and Zhang2] studied how servers should be assigned to stations to minimize the expected total holding cost incurred until all jobs leave the system. Rosberg, Varaiya, and Walrand [Reference Rosberg, Varaiya and Walrand26], Hajek [Reference Hajek15], and Ahn, Duenyas, and Lewis [Reference Ahn, Duenyas and Lewis1] studied the assignment of (service) effort to minimize holding costs in the two-station setting with Poisson arrivals. In a more recent article, Kaufman, Ahn, and Lewis [Reference Kaufman, Ahn and Lewis20] determined the workforce allocation that minimizes the long-run average holding cost in systems with two queues in tandem and Poisson arrivals, assuming that the number of available workers is dynamic. Wu, Lewis, and Veatch [Reference Wu, Lewis and Veatch33] also considered the notion of dedicated and flexible servers (which are referred to as dedicated and reconfigurable machines in their setting). Their objective was to determine the allocation of the flexible servers that minimizes the holding cost in a clearing system (without external arrivals) with two queues in tandem and in which the dedicated servers are subject to failures. However, they only considered the allocation of the flexible servers and assumed that their service rates do not depend on the station. Wu, Down, and Lewis [Reference Wu, Down and Lewis32] extended the results of Wu et al. [Reference Wu, Lewis and Veatch33] to more general serial lines with external arrivals under the discounted and average cost criteria. 
Wang, Perkins, Vakili, and Khurana [Reference Wang, Perkins, Vakili and Khurana31] considered the allocation of generalist servers with the goal of minimizing the completion time of all activities in an NPD (new product development) process. Andradóttir and Ayhan [Reference Andradóttir and Ayhan5], Andradóttir, Ayhan, and Down [Reference Andradóttir, Ayhan and Down6–Reference Andradóttir, Ayhan and Down8], and Tassiulas and Bhattacharya [Reference Tassiulas and Bhattacharya29] considered the dynamic assignment of servers to maximize the long-run average throughput in queuing networks with flexible servers. However, these articles do not focus on the case where some servers are dedicated to specific stations.

Ostalaza, McClain, and Thomas [Reference Ostalaza, McClain and Thomas23], McClain, Thomas, and Sox [Reference McClain, Thomas and Sox22], Zavadlav, McClain, and Thomas [Reference Zavadlav, McClain and Thomas34], and, more recently, Ahn and Righter [Reference Ahn and Righter4] have considered using server flexibility to achieve dynamic line balancing. More specifically, Ostalaza et al. [Reference Ostalaza, McClain and Thomas23] and McClain et al. [Reference McClain, Thomas and Sox22] studied dynamic line balancing in tandem queues with shared tasks that can be performed at either of two successive stations. This work was continued by Zavadlav et al. [Reference Zavadlav, McClain and Thomas34], who studied several server assignment policies for systems with fewer servers than stations, in which all servers trained to work at a particular station have the same capabilities at that station. Ahn and Righter [Reference Ahn and Righter4] studied how workers who are trained to do a set of consecutive tasks should be assigned dynamically to tandem stations. Bartholdi and Eisenstein [Reference Bartholdi and Eisenstein9] defined the “bucket brigades” server assignment policy for systems in which each server works at the same rate at all tasks and showed that under this policy, a stable partition of work will emerge yielding optimal throughput. Bartholdi, Eisenstein, and Foley [Reference Bartholdi, Eisenstein and Foley10] showed that the behavior of the bucket brigades policy, applied to systems with discrete tasks and exponentially distributed task times, resembles that of the same policy applied in the deterministic setting with infinitely divisible jobs.

Gurumurthi and Benjaafar [Reference Gurumurthi and Benjaafar14], Hopp, Tekin, and van Oyen [Reference Hopp, Tekin and van Oyen17], and Sheikhzadeh, Benjaafar, and Gupta [Reference Sheikhzadeh, Benjaafar and Gupta27] considered the use of specific flexibility structures on a set of existing servers to enhance the system's performance (see also Jordan and Graves [Reference Jordan and Graves19] for related work). More specifically, Gurumurthi and Benjaafar [Reference Gurumurthi and Benjaafar14] considered the modeling and analysis of flexible queuing systems. They illustrated that for systems with identical demand and service rates, a skill chaining flexibility structure yields most of the benefits of full flexibility. Similarly, Hopp et al. [Reference Hopp, Tekin and van Oyen17] pointed out that the skill chaining policy can be a robust and efficient approach for implementing workforce agility in serial production lines operating under the CONWIP release policy, and Sheikhzadeh et al. [Reference Sheikhzadeh, Benjaafar and Gupta27] showed that chained systems, under the assumption of homogeneous demand and service times, achieved most of the benefits of total pooling (which is attained by grouping the customers in a single queue and routing them to any server). Similar insights were obtained by Jordan and Graves [Reference Jordan and Graves19] in a production planning context. Unlike skill chaining, where each worker is trained to perform a small number of tasks (e.g., two tasks), in Section 7 we investigate the impact (on system throughput) of cross-training only a few workers at both tasks.

The remainder of this article is organized as follows: In Section 2 we introduce the notation used throughout the article and provide some general results. In Sections 3 and 4 we study systems with two stations in tandem, finite buffers, and M = 2 and M = 3 servers, respectively, and investigate the effect of server flexibility on system performance by varying the number of flexible servers l from 0 to M. In Section 5 we show that the throughput of the optimal policy for the finite-buffered systems considered in Sections 3 and 4 converges to the throughput of the optimal policy for the corresponding infinite-buffered systems as the (finite) buffer size becomes large. Section 6 provides examples that illustrate that the selection of the dedicated (and flexible) servers, the assignment of dedicated servers to stations, and the dynamic assignment of the flexible servers can be counterintuitive. In Section 7 we use numerical examples to investigate the effects of server flexibility on the system throughput. Section 8 contains some concluding remarks. Finally, the Appendix contains the proof of one of the lemmas in the article.

2. PRELIMINARIES

In this section we provide some general results and introduce notation and assumptions that will be used throughout the article. More specifically, in Section 2.1 we show that any nonidling server assignment policy is optimal when all servers are flexible and generalists and also consider the case where M = 1. Section 2.2 provides guidelines on how to select the flexible servers when the servers are generalists.

The following assumptions will be used in our analysis in this section:

Assumption S

For each j = 1, … , N, the service requirements S_{k,j} of job k ≥ 1 at station j are independent and identically distributed with mean 1. Moreover, for all t ≥ 0, if there is a job in service at station j at time t, then the expected remaining service requirement at station j of that job is bounded above by a scalar 1 ≤ S̄ < ∞. Finally, the service discipline is either nonpreemptive or preemptive-resume.

Assumption G

For all i = 1, … , M and j = 1, … , N, μ_ij = μ_iγ_j (and hence all servers are flexible).

2.1. Systems with Flexible, Generalist Servers

When all servers are flexible, we have the following result.

Theorem 2.1

Suppose that M ≥ 1, N ≥ 1, and Assumptions S and G hold. Then for all 0 ≤ B₁, B₂, … , B_{N−1} < ∞, any nonidling server assignment policy π is optimal, with long-run average throughput

$T_{\pi}\lpar B_1,\, B_2, \ldots, B_{N - 1}\rpar = {{\displaystyle\sum\limits_{i = 1}^M \mu_i} \over {\displaystyle\sum\limits_{j = 1}^N 1/ \gamma_j}}.$

Proof

Note that our model is equivalent to one in which the service requirements of successive jobs at station j are independent and identically distributed with mean 1/γ_j and the service rates depend only on the server (i.e., μ_ij = μ_i). Let W_{π,p}(t) be the total work performed by time t for all servers under the nonidling policy π. Then W_{π,p}(t) = t ∑_{i=1}^M μ_i. Let B = ∑_{j=1}^{N−1} B_j and let S_k = ∑_{j=1}^N S_{k,j} be the total service requirement (in the system) of job k for all k ≥ 1. Let W_π(t) = ∑_{k=1}^{D_π(t)+N+B} S_k and let W_{π,r}(t) = W_π(t) − W_{π,p}(t) be the total remaining service requirement (work) at time t for the N + B jobs starting service at station 1 after job D_π(t) starts service at station 1. From our assumptions we have

(2)

${\open E}[W_{{\rm\pi}, r}\lpar t\rpar ] \le \lpar N + B\rpar\, {\bar S} \sum\limits_{j = 1}^N {1 \over \gamma_j},$

which implies that lim_t→∞𝔼[W _π,r(t)]/t = 0 and

(3)

$\sum\limits_{i = 1}^M \mu_i = \lim\limits_{t \rightarrow \infty}{{\open E}[W_{\pi,p}\lpar t\rpar ] \over t} = \lim\limits_{t \rightarrow \infty} {{\open E}[W_{\pi}\lpar t\rpar ] \over t}.$

For all n ≥ 0, let Z_n = (S_{n,1}, … , S_{n,N}). It has been shown in [Reference Andradóttir, Ayhan and Down6] that D_π(t) + N + B is a stopping time with respect to the sequence of random variables {Z_n} and that 𝔼[D_π(t)] < ∞ for all t ≥ 0. Then from Wald's lemma, we have 𝔼[W_π(t)] = (𝔼[D_π(t)] + N + B) × ∑_{j=1}^N (1/γ_j), and hence (3) implies that

(4)

$\eqalignno{\sum\limits_{i = 1}^M \mu_i & = \lim\limits_{t \rightarrow \infty}{{\open E}[W_{\pi}\lpar t\rpar ] \over t} \cr & = \lim\limits_{t \rightarrow \infty} {{\open E}[D_{\pi}\lpar t\rpar ] \over t} \sum\limits_{j = 1}^N {1 \over \gamma_j} \cr & = T_{\pi}\lpar B_1,B_2, \ldots, B_{N - 1}\rpar \sum\limits_{j = 1}^N {1 \over \gamma_j},}$

which yields the desired throughput. The optimality of this throughput follows from (3) and (4) and the fact that W_{π,p}(t) ≤ t ∑_{i=1}^M μ_i for all t ≥ 0 and for all server assignment policies π. ■

Note that Theorem 2.1 is an extension of Theorem 2.1 of Andradóttir et al. [Reference Andradóttir, Ayhan and Down6], where we considered the special cases μ_ij = μ_i for all i = 1, … , M and j = 1, … , N and μ_ij = μ_j for all i = 1, … , M and j = 1, … , N. When there is only one server, this server is flexible (i.e., l = 1), and the conditions of Theorem 2.1 hold, Theorem 2.1 implies that any nonidling policy is optimal with throughput 1/(∑_{j=1}^N 1/μ_1j). On the other hand, if there is only one server, this server is dedicated (i.e., l = 0), and N ≥ 2, then the throughput is obviously equal to zero.
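As a quick numerical illustration of the throughput expression in Theorem 2.1, the following minimal sketch evaluates ∑_i μ_i / ∑_j (1/γ_j) for given server speeds and task difficulties. The function name `generalist_throughput` and the sample rates are illustrative assumptions, not part of the article.

```python
def generalist_throughput(mu, gamma):
    """Long-run average throughput of any nonidling policy when all servers
    are flexible generalists (mu_ij = mu_i * gamma_j), per Theorem 2.1.

    mu    -- list of server speeds mu_i (hypothetical sample input)
    gamma -- list of station factors gamma_j, all strictly positive
    """
    if not mu or not gamma:
        raise ValueError("need at least one server and one station")
    if any(g <= 0 for g in gamma):
        raise ValueError("every gamma_j must be strictly positive")
    # Total service capacity divided by the expected total work per job.
    return sum(mu) / sum(1.0 / g for g in gamma)


if __name__ == "__main__":
    # Two servers with speeds 1 and 2, two stations with gamma_j = 1.
    print(generalist_throughput([1.0, 2.0], [1.0, 1.0]))
    # Single flexible server with mu_11 = 2 and mu_12 = 1, i.e. mu_1 = 2,
    # gamma = (1, 0.5); this agrees with 1 / (1/mu_11 + 1/mu_12).
    print(generalist_throughput([2.0], [1.0, 0.5]))
```

The single-server call reproduces the special case 1/(∑_j 1/μ_1j) mentioned above, since μ_1j = μ_1γ_j.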

2.2. Systems with Both Flexible and Dedicated, Generalist Servers

In this subsection, we assume that the servers are generalists, so that μ_ij = μ_iγ_j for i = 1, … , M and j = 1, … , N. Given that l servers move, we show that under certain assumptions it is optimal to have the fastest l servers as the flexible ones.

We start by considering the case where the size of the buffers between the stations is infinite. It then immediately follows from Proposition 4 of Andradóttir et al. [Reference Andradóttir, Ayhan and Down7] that the l fastest servers should be the flexible ones. In fact, Proposition 4 of Andradóttir et al. [Reference Andradóttir, Ayhan and Down7] implies that for a general queuing network with N stations, M > l servers, probabilistic routing (so that the queues are not necessarily in tandem), general service requirements, and infinite buffers in front of all of the stations, having the l fastest servers as the flexible ones maximizes the throughput in the class of all policies with at most l flexible servers.

For the remainder of the article, we assume that N = 2 and denote the size B₁ of the buffer between the two stations by B. For all π ∈ Π, consider the stochastic process {X_π(t) : t ≥ 0}, where X_π(t) = 0 if there is a job to be processed at station 1, the number of jobs waiting to be processed between stations 1 and 2 is zero, and station 2 is starved at time t; X_π(t) = s for 1 ≤ s ≤ B + 1 if there are jobs to be processed at both stations 1 and 2 and in the buffer there are s − 1 jobs waiting to be processed at time t; finally, X_π(t) = B + 2 if station 1 is blocked (so that there is a job at station 1 whose processing at that station has been completed), B jobs are waiting to be processed in the buffer, and there is a job to be processed at station 2 at time t. Let S = {0, 1, … , B + 2} be the set of states of {X_π(t) : t ≥ 0} and for all s ∈ S, let $p_\pi \lpar s \rpar = {\rm lim}_{t \rightarrow \infty} \int_0^t {\open I} \{ X_\pi (u) = s \}du / t$ if the limit exists and equals a finite constant with probability 1 (where for two integers i, j, 𝕀{i = j} = 1 if i = j and 𝕀{i = j} = 0 otherwise).

We now consider how the flexible servers should be selected when the buffer B between the two stations is finite. Let π_{i ₁} be a server assignment policy and assume that under $\pi_{i_1}$ there is a flexible server i ₁ and a dedicated server i ₂ at station 1 such that μ_{i ₂}>μ_{i ₁}. Let $\pi_{i_2}$ be a policy having the property that the roles of servers i ₁ and i ₂ are reversed (i.e., under $\pi_{i_2}$ server i ₁ is dedicated to station 1 and server i ₂ is flexible). Similarly, assume that under policy $\pi_{i_1}^{\,\prime}$ there is a flexible server i ₁ and a dedicated server i ₂ at station 2 such that $\mu_{i_2} \gt \mu_{i_1}$ . Let $\pi_{i_2}^{\,\prime}$ be a policy such that the roles of servers i ₁ and i ₂ are reversed (i.e., under $\pi_{i_2}^{\,\prime}$ server i ₁ is dedicated to station 2 and server i ₂ is flexible). Assume that under policies $\pi_{i_1}$ , $\pi_{i_2}$ , $\pi_{i_1}^{\,\prime}$ , and $\pi_{i_2}^{\,\prime}$ , the flexible servers never idle and the dedicated servers work whenever they can. The following proposition provides sufficient conditions under which it is desirable to reverse the roles of servers i ₁ and i ₂ in these two cases.

Proposition 2.1

Suppose that N = 2, Assumptions S and G hold, B < ∞, π_{i₁}, π_{i₂}, π′_{i₁}, π′_{i₂} ∈ Π, and $p_{\pi_{i_1}}$ (s), $p_{\pi_{i_2}}$ (s), $p_{\pi^{\,\prime}_{i_1}} (s)$ , and $p_{\pi^{\,\prime}_{i_2}} (s)$ exist for s ∈ {0, B + 2}.

(i) Let 𝒟₁ and 𝒟₂ denote the sets of dedicated servers at station 1 and station 2, respectively, under policy π_{i₁}. We then have $T_{\pi_{i_2}} \lpar B \rpar \gt T_{\pi_{i_1}} \lpar B \rpar$ if and only if
(5)
$\eqalignno{&\lpar \mu_{i_{2}} - \mu_{i_{1}}\rpar p_{\pi_{i_{2}}}\lpar B +2\rpar + [\,p_{\pi_{i_{1}}} \lpar B +2\rpar - p_{\pi_{i_{2}}}\lpar B + 2\rpar ] \sum\limits_{k \in {\cal D}_{1}} \mu_k \cr & \quad + [p_{\pi_{i_{1}}}\lpar 0\rpar - p_{\pi_{i_{2}}}\lpar 0\rpar ] \sum\limits_{k \in {\cal D}_{2}} \mu_k \gt 0.}$
(ii) Let 𝒟′₁ and 𝒟′₂ denote the sets of dedicated servers at station 1 and station 2, respectively, under policy $\pi_{i_1}^{\,\prime}$ . We then have $T_{\pi^{\,\prime}_{i_2}} \lpar B \rpar \gt T_{\pi^{\,\prime}_{i_1}} \lpar B \rpar$ if and only if
(6)
$\eqalignno{&\lpar \mu_{i_{2}} - \mu_{i_{1}}\rpar p_{\pi^{\,\prime}_{i_2}}\lpar 0\rpar + [p_{\pi^{\,\prime}_{i_1}}\lpar B + 2\rpar - p_{\pi^{\,\prime}_{i_2}}\lpar B + 2\rpar ] \sum\limits_{k \in {\cal D}\,_1^{\prime}} \mu_k \cr & \quad + [\,p_{\pi^{\,\prime}_{i_1}} \lpar 0\rpar - p_{{\pi^{\,\prime}_{i_2}}}\lpar 0\rpar ] \sum\limits_{k \in {\cal D}\,_2^{\prime}} \mu_k \gt 0.}$

Proof

(i) Let W _π,p(t) and W _π,r(t) be as defined in the proof of Theorem 2.1 and let ℱ denote the set of flexible servers under policy π_{i ₁}. Equation (2) implies that
$\lim\nolimits_{t \rightarrow \infty}{W_{\pi_{i_{1}},r}\lpar t\rpar \over t} = \lim\nolimits_{t \rightarrow \infty}{W_{\pi_{i_{2}},r}\lpar t\rpar \over t} = 0.$
Consequently, it is clear that
$T_{\pi_{i_{2}}}\lpar B\rpar - T_{\pi_{i_{1}}}\lpar B\rpar = \lim\limits_{t \rightarrow \infty} \left[{W_{\pi_{i_{2}}\!,p}\lpar t\rpar \over t} - {W_{\pi_{i_{1}}\!,p}\lpar t\rpar \over t} \right],$
and it suffices to show that the right-hand side is equal to the expression in (5). We have
$\eqalign{&\lim\limits_{t \rightarrow \infty} \left[{W_{\pi_{i_{2}}\!,p}\lpar t\rpar \over t} - {W_{\pi_{i_{1}}\!\!,\!\,p}\lpar t\rpar \over t} \right] \cr & = \left[\sum\limits_{k \in{\cal F}} \mu_ k + \mu_{i_{2}} - \mu_{i_{1}} \right. \left.+ [1 - p_{\pi_{i_{2}}}\lpar B + 2\rpar ] \left( \sum\limits_{k \in {\cal D}_{1}} \mu_ k + \mu_{i_{1}} - \mu_{i_{2}} \right) \right. \cr & \quad + \left.[1 - p_{\pi_{i_{2}}}\lpar 0\rpar ] \sum\limits_{k \in {\cal D}_{2}} \mu_ k \right] - \left[\sum\limits_{k \in{\cal F}} \mu_ k + [1 - p_{\pi_{i_{1}}}\lpar B + 2\rpar ] \right. \cr & \quad \times \sum\limits_{k \in{\cal D}_{1}} \mu_ k + \left. [1 - p_{\pi_{i_{1}}}\lpar 0\rpar ] \sum\limits_{k \in{\cal D}_{2}} \mu_ k \right] \cr & = \lpar \mu_{i_{2}} - \mu_{i_{1}}\rpar p_{\pi_{i_{2}}}\lpar B + 2\rpar + [p_{\pi_{i_{1}}}\lpar B + 2\rpar - p_{\pi_{i_{2}}}\lpar B + 2\rpar ] \cr &\quad \times \sum\limits_{k \in {\cal D}_1} \mu_ k + [p_{\pi_{i_{1}}} \lpar 0\rpar - p_{\pi_{i_{2}}}\lpar 0\rpar ] \sum\limits_{k \in {\cal D}_2} \mu_ k,}$
which completes the proof of (i). The proof of (ii) is similar and is omitted. ■

Since Proposition 2.1 can be used to compare any two policies of the form π_{i₁} and π_{i₂} (or π′_{i₁} and π′_{i₂}), it is possible to use it recursively (if the conditions in (5) or (6) hold each time) to show that the best nonidling policy should have the l fastest servers as the flexible ones; in this sense, Proposition 2.1 provides sufficient conditions for that structure. When the service requirements are exponentially distributed, even though the conditions in (5) and (6) might be satisfied for some π_{i₁} (π_{i₂}) and π′_{i₁} (π′_{i₂}), respectively, one can also construct examples with dedicated servers at both stations for which these conditions are violated. For example, consider a system with B = 0 and γ₁ = γ₂ = 1. Let π_{i₁} be the policy that has two servers with rates 0.9 and 0.1 dedicated to station 1, three servers with rates 0.7, 0.7, and 0.6 dedicated to station 2, and two flexible servers with rates 0.8 and 0.2. The flexible servers work at station 1 in state 0 and at station 2 in states 1 and 2. Let the policy π_{i₂} be identical to policy π_{i₁} except that the server with rate 0.2 is now dedicated to station 1 and the server with rate 0.9 is now one of the flexible servers. One can easily compute that $1.4118 \simeq T_{\pi_{i_1}} (B) > T_{\pi_{i_2}} (B) \simeq 1.3647$ . We now show that when all of the dedicated servers are at the same station and the service requirements are exponentially distributed, the optimal policy with l flexible and M − l dedicated servers should have the l fastest servers as the flexible ones. 
Similarly, Proposition 4.2 and Remark 4.1 in Section 4 state that when the service requirements are exponentially distributed, all available servers have been assigned to three teams, and it is of interest to dedicate one team to each station and have one team of flexible servers (who move together between the stations), then the optimal policy should have the service rate of the team of flexible servers larger than the service rates of the two teams of dedicated servers.
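The throughput values in the example with B = 0 can be checked numerically. The sketch below assumes exponential service requirements, so that the state process is a birth-death chain on {0, … , B + 2}, and assumes that the flexible servers work at station 1 only in state 0 (when station 2 is starved) and at station 2 in states 1 and 2; this is the assignment under which the sketch reproduces the quoted values. The function name `tandem_throughput` and its parameters are illustrative, not taken from the article.

```python
def tandem_throughput(d1, d2, flex, station1_states, B=0):
    """Throughput of a two-station line with pooled dedicated rates d1, d2
    and pooled flexible rate flex (gamma_1 = gamma_2 = 1 assumed).

    station1_states -- set of states in which the flexible servers work at
                       station 1; in all other states they work at station 2.
    """
    n_states = B + 3  # states 0, 1, ..., B + 2
    # births[s]: rate of s -> s + 1 (a station 1 completion), s = 0..B+1
    births = [d1 + (flex if s in station1_states else 0.0)
              for s in range(n_states - 1)]
    # deaths[k]: rate of (k + 1) -> k (a station 2 completion), k = 0..B+1
    deaths = [d2 + (0.0 if s in station1_states else flex)
              for s in range(1, n_states)]
    # Unnormalized stationary weights from the birth-death product form.
    weights = [1.0]
    for lam, mu in zip(births, deaths):
        weights.append(weights[-1] * lam / mu)
    total = sum(weights)
    probs = [w / total for w in weights]
    # Departures occur at the death rate in every state s >= 1.
    return sum(p * mu for p, mu in zip(probs[1:], deaths))


# Policy pi_{i1}: rates 0.9, 0.1 dedicated to station 1; 0.8, 0.2 flexible.
t1 = tandem_throughput(0.9 + 0.1, 0.7 + 0.7 + 0.6, 0.8 + 0.2, {0})
# Policy pi_{i2}: rates 0.2, 0.1 dedicated to station 1; 0.8, 0.9 flexible.
t2 = tandem_throughput(0.2 + 0.1, 0.7 + 0.7 + 0.6, 0.8 + 0.9, {0})

if __name__ == "__main__":
    print(round(t1, 4), round(t2, 4))
```

Under these assumptions the first policy, which keeps the faster server dedicated, gives the larger throughput, so making the fastest servers flexible is not always best when dedicated servers are present at both stations.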

We start by showing that the conditions in (5) and (6) simplify significantly when all the dedicated servers (assuming that the set of dedicated servers is nonempty) are at the same station. Consider policies π_{i ₁} and π′_{i ₁} that are nonidling to the extent possible, and suppose that π_{i ₁} is a policy with all dedicated servers at station 1 and that π′_{i ₁} is a policy with all dedicated servers at station 2. For both π_{i ₁} and π′_{i ₁}, assume that there is a flexible server i ₁ and a dedicated server i ₂ such that μ_{i ₂} > μ_{i ₁}. Under policies π_{i ₂} and π′_{i ₂}, the roles of servers i ₁ and i ₂ are reversed (as described earlier). The following corollary follows directly from Proposition 2.1.

Corollary 2.1

(i) If 𝒟₂ = ∅ and $p_{\pi_{i_1}}$ (B + 2) ≥ $p_{\pi_{i_2}}$ (B + 2) > 0, then $T_{\pi_{i_2}}$ (B) > $T_{\pi_{i_1}}$ (B).
(ii) If 𝒟′₁ = ∅ and $p_{\pi^{\,\prime}_{i_1}} (0) \geq p_{\pi^{\,\prime}_{i_2}} (0) \gt 0$ , then $T_{\pi^{\,\prime}_{i_2}} (B) \gt T_{\pi^{\,\prime}_{i_1}} (B)$ .

For the remainder of the article we make the following assumption:

Assumption E

The service requirements of jobs at both stations are independent and exponentially distributed with rate 1.

The next lemma shows that if the service requirements are exponentially distributed, then the assumptions of Corollary 2.1 hold.

Lemma 2.1

Suppose that N = 2, l < M, Assumptions G and E hold, B < ∞, and π_{i₁}, π_{i₂}, π′_{i₁}, π′_{i₂} ∈ Π. Then $p_{\pi_{i_1}}$ (B + 2) ≥ $p_{\pi_{i_2}}$ (B + 2) > 0 and $p_{\pi^{\,\prime}_{i_1}}(0)$ ≥ $p_{\pi^{\,\prime}_{i_2}}$ (0) > 0.

Proof

It is clear that the stochastic processes $\{X_{\pi_{i_1}} \lpar t \rpar\}$ and $\{X_{\pi_{i_2}} \lpar t \rpar \}$ are birth–death processes with state space S. Note that under π_{i ₁} and π_{i ₂}, there exists a state s ₀ ∈ S such that {s ₀, … , B + 2} form a recurrent set of states and that states 0, … , s ₀ − 1 are transient. Consequently, we have $p_{\pi_{i_2}}(B+2)\gt0$ . For all s ∈ S and π ∈ Π, let λ_π (s) and γ_π (s) be the birth and death rates in state s under policy π. Then we have

$\lambda_{\pi_{i_{1}}}\lpar s\rpar \ge \lambda_{\pi_{i_{2}}}\lpar s\rpar \quad \hbox{and} \quad \gamma_{\pi_{i_{1}}}\lpar s\rpar \le \gamma_{\pi_{i_{2}}}\lpar s\rpar \quad \hbox{for all}\ s \in S.$

For all s ∈ {s ₀ , … , B + 2}, we have

$p_{\pi_{i_{1}}}\lpar s\rpar = p_{\pi_{i_{1}}}\lpar s_0\rpar \prod\limits_{j = s_0}^{s - 1} {\lambda_{\pi_{i_{1}}}\lpar\, j\rpar \over \gamma_{\pi_{i_{1}}}\lpar\, j + 1\rpar }$

and

$p_{\pi_{i_{2}}}\lpar s\rpar = p_{\pi_{i_{2}}}\lpar s_0\rpar \prod\limits_{j = s_0}^{s - 1} {\lambda_{\pi_{i_{2}}}\lpar\, j\rpar \over \gamma_{\pi_{i_{2}}}\lpar\, j + 1\rpar }.$

Note that there must exist an s ∈ {s₀, … , B + 2} such that $p_{\pi_{i_1}}(s)\ge p_{\pi_{i_2}}(s)$ , because otherwise we have 1 = ∑_{s=s₀}^{B+2} $p_{\pi_{i_1}}$ (s) < ∑_{s=s₀}^{B+2} $p_{\pi_{i_2}}$ (s) = 1. The fact that $\lambda_{\pi_{i_1}}$ (i)/ $\gamma_{\pi_{i_1}}$ (i + 1) ≥ $\lambda_{\pi_{i_2}}$ (i)/ $\gamma_{\pi_{i_2}}$ (i + 1) for i = s₀, … , B + 1 now implies that $p_{\pi_{i_1}}$ (B + 2) ≥ $p_{\pi_{i_2}}$ (B + 2). The proof that $p_{\pi^{\,\prime}_{i_1}} (0) \geq p_{\pi^{\,\prime}_{i_2}} (0)$ is similar and is omitted. ■

The following proposition follows immediately from Corollary 2.1 and Lemma 2.1 which are given above and Proposition 2.1 of Andradóttir and Ayhan [Reference Andradóttir and Ayhan5], which shows that the optimal policy should be nonidling to the extent possible for systems with exponentially distributed service requirements.

Proposition 2.2

Suppose that N = 2, l < M, B < ∞, and Assumptions G and E hold. If all of the dedicated servers are at the same station, then the optimal policy should have the l fastest servers as the flexible servers.

3. SYSTEMS WITH TWO SERVERS

In this section we consider the assignment of M = 2 servers (with arbitrary service rates) to N = 2 stations when 0 ≤ l ≤ M servers are flexible. In particular, Section 3.1 considers systems where both servers are dedicated. In Section 3.2 we study systems with one dedicated and one flexible server. Finally, Section 3.3 provides the optimal server assignment policy when both servers are flexible.

3.1. Systems with Two Dedicated Servers

When both servers are dedicated, it is clear that the dedicated servers should be assigned to different stations because otherwise the long-run average throughput would be zero. Our objective is to determine which station each server should be assigned to in order to maximize the long-run average throughput. We have only two policies to consider. In particular, let π₁ be the policy that assigns server 1 to station 1 and server 2 to station 2, and let π₂ be the policy that assigns server 2 to station 1 and server 1 to station 2. Define

$a_1 = \min\{\mu_{11}, \mu_{22}\}, \qquad a_2 = \min\{\mu_{21}, \mu_{12}\}$

and

$\rho_1 = {a_1 \over \max \{\mu_{11},\mu_{22}\}}\, \le\, 1, \qquad \rho_2 = {a_2 \over \max\{\mu_{21},\mu_{12}\}}\, \le\,1.$

Then one can verify that

(7)

$\eqalignno{T_{\pi_{i}}\lpar B\rpar & = \lpar {\open I}\{i = 1\} \mu_{22} + {\open I}\{i = 2\} \mu_{12}\rpar \sum\limits_{s = 1}^{B +2} p_{\pi_{i}}\lpar s\rpar \cr & = \left\{\matrix{a_i {1 - \rho_i^{B + 2} \over 1 - \rho_i^{B + 3}} &\ {\rm if}\ \rho_i \lt 1 \cr a_i {B + 2 \over B + 3} & {\rm if}\ \rho_i = 1,}\right.}$

where (in this case) {p _{π_i}(s) : s = 0, … , B + 2} is the steady-state distribution of the number of customers in an M/M/1/(B + 2) queuing system. The next proposition compares the throughputs of these policies under certain assumptions.

Proposition 3.1

Suppose that Assumption E holds and B < ∞. If ρ₂ < ρ₁ and there exists B ₀ such that T _π₁(B ₀) ≥ T _π₂(B ₀), then T _π₁(B) > T _π₂(B) for all B > B ₀.

Proof

If a ₂ = 0, then T _π₁(B) > T _π₂(B) = 0 for all B ≥ 0. When a ₂ > 0, we will show that T _π₁(B ₀ + 1) > T _π₂(B ₀ + 1). First suppose that ρ₁ < 1. Since T _π₁(B ₀) ≥ T _π₂(B ₀), we have

$\eqalignno{T_{\pi_{1}}\lpar B_{0}+1\rpar & = a_{1} {1-\rho _{1}^{B_{0}+3} \over 1-\rho _{1}^{B_{0}+4}}\, \ge a_{2} \left( {1-\rho _{2}^{B_{0}+2} \over 1-\rho _{2}^{B_{0}+3}}\right) \cr & \quad \times \left( {1-\rho _{1}^{B_{0}+2} \over 1-\rho _{1}^{B_{0}+3}}\right) ^{-1} \left( {1-\rho _{1}^{B_{0}+3} \over 1-\rho _{1}^{B_{0}+4}}\right) \cr & = a_{2} \left( {1-\rho _{2}^{B_{0}+3} \over 1-\rho _{2}^{B_{0}+4}}\right) {f\lpar \rho_1, B_0\rpar \over f\lpar \rho_2, B_0\rpar } \cr & =T_{\pi_{2}}\lpar B_{0}+1\rpar {f\lpar \rho_1, B_0\rpar \over f\lpar \rho_2, B_0\rpar },}$

where

$f \lpar \rho, B\rpar = {\lpar 1-\rho ^{B+3}\rpar ^{2} \over \lpar 1-\rho ^{B+2}\rpar \lpar 1-\rho ^{B+4}\rpar }.$

It suffices to show that f is strictly increasing in 0 ≤ ρ < 1. With some algebra, we have

$\eqalignno{{d \over d \rho} f\lpar \rho, B\rpar & = {\rho\,^{B+1}\lpar 1-\rho\,^{B+3}\rpar \lpar 1-\rho\rpar \over \lpar 1 - \rho\,^{B+2}\rpar ^2 \lpar 1-\rho\,^{B+4}\rpar ^{2}} \Big( \lpar B + 2\rpar \lpar 1 - \rho ^{B+4}\rpar - \lpar B+4\rpar \rho \lpar 1-\rho ^{B+2}\rpar \Big) \cr & = {\rho\,^{B+1}\lpar 1-\rho\,^{B+3}\rpar \lpar 1-\rho\rpar ^2 \over \lpar 1 - \rho\,^{B+2}\rpar ^2 \lpar 1-\rho\,^{B+4}\rpar ^{2}} \left( \lpar B + 2\rpar \lpar 1+\rho^{B + 3}\rpar - 2 \sum\limits^{B + 2}_{i = 1}\rho^i \right) .}$

Define g(x) = ρ^x for all x ∈ ℝ. Clearly, g(x) is a strictly convex function for 0 < ρ < 1. Then for all i = 1, … , B + 2,

$g\lpar i\rpar \lt {B+3 - i \over B + 3} g\lpar 0\rpar + {i \over B + 3}g \lpar B + 3\rpar.$

Hence,

(8)

$\eqalignno{{d \over d \rho} f\lpar \rho, B\rpar & \gt {\rho\,^{B+1}\lpar 1-\rho\,^{B+3}\rpar \lpar 1-\rho\rpar ^2 \over \lpar 1 - \rho\,^{B+2}\rpar ^2 \lpar 1-\rho\,^{B+4}\rpar ^{2}} \times \left( \lpar B + 2\rpar \lpar g\lpar 0\rpar + g\lpar B + 3\rpar \rpar \right. \cr & \quad - \left.2\sum\limits^{B + 2}_{i = 1}\left( {B + 3-i \over B + 3}g\lpar 0\rpar + {i \over B + 3} g \lpar B + 3\rpar \right) \right) \cr & = {\rho\,^{B+1}\lpar 1-\rho\,^{B+3}\rpar \lpar 1-\rho\rpar ^2 \over \lpar 1 - \rho\,^{B+2}\rpar ^2 \lpar 1-\rho\,^{B+4}\rpar ^{2}} \lpar \lpar B + 2\rpar \lpar g\lpar 0\rpar + g \lpar B + 3\rpar \rpar \cr & \quad - \lpar B + 2\rpar g\lpar 0\rpar - \lpar B + 2\rpar g\lpar B + 3\rpar \rpar \cr & = 0,}$

which completes the proof for the case ρ₁ < 1. If ρ₁ = 1, then T _π₁(B ₀) ≥ T _π₂(B ₀) yields

$\eqalignno{T_{\pi_{1}}\lpar B_{0}+1\rpar& = a_{1} {B_{0}+3 \over B_{0}+4} \ge a_{2} \left( {1 - \rho_2^{B_{0} + 2} \over 1 - \rho_2^{B_{0} + 3}}\right) \cr &\quad \times \left( {B_0 + 2 \over B_0 + 3}\right) ^{-1} \left( {B_0 + 3 \over B_0 + 4}\right) \cr & = T_{\pi_{2}} \lpar B_{0} + 1\rpar {\lim_{\rho\uparrow 1} f\lpar \rho, B_0\rpar \over f\lpar \, \rho_2, B_0\rpar }}$

The result now follows from (8).■

The next proposition provides conditions that determine which of the throughputs of the policies π₁ and π₂ is larger for all buffer sizes B.

Proposition 3.2

Suppose that Assumption E holds and B < ∞. If ρ₂ < ρ₁ and a ₁ ≤ a ₂, then T _π₁(B) < T _π₂(B) for all 0 ≤ B < ∞.

Proof

First assume that ρ₂ < ρ₁ and a ₁ = a ₂. Then it suffices to show that

$h\lpar \rho \rpar = {1-\rho\, ^{B+2} \over 1-\rho\, ^{B+3}}$

is a strictly decreasing function of ρ. We have

${d \over d \rho} h\lpar \rho\rpar = {\rho\,^{B + 1} \lpar \!\! - \!\lpar B + 2\rpar g\lpar 0\rpar + \lpar B + 3\rpar g\lpar 1\rpar - g\lpar B + 3\rpar \rpar \over \lpar 1 - \rho\,^{B + 3}\rpar ^{2}},$

where the function g is as defined in the proof of Proposition 3.1. Recall that g is a strictly convex function. Hence,

$g\lpar 1\rpar \lt {B + 2 \over B + 3}g \lpar 0\rpar + {1 \over B + 3}g\lpar B + 3\rpar \quad \hbox{and} \quad {d \over d \rho} h\lpar \rho \rpar \lt 0.$

Now assume that a ₁ < a ₂. Suppose that there exists a B ₀ such that T _π₁(B ₀) ≥ T _π₂(B ₀). Then it follows from Proposition 3.1 that T _π₁(B) > T _π₂(B) for all B > B ₀ and

$a_{1} = \mathop{\lim}\limits_{B\to \infty} T_{\pi_{1}}\lpar B\rpar \ge \mathop{\lim}\limits_{B\to \infty} T_{\pi_ {2}}\lpar B\rpar = a_{2},$

which contradicts the fact that a ₁ < a ₂.■

Note that it follows from (7) that if ρ₁ = ρ₂, then T _π₁(B) ≤ T _π₂(B) for all B ≥ 0 if and only if a ₁ ≤ a ₂. On the other hand, when ρ₁ ≠ ρ₂, Propositions 3.1 and 3.2 indicate that the policy with the larger ratio of the service rates (ρ) is only guaranteed to have the larger throughput if it also has the larger minimum service rate and the buffer size B is sufficiently large. Now consider a system with two servers and service rates μ₁₁ = 2.5, μ₁₂ = 4, μ₂₁ = 3, and μ₂₂ = 7. Then, for B = 0, the policy that assigns server 1 to station 1 and server 2 to station 2 has a higher throughput than the policy that assigns server 2 to station 1 and server 1 to station 2, even though the latter policy has both the more balanced rates (ρ₂ > ρ₁) and the larger minimum rate (a ₂ > a ₁). This example shows that neither balancing the service rates at the two stations nor maximizing the minimum of the rates at the two stations is always optimal.
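The example above can be reproduced directly from the throughput expression (7). The following is a minimal sketch, not part of the original analysis; the function names `dedicated_throughput`, `t_pi1`, and `t_pi2` are hypothetical and introduced only for illustration.

```python
def dedicated_throughput(rate_station1, rate_station2, B):
    """Throughput from (7) for one dedicated server per station and buffer size B."""
    a = min(rate_station1, rate_station2)
    rho = a / max(rate_station1, rate_station2)
    if rho == 1:
        return a * (B + 2) / (B + 3)
    return a * (1 - rho ** (B + 2)) / (1 - rho ** (B + 3))


# Service rates from the example: mu[i][j] is the rate of server i at station j.
mu = {1: {1: 2.5, 2: 4.0}, 2: {1: 3.0, 2: 7.0}}


def t_pi1(B):
    """Policy pi_1: server 1 at station 1, server 2 at station 2."""
    return dedicated_throughput(mu[1][1], mu[2][2], B)


def t_pi2(B):
    """Policy pi_2: server 2 at station 1, server 1 at station 2."""
    return dedicated_throughput(mu[2][1], mu[1][2], B)


if __name__ == "__main__":
    for B in range(4):
        print(B, round(t_pi1(B), 4), round(t_pi2(B), 4))
```

With these rates, the sketch gives π₁ the higher throughput at B = 0, whereas π₂ (which has the larger minimum rate) is ahead from B = 1 onward, consistent with the limits a ₁ = 2.5 and a ₂ = 3 as B grows.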

Remark 3.1

Suppose that we want to assign M > 2 dedicated servers to the two stations. Then one can consider all possible ways of grouping these servers into two teams and use (7) to compare the throughputs of the resulting policies and hence to determine the optimal assignment. Moreover, Propositions 3.1 and 3.2 provide structural results about how these teams should be assigned to the stations in an optimal fashion.

Now, suppose that μ_ij = μ_iγ_j > 0 for all i = 1, 2 and j = 1, 2. The next proposition states that the faster server should be assigned to the slower station. Thus, when the dedicated servers are generalists, one would like to balance the service rates at the two stations (this is not correct in general, as was shown earlier).

Proposition 3.3

Suppose that Assumption E holds and B < ∞. If γ₁ ≤ γ₂ and μ₁ ≤ μ₂, then T _π₂(B) ≥ T _π₁(B) for all B ≥ 0.

Proof

Note that if γ₁ = γ₂ and μ₁ = μ₂, or γ₁ = γ₂ and μ₁ < μ₂, or γ₁ < γ₂ and μ₁ = μ₂, then T _π₁(B) = T _π₂(B) for all B ≥ 0 because in these cases a ₁ = a ₂ and ρ₁ = ρ₂ (see (7)). Thus, we assume that γ₁ < γ₂ and μ₁ < μ₂. Then

$\rho _{1} = {\gamma _{1} \mu _{1} \over \gamma _{2}\mu _{2}} \quad \hbox{and} \quad \rho_{2} = {\min \{\gamma _{1}\mu _{2}, \gamma _{2}\mu _{1}\} \over \max\{\gamma _{1}\mu _{2}, \gamma _{2}\mu _{1}\}}.$

Since γ₁μ₁ < min{γ₁μ₂, γ₂μ₁} and γ₂μ₂ > max{γ₁μ₂, γ₂μ₁}, we have that ρ₁ < ρ₂. Then we know from Proposition 3.1 that it suffices to show that T _π₂(0) > T _π₁(0). With some algebra, we have

$\eqalignno{T_{\pi_{2}}\lpar 0\rpar - T_{\pi_{1}}\lpar 0\rpar& = \gamma _{1}\gamma _{2}\mu _{1}\mu _{2}\lpar \gamma _{2} - \gamma _{1}\rpar \lpar \mu_{2}-\mu_{1}\rpar [\gamma _{1}\gamma _{2}\lpar \mu_{1}^{2} + \mu _{2}^{2}\rpar &\cr &\quad+ \mu _{1}\mu _{2}\lpar \gamma _{1}\gamma _{2}+\gamma\, _{1}^{2}+\gamma\, _{2}^{2}\rpar ] [\lpar \gamma\, _{2}^{2}\mu _{2}^{2} +\gamma _{1}\mu _{1}\gamma _{2}\mu _{2} +\gamma\, _{1}^{2}\mu _{1}^{2}\rpar \cr &\quad \times\lpar \gamma\, _{1}^{2}\mu _{2}^{2} + \gamma _{1}\mu _{1}\gamma _{2}\mu _{2}+\gamma\, _{2}^{2}\mu _{1}^{2}\rpar ]^{-1} \gt 0,}$

regardless of whether ρ₂ < 1 or ρ₂ = 1. This completes the proof.■
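As a sanity check on the claim that the faster server belongs at the slower station when the servers are generalists, the comparison can be run over a small grid of rates using (7) with exact rational arithmetic. This is an illustrative sketch only; `dedicated_throughput` and `faster_server_at_slower_station_wins` are hypothetical names, and the grid is an arbitrary choice.

```python
from fractions import Fraction
from itertools import product


def dedicated_throughput(rate_station1, rate_station2, B):
    """Throughput from (7), evaluated exactly with rational numbers."""
    r1, r2 = Fraction(rate_station1), Fraction(rate_station2)
    a = min(r1, r2)
    rho = a / max(r1, r2)
    if rho == 1:
        return a * Fraction(B + 2, B + 3)
    return a * (1 - rho ** (B + 2)) / (1 - rho ** (B + 3))


def faster_server_at_slower_station_wins(mu1, mu2, gamma1, gamma2, B):
    """Assumes mu1 <= mu2 and gamma1 <= gamma2, with mu_ij = mu_i * gamma_j.

    pi_1 puts server 1 at station 1; pi_2 puts the faster server 2 at the
    slower station 1. Returns True when pi_2 does at least as well as pi_1.
    """
    t1 = dedicated_throughput(mu1 * gamma1, mu2 * gamma2, B)
    t2 = dedicated_throughput(mu2 * gamma1, mu1 * gamma2, B)
    return t2 >= t1


if __name__ == "__main__":
    grid = [1, 2, 3, 5]
    ok = all(
        faster_server_at_slower_station_wins(m1, m2, g1, g2, B)
        for m1, m2, g1, g2, B in product(grid, grid, grid, grid, range(6))
        if m1 <= m2 and g1 <= g2
    )
    print("pi_2 never loses on the grid:", ok)
```

A grid check of this kind does not replace the proof; it only guards against transcription slips in the rates or in (7).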

3.2. Systems with One Dedicated and One Flexible Server

In this subsection we assume that M = 2 and l = 1. First, we specify the optimal policy when the dedicated and flexible servers are known. The following result follows from Theorem 4.1 of Andradóttir et al. [Reference Andradóttir, Ayhan and Down6] by setting the rate of the dedicated server at the station the server is not assigned to equal to zero.

Proposition 3.4

Suppose that Assumption E holds and B < ∞.

(i) If the dedicated server is at station 1, then the policy that assigns the flexible server to station 2 unless station 2 is starved and assigns the flexible server to station 1 when station 2 is starved is optimal. Moreover, this is the unique optimal policy in the set of Markovian stationary deterministic policies if the optimal throughput is positive.
(ii) If the dedicated server is at station 2, then the policy that assigns the flexible server to station 1 unless station 1 is blocked and assigns the flexible server to station 2 when station 1 is blocked is optimal. Moreover, this is the unique optimal policy in the set of Markovian stationary deterministic policies if the optimal throughput is positive.

With respect to the question of determining which server should be the dedicated one and which server should be the flexible one, we know from Proposition 3.4 that we need to consider only four policies. The throughput expression (g ₀) given in the proof of Theorem 4.1 of [Reference Andradóttir, Ayhan and Down6] (with the rate of the dedicated server at the station the server is not assigned to set equal to zero) can now be used to compare the throughputs of the resulting four policies and to determine which one is optimal.

Now, assume that μ_ij = μ_iγ_j > 0, for all i ∈ {1, 2}, and j ∈ {1, 2}, and that Assumption E holds. Then from Proposition 2.2, we know that the faster server should be the flexible one. Thus, in order to specify the optimal policy, it suffices to determine the station to which the dedicated (slower) server is assigned. The next proposition, which completely characterizes the optimal policy when the servers are generalists, states that the slower (dedicated) server should be assigned to the slower station. Note that the optimal choice of the flexible server and the assignment of the dedicated server are only unique when the rates μ₁ and μ₂ of the two servers and the rates γ₁ and γ₂ of the two stations are both different. The intuition behind the optimal policy described in Proposition 3.5 is to keep the faster server busy at all times (which is obtained by having the faster server as the flexible one) and to keep the slower server as busy as possible (which is obtained by assigning the slower server to the slower station).

Proposition 3.5

Suppose that Assumption E holds and B < ∞. If μ_ij = μ_iγ_j for all i ∈ {1, 2} and j ∈ {1, 2}, then the optimal policy is nonidling to the extent possible, has server arg min{i : μ_i} dedicated to station arg min{j : γ_j}, and the flexible server arg max{i : μ_i} is assigned to work at station arg max{j : γ_j} unless station arg max{j : γ_j} is blocked or starved, in which case the server works at station arg min{j : γ_j}.

Proof

We only consider the case with μ₁ ≤ μ₂ and γ₁ ≤ γ₂ since the proofs of the other cases are similar. Let π₁ (π₂) be the policy that has server 2 as the flexible one and server 1 dedicated to station 1 (station 2). Then it follows from the throughput expression g ₀ given in the proof of Theorem 4.1 of [Reference Andradóttir, Ayhan and Down6] that for all B ≥ 0,

$T_{\pi_{1}}\lpar B\rpar - T_{\pi_{2}}\lpar B\rpar = {\Gamma_{1} \over \Gamma_{2}} \ge 0,$

where

$\eqalign{ \Gamma_{1} &= \mu_{2}\mu_{1}^{B+2} \gamma _{1}\gamma _{2}\lpar \mu_{1}+\mu_{2}\rpar \lpar \gamma_{2}-\gamma_ {1}\rpar \sum\limits_{k=0}^{B+1} \mu_{2}^{B+1-k} \lpar \mu_{1}\gamma_ {1}\gamma _{2}\rpar ^{k} \cr & \quad \times \sum\limits_{j=0}^{2B+2-2k} \gamma_{1}^{\,j} \gamma_{2}^{\,2B+2-2k-j}}$

and

$\eqalignno{\Gamma_{2} &= \left[\mu_{2} \gamma_{1} \sum\limits_{j=0}^{B+1} \lpar \mu_{1}\gamma_ {1}\rpar ^{j} \lpar \mu_{2}\gamma_{2}\rpar ^{B+1-j} + \sum\limits_{j=0}^{B+2} \lpar \mu _{1}\gamma _{1}\rpar ^{\,j}\lpar \mu _{2}\gamma _{2}\rpar ^{B+2-j}\right] \cr &\quad \times \left[\mu _{2}\gamma_{2} \sum\limits_{j=0}^{B+1} \lpar \mu _{2}\gamma_ {1}\rpar ^{j}\lpar \mu_ {1}\gamma _{2}\rpar ^{B+1-j} + \sum\limits_{j=0}^{B+2} \lpar \mu _{2}\gamma _{1}\rpar ^{j}\lpar \mu _{1}\gamma _{2}\rpar ^{B+2-j}\right].}$

■
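The comparison in this proof can also be checked numerically without the closed-form expression. The sketch below makes two modeling assumptions that are not spelled out in this section: the state s ∈ {0, … , B + 2} counts the jobs that have completed service at station 1 but not at station 2, and servers working at the same station combine their rates additively. The function names are hypothetical.

```python
from fractions import Fraction


def stationary(births, deaths):
    """Stationary distribution of a birth-death chain on states 0..n.

    births[s] is the rate from s to s + 1; deaths[s] is the rate from s + 1 to s.
    """
    weights = [Fraction(1)]
    for b, d in zip(births, deaths):
        weights.append(weights[-1] * Fraction(b) / Fraction(d))
    total = sum(weights)
    return [w / total for w in weights]


def throughput_pi1(mu1, mu2, g1, g2, B):
    """Server 1 dedicated to station 1; flexible server 2 at station 2 unless starved."""
    n = B + 2
    births = [(mu1 + mu2) * g1] + [mu1 * g1] * (n - 1)  # both servers at station 1 in state 0
    deaths = [mu2 * g2] * n
    p = stationary(births, deaths)
    return mu2 * g2 * (1 - p[0])


def throughput_pi2(mu1, mu2, g1, g2, B):
    """Server 1 dedicated to station 2; flexible server 2 at station 1 unless blocked."""
    n = B + 2
    births = [mu2 * g1] * n
    deaths = [mu1 * g2] * (n - 1) + [(mu1 + mu2) * g2]  # both servers at station 2 in state B + 2
    p = stationary(births, deaths)
    return mu1 * g2 * sum(p[1:n]) + (mu1 + mu2) * g2 * p[n]


if __name__ == "__main__":
    # Slower server 1 (mu1 <= mu2) and slower station 1 (g1 <= g2).
    for B in range(5):
        print(B, throughput_pi1(1, 3, 1, 2, B) - throughput_pi2(1, 3, 1, 2, B))
```

Under these assumptions, the printed differences are nonnegative for the rates shown, in line with dedicating the slower server to the slower station.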

3.3. Systems with Two Flexible Servers

In this subsection we assume that Assumption E holds, B < ∞, M = 2, and l = 2. The optimal policy in this case is given in Theorem 4.1 of [Reference Andradóttir, Ayhan and Down6], but we repeat it here for the sake of completeness.

Proposition 3.6

(i) If μ₁₁μ₂₂ ≥ μ₂₁μ₁₂, then the policy that assigns server 1 to station 1 and server 2 to station 2 unless station 1 is blocked or station 2 is starved and assigns both servers to station 1 (station 2) when station 2 (station 1) is starved (blocked) is optimal. Moreover, this is the unique optimal policy in the set of Markovian stationary deterministic policies if the inequality is strict.
(ii) If μ₂₁μ₁₂ ≥ μ₁₁μ₂₂, then the policy that assigns server 2 to station 1 and server 1 to station 2 unless station 1 is blocked or station 2 is starved and assigns both servers to station 1 (station 2) when station 2 (station 1) is starved (blocked) is optimal. Moreover, this is the unique optimal policy in the set of Markovian stationary deterministic policies if the inequality is strict.

If the servers are generalists, then Theorem 2.1 implies that any nonidling policy is optimal, including both of the policies defined in Proposition 3.6.

4. SYSTEMS WITH THREE SERVERS

In this section we focus on the assignment of M = 3 servers to the two stations. In particular, Section 4.1 focuses on systems in which two servers are dedicated and one server is flexible. In Section 4.2 we study systems with two flexible servers and one dedicated server. Finally, Section 4.3 presents the optimal server assignment policy when all three servers are flexible. Note that the case when all three servers are dedicated is covered by Remark 3.1.

We will need the following preliminaries. For nonnegative scalars μ_d1, μ_d2, μ_m1, μ_m2, μ_u1, and μ_u2 and for all i ∈ {0, 1, …}, define

(9)

$\eqalignno{f\lpar i\rpar& = \mu_{d2}^{i-2} \lpar \mu_{m1}\mu_{d2} -\mu_{d1} \mu _{m2}\rpar \lpar \mu _{d2}+\mu _{m2}+\mu _{u2}\rpar &\cr &\quad\times \sum\limits_{j=0}^{B-i+2} \mu_{u1}^{j}\lpar \mu _{d2}+\mu_ {m2}\rpar ^{B-i-j+2} - \mu_{u1}^{B-i+2} \lpar \mu _{u1}\mu _{m2} - \mu_{m1} \mu _{u2}\rpar &\cr &\quad\times \lpar \mu _{d1}+\mu _{m1}+\mu _{u1}\rpar \sum\limits_{j=0}^{i-2} \mu _{d2}^{j}\lpar \mu _{m1}+\mu _{u1}\rpar ^{i-j-2},}$

with the convention that summation over an empty set equals zero. Note that for i = 1, … , B + 2, f(i) is proportional to the difference between the throughputs of two policies that have server m move to station 2 at state i and state i − 1, respectively. Let

$S^{\ast} = \{s \in S \setminus \! \{0\}: f\lpar s\rpar \ge 0 \,\hbox{and}\, f\lpar s+1\rpar \le 0\}.$

We know from Proposition 3.1 of Andradóttir and Ayhan [Reference Andradóttir and Ayhan5] that S* ≠ ∅.
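To make the definitions of f and S* concrete, the following sketch evaluates (9) with exact rational arithmetic and enumerates S*. It assumes that S = {0, … , B + 2} (the state space is not restated in this section) and that μ_d2 and μ_u1 are strictly positive, since (9) involves negative exponents at i = 1 and i = B + 3. The names `f` and `s_star_set` are hypothetical.

```python
from fractions import Fraction


def f(i, B, d1, d2, m1, m2, u1, u2):
    """Evaluate (9) at i; an empty sum is zero. Requires d2 > 0 and u1 > 0."""
    d1, d2, m1, m2, u1, u2 = (Fraction(x) for x in (d1, d2, m1, m2, u1, u2))
    # Sum over j = 0, ..., B - i + 2 (empty when i = B + 3).
    first_sum = sum(u1 ** j * (d2 + m2) ** (B - i - j + 2) for j in range(B - i + 3))
    # Sum over j = 0, ..., i - 2 (empty when i <= 1).
    second_sum = sum(d2 ** j * (m1 + u1) ** (i - j - 2) for j in range(i - 1))
    return (
        d2 ** (i - 2) * (m1 * d2 - d1 * m2) * (d2 + m2 + u2) * first_sum
        - u1 ** (B - i + 2) * (u1 * m2 - m1 * u2) * (d1 + m1 + u1) * second_sum
    )


def s_star_set(B, **rates):
    """All s in {1, ..., B + 2} with f(s) >= 0 and f(s + 1) <= 0 (assumes S = {0, ..., B + 2})."""
    return [
        s for s in range(1, B + 3)
        if f(s, B, **rates) >= 0 and f(s + 1, B, **rates) <= 0
    ]


if __name__ == "__main__":
    # Generalist rates mu_ij = mu_i * gamma_j with gamma = (1, 2); server u has no rate at station 2.
    print(s_star_set(2, d1=2, d2=4, m1=3, m2=6, u1=1, u2=0))
```

Any element of the returned list can serve as the threshold s* in the propositions that follow.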

4.1. Systems with Two Dedicated Servers and One Flexible Server

We first assume that M = 3 and l = 1. Without loss of generality, we consider the case in which the dedicated servers are assigned to different stations because otherwise this would be a special case of the model with one dedicated and one flexible server discussed in Section 3.2. The server who is dedicated to station 1 is called the “upstream” server and will be denoted by u ∈ {1, 2, 3}, the server who is dedicated to station 2 is called the “downstream” server and will be denoted by d ∈ {1, 2, 3}∖{u}, and the flexible server is called the “moving” server and will be denoted by m ∈ {1, 2, 3}∖{u, d}.

For B < ∞, fixed d, m, u, and all i ∈ {0, 1, …}, (9) reduces to

$\eqalignno{f\lpar i\rpar &= \mu _{m1}\mu _{d2}^{i-1} \lpar \mu _{d2} + \mu _{m2}\rpar \sum\limits_{j=0}^{B-i+2} \mu _{u1}^{j} \lpar \mu _{d2} + \mu _{m2}\rpar ^{B-i+2-j} &\cr &\quad - \mu _{m2}\mu _{u1}^{B-i+3} \lpar \mu _{m1}+\mu _{u1}\rpar \sum\limits_{j=0}^{i-2} \mu _{d2}^{j} \lpar \mu _{m1}+\mu _{u1}\rpar ^{i-j-2}}$

by setting μ_u2 = μ_d1 = 0. The following proposition, which follows from Theorem 3.1 of Andradóttir and Ayhan [Reference Andradóttir and Ayhan5], describes the optimal dynamic assignment of the flexible server m when servers d, m, and u are known and B < ∞.

Proposition 4.1

Suppose that Assumption E holds, B < ∞, d, m, and u are fixed, and s* ∈ S*. Let

$\delta^{\ast}\lpar s\rpar = \left\{\matrix{\hbox{servers } m \hbox{ and } u \hbox{ work at station 1, server } d \hbox{ works at station 2} \hfill & \hbox{for } 0 \le s \le s^{\ast} - 1, \hfill \cr \hbox{server } u \hbox{ works at station 1, servers } d \hbox{ and } m \hbox{ work at station 2} \hfill & \hbox{for } s^{\ast} \le s \le B + 2. \hfill \cr}\right.$

Then (δ*)^∞ (the policy corresponding to the decision rule δ*) is optimal. Moreover, this is the unique optimal policy in the class of Markovian stationary deterministic policies if S* = {s*}.

With respect to determining which server should be the upstream one, which server should be the downstream one, and which server should be the flexible one, we know from Proposition 4.1 that we need to consider only six policies. The throughput expression g ₀ given in the proof of Theorem 3.1 of Andradóttir and Ayhan [Reference Andradóttir and Ayhan5] (with the rate of the upstream server at station 2 and the rate of the downstream server at station 1 set equal to zero) can now be used to compare the throughputs of the resulting policies and to determine which one is optimal.

Next assume that μ_ij = μ_iγ_j > 0 for all i ∈ {1, 2, 3} and j ∈ {1, 2}. We can assume that μ₁, μ₂, and μ₃ are all strictly positive because the problem reduces to having two servers if any of these rates are equal to zero. We will show that the optimal policy should have the fastest server as the flexible one. The following lemma, whose proof is given in the Appendix, shows that if the upstream server is known, among the remaining two servers the faster one should be the flexible server.

Lemma 4.1

Suppose that Assumption E holds and B < ∞. If server 1 is dedicated to the upstream station and μ₃ ≥ μ₂, then in the class of policies with a dedicated server at each station, (δ*)^∞ is optimal, where

$\delta^{\ast}\lpar s\rpar = \left\{\matrix{\hbox{servers 1 and 3 work at station 1, server 2 works at station 2} \hfill & \hbox{for } 0 \le s \le s^{\ast} - 1, \hfill \cr \hbox{server 1 works at station 1, servers 2 and 3 work at station 2} \hfill & \hbox{for } s^{\ast} \le s \le B + 2, \hfill \cr}\right.$

and s* ∈ S* is defined as above with μ_u1 = μ₁γ₁, μ_m1 = μ₃γ₁, μ_m2 = μ₃γ₂, μ_d2 = μ₂γ₂, and μ_u2 = μ_d1 = 0.

The following proposition states that when μ_ij = μ_iγ_j for all i ∈ {1, 2, 3} and j ∈ {1, 2}, the fastest server should be the flexible one.

Proposition 4.2

Suppose that Assumption E holds and B < ∞. If μ_ij = μ_iγ_j for all i ∈ {1, 2, 3} and j ∈ {1, 2}, then the optimal policy should have the server arg max{i : μ_i} as the flexible server.

Proof

The proof follows from Lemma 4.1 above and Proposition 5.1 of Andradóttir and Ayhan [Reference Andradóttir and Ayhan5] on the reversibility of tandem queues with two stations and flexible servers. ■

Remark 4.1

Note that the result in Proposition 4.2 is correct even if there is more than one dedicated server at stations 1 and 2 and a team of multiple flexible servers who move together between the stations. Hence, if there are three teams of generalist servers and one of these teams should be dedicated to each of stations 1 and 2, then it is optimal to have the fastest team as the flexible one.

From Proposition 4.2, we know that m ∈ arg max{i : μ_i}, but the choice of the servers u and d is not specified and in fact this choice can depend on the buffer size B (see Section 6). Nevertheless, Propositions 4.1 and 4.2 show that when the servers are all generalists, there are only two policies that might be optimal. Consequently, one can compute the throughput of both policies and determine which one yields the higher throughput. We now consider a special case in which the allocation of the dedicated servers can be specified.

Remark 4.2

Suppose that M = 3, l = 1, and γ₁ = γ₂. Then we can characterize the optimal policy completely because in this case as long as the fastest server is the flexible one, the throughput does not depend on the allocation of the remaining two servers. In order to see this, without loss of generality, assume that γ₁ = γ₂ = 1, μ₃ ≥ μ₁, μ₃ ≥ μ₂, and B < ∞. Let T _π₁(B) be the optimal throughput under a policy π₁ that has server 1 as the upstream server, server 2 as the downstream server, and server 3 as the flexible server. Similarly, let T _π₂(B) be the optimal throughput under a policy π₂ that has server 2 as the upstream server, server 1 as the downstream server, and server 3 as the flexible server. It then follows from Proposition 5.1 of Andradóttir and Ayhan [Reference Andradóttir and Ayhan5] that T _π₁(B) = T _π₂(B) for all 0 ≤ B < ∞.
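The symmetry described in Remark 4.2 can be illustrated by brute force: for each assignment of the dedicated servers, evaluate every threshold policy of the form in Proposition 4.1 and keep the best one. The sketch below assumes that the state s ∈ {0, … , B + 2} counts jobs that have left station 1 but not station 2 and that servers at the same station add their rates; it searches all thresholds s* ∈ {1, … , B + 2} rather than computing S*. The function names are hypothetical.

```python
from fractions import Fraction


def threshold_throughput(mu_u1, mu_d2, mu_m1, mu_m2, B, s_star):
    """Throughput when server m works at station 1 in states s < s_star and at station 2 otherwise."""
    weights = [Fraction(1)]
    death_rates = [Fraction(0)]  # no departures from station 2 in state 0
    for s in range(1, B + 3):
        birth = mu_u1 + (mu_m1 if s - 1 < s_star else 0)  # rate out of state s - 1
        death = mu_d2 + (mu_m2 if s >= s_star else 0)     # rate out of state s
        weights.append(weights[-1] * Fraction(birth) / Fraction(death))
        death_rates.append(Fraction(death))
    total = sum(weights)
    return sum(w * r for w, r in zip(weights, death_rates)) / total


def best_throughput(mu_u, mu_d, mu_m, gamma1, gamma2, B):
    """Best threshold policy for generalist rates mu_ij = mu_i * gamma_j."""
    return max(
        threshold_throughput(mu_u * gamma1, mu_d * gamma2, mu_m * gamma1, mu_m * gamma2, B, s)
        for s in range(1, B + 3)
    )


if __name__ == "__main__":
    # gamma_1 = gamma_2 = 1 and server 3 (the fastest) is the flexible one.
    for B in range(4):
        print(B, best_throughput(1, 2, 3, 1, 1, B), best_throughput(2, 1, 3, 1, 1, B))
```

With γ₁ = γ₂, the two columns coincide for each B in this sketch, which is what the reversibility argument predicts.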

4.2. Systems with One Dedicated and Two Flexible Servers

In this subsection we assume that M = 3 and l = 2. First, assume that the server at station 1 is dedicated and as before denote this server by u. Define

$d \in \hbox{arg} \mathop{\min}\limits_{i\in\{1,\ 2,\ 3\} \setminus \{u\}} \left\{{\mu_{i1} \over \mu_{i2}}\right\}$

and m ∈ {1, 2, 3}∖{d, u}. For all i ∈ {0, 1, …}, (9) now reduces to

(10)

$\eqalignno{f\lpar i\rpar& = \mu _{d2}^{i-2} \lpar \mu _{m1} \mu _{d2} - \mu_{d1}\mu _{m2}\rpar \lpar \mu _{d2} + \mu _{m2}\rpar \sum\limits_{j = 0}^{B-i+2} \mu_{u1}^{\,j}\lpar \mu _{d2} + \mu _{m2}\rpar ^{B-i-j+2} &\cr &\quad - \mu_{u1}^{B-i+3} \mu _{m2}\lpar \mu _{d1}+\mu _{m1}+\mu _{u1}\rpar \sum\limits_{j=0}^{i-2} \mu\, _{d2}^{j} \lpar \mu _{m1}+\mu _{u1}\rpar ^{i-j-2}}$

by setting μ_u2 = 0. The following proposition, which describes the optimal dynamic assignment of servers d and m when server u is dedicated to station 1 and B < ∞, follows from Theorem 3.1 of Andradóttir and Ayhan [Reference Andradóttir and Ayhan5].

Proposition 4.3

Suppose that Assumption E holds, B < ∞, server u is dedicated to station 1, servers d and m are defined as above, and s* ∈ S*. Let

$\delta^{\ast}\lpar s\rpar = \left\{\matrix{\hbox{servers } d, m, \hbox{ and } u \hbox{ work at station 1} \hfill & \hbox{for } s = 0, \hfill \cr \hbox{servers } m \hbox{ and } u \hbox{ work at station 1, server } d \hbox{ works at station 2} \hfill & \hbox{for } 1 \le s \le s^{\ast} - 1, \hfill \cr \hbox{server } u \hbox{ works at station 1, servers } d \hbox{ and } m \hbox{ work at station 2} \hfill & \hbox{for } s^{\ast} \le s \le B + 2. \hfill \cr}\right.$

Then (δ*)^∞ is optimal. Moreover, this is the unique optimal policy in the class of Markovian stationary deterministic policies if S* = {s*}.

Next, assume that the server at station 2 is dedicated and denote this server by d. Define

$m \in \arg \min\limits_{i \in \{1,\ 2,\ 3\} \setminus \{d\}} \left\{\mu_{i1} \over \mu_{i\,2}\right\}$

and u ∈ {1, 2, 3}∖{m, d}. For all i ∈ {0, 1, …}, (9) now reduces to

(11)

$\eqalignno{f\lpar i\rpar & = \mu_{d2}^{i-1} \mu_{m1} \lpar \mu_{d2} + \mu_{m2} + \mu_{u2}\rpar \sum\limits_{j=0}^{B-i+2} \mu\, _{u1}^j \lpar \mu_{d2} + \mu_{m2}\rpar ^{B-i-j + 2} - \mu_{u1}^{B-i+2}\cr & \quad \times\lpar \mu_{u1} \mu_{m2} - \mu_{m1} \mu_{u2}\rpar \lpar \mu_{m1} + \mu_{u1}\rpar \sum\limits_{j=0}^{i-2} \mu\, _{d2}^j \lpar \mu_{m1} + \mu_{u1}\rpar ^{i-j-2}}$

by setting μ_d1 = 0. The following proposition, which describes the optimal dynamic assignment of servers u and m when server d is dedicated to station 2 and B < ∞, follows from Theorem 3.1 of Andradóttir and Ayhan [Reference Andradóttir and Ayhan5].

Proposition 4.4

Suppose that Assumption E holds, B < ∞, server d is dedicated to station 2, servers u and m are defined as above, and s* ∈ S*. Let

$\delta^{\ast}\lpar s\rpar = \left\{\matrix{\hbox{servers } m \hbox{ and } u \hbox{ work at station 1, server } d \hbox{ works at station 2} \hfill & \hbox{for } 0 \le s \le s^{\ast} - 1, \hfill \cr \hbox{server } u \hbox{ works at station 1, servers } m \hbox{ and } d \hbox{ work at station 2} \hfill & \hbox{for } s^{\ast} \le s \le B + 1, \hfill \cr \hbox{servers } d, m, \hbox{ and } u \hbox{ work at station 2} \hfill & \hbox{for } s = B + 2. \hfill \cr}\right.$

Then (δ*)^∞ is optimal. Moreover, this is the unique optimal policy in the class of Markovian stationary deterministic policies if S* = {s*}.

The next proposition provides a complete characterization of the optimal policy (which does not depend on B) when μ_ij = μ_iγ_j for all i ∈ {1, 2, 3} and j ∈ {1, 2}. The intuition behind the optimal policy described in Proposition 4.5 is to keep the faster servers busy at all times (which is achieved by having the faster servers as the flexible ones) and to keep the slower server as busy as possible (which is achieved by assigning the slower server to the slower station).

Proposition 4.5

Suppose that Assumption E holds and B < ∞. If μ_ij = μ_iγ_j for all i ∈ {1, 2, 3} and j ∈ {1, 2} and l = 2, then the optimal policy with one dedicated server is nonidling to the extent possible, has server arg min{i : μ_i} dedicated to station arg min{j : γ_j}, and the flexible servers {1, 2, 3}∖{arg min{i : μ_i}} work at station arg max{j : γ_j} unless station arg max{j : γ_j} is blocked or starved, in which case the servers work at station arg min{j : γ_j}.

Proof

We only consider the case with μ₁ ≤ min{μ₂, μ₃} and γ₁ ≤ γ₂ since the proofs of other cases are similar. Since μ₁ ≤ min{μ₂, μ₃}, we know from Proposition 2.2 that servers 2 and 3 are the flexible ones. Hence, it suffices to show that server 1 is dedicated to station 1 and servers 2 and 3 work at station 2 unless station 2 is starved. It follows from (10) that if server 1 is dedicated to station 1, then S* = {1}. Similarly, we know from (11) that if server 1 is dedicated to station 2, then S* = {B + 2}. Let π₁ (π₂) be the policy in which server 1 is dedicated to station 1 (station 2) and servers 2 and 3 work at station 2 (station 1) unless station 2 (station 1) is starved (blocked), in which case servers 2 and 3 work at station 1 (station 2). From Propositions 4.3 and 4.4, it suffices to show that T _π₁(B) ≥ T _π₂(B) for all B ≥ 0. With some algebra we have

(12)

$\eqalignno{T_{\pi_1} \lpar B\rpar - T_{\pi_2} \lpar B\rpar &= \lpar \mu_2 + \mu_3\rpar \gamma_2 \sum\limits_{s=1}^{B+2} p_{\pi_1} \lpar s\rpar \cr &\quad - \left( \mu_1 \gamma_2 \sum\limits_{s=1}^{B+1} p_{\pi_2} \lpar s\rpar + \lpar \mu_1 + \mu_2 + \mu_3\rpar \gamma_2 p_{\pi_2} \lpar B + 2\rpar \right) \cr & = {\Upsilon_1 \over \Upsilon_2} \ge 0}$

for all B ≥ 0, where

$\eqalignno{\Upsilon_{1} & = \lpar \mu _{1} + \mu _{2} + \mu_{3}\rpar \lpar \mu _{2}+\mu _{3}\rpar \gamma _{1} \gamma _{2}\mu_ {1}^{B+2} \lpar \gamma_{2} - \gamma_{1}\rpar\cr &\quad \times \sum\limits_{k=0}^{B+1} \gamma\, _{1}^{k} \gamma\, _{2}^k\mu_{1}^{k} \lpar \mu_{2} + \mu_{3}\rpar ^{B+1-k} \sum\limits_{j=0}^{2B+2-2k} \gamma\, _{1}^{\,j} \gamma_{2}^{2B+2-2k-j}}$

and

$\eqalignno{\Upsilon_{2} &= \left[\lpar \lpar \mu _{2} + \mu _{3}\rpar \gamma _{2}\rpar ^{B+2} + \lpar \mu _{1} + \mu _{2} + \mu _{3}\rpar \gamma _{1} \sum\limits_ {j=0}^{B+1} \lpar \mu _{1}\gamma _{1}\rpar\, ^{j}\lpar \lpar \mu _{2}+\mu _{3}\rpar \gamma _{2}\rpar ^{B+1-j} \right] \cr &\quad \! \times \left[\lpar \lpar \mu _{2}+\mu _{3}\rpar \gamma _{1}\rpar ^{B+2} + \lpar \mu _{1} + \mu _{2} + \mu _{3}\rpar \gamma _{2} \sum\limits_{j=0}^{B+1} \lpar \mu _{1}\gamma _{2}\rpar \, ^{j}\lpar \lpar \mu _{2}+\mu _{3}\rpar \gamma _{1}\rpar ^{B+1-j}\right].}$

Note that the equality in (12) holds only when γ₁ = γ₂, in which case the optimal throughput does not depend on the allocation of the dedicated server (see also Remark 4.2).■

4.3. Systems with Three Flexible Servers

In this subsection we assume that M = 3 and l = 3. The optimal policy is given in Theorem 3.1 of Andradóttir and Ayhan [Reference Andradóttir and Ayhan5], but we repeat it here for the sake of completeness. Let

$\eqalignno{& d \in \hbox{arg} \mathop{\min}\limits_{i\in \{1,\ 2,\ 3\}} \left\{{\mu _{i1} \over \mu _{i2}}\right\}, &\cr & m \in \hbox{arg} \mathop{\min}\limits_{i\in \{1,\ 2,\ 3\}\setminus \{d\}} \left\{{\mu _{i1} \over \mu _{i2}}\right\},}$

and u ∈ {1, 2, 3}∖{d, m}. Consider the following policy

$\delta^{i}\lpar s\rpar = \left\{\matrix{\hbox{servers } d, m, \hbox{ and } u \hbox{ work at station 1} \hfill & \hbox{for } s = 0, \hfill \cr \hbox{servers } m \hbox{ and } u \hbox{ work at station 1, server } d \hbox{ works at station 2} \hfill & \hbox{for } 1 \le s \le i - 1, \hfill \cr \hbox{server } u \hbox{ works at station 1, servers } d \hbox{ and } m \hbox{ work at station 2} \hfill & \hbox{for } i \le s \le B + 1, \hfill \cr \hbox{servers } d, m, \hbox{ and } u \hbox{ work at station 2} \hfill & \hbox{for } s = B + 2, \hfill \cr}\right.$

where i ∈ {1, … , B + 2}. The following proposition characterizes the optimal server assignment policy.

Proposition 4.6

Suppose that Assumption E holds and B < ∞. Let s* ∈ S*. Then (δ^s*)^∞ is optimal. Moreover, this is the unique optimal policy in the class of Markovian stationary deterministic policies if S* = {s*}.

When the servers are generalists, any nonidling policy is optimal by Theorem 2.1, including all of the threshold policies (δⁱ)^∞, where i = 1, … , B + 2.

5. SYSTEMS WITH LARGE BUFFERS

In this section we show that the throughput of the optimal policies for the finite-buffered systems considered in Sections 3 and 4 converges to the throughput of the optimal policy for the corresponding infinite-buffered systems as the (finite) buffer size B becomes large. Consider a system with M ≥ 1 flexible servers and N = 2 stations. For i = 1, … , M, define

$\rho_{i} = {\mu _{i1} \over \mu _{i2}},$

with the convention that a positive real number divided by zero is equal to ∞ (recall that we have assumed throughout that $\sum_{j=1}^{N} \mu_{ij} > 0$ for i = 1, … , M). Relabel the servers so that ρ₁ ≤ ρ₂ ≤ ··· ≤ ρ_M. Let T*(∞) be the throughput of the optimal policy when B = ∞. We know from Andradóttir et al. [Reference Andradóttir, Ayhan and Down7] that T*(∞) = λ*, where λ* is the optimal objective function value of the following linear program (P):

$\matrix{\hbox{Maximize} &\lambda\hfill\cr \hbox{s.t.} &\cr &\displaystyle\sum\limits_{i=1}^{M} \alpha_{i1} \mu_{i1} \ge \lambda,\hfill\cr &\displaystyle\sum\limits_{i=1}^{M} \alpha_{i2} \mu_{i2} \ge \lambda,\hfill\cr &\alpha_{i1} + \alpha_{i2} \le 1 \,\hbox{for all}\, i=1, \ldots, M,\hfill\cr &\alpha_{i1} \ge 0, \alpha_{i2} \ge 0, \hbox{for all}\, i=1, \ldots, M.\cr}$

The parameters α_ij, i = 1, … , M, j = 1, 2, can be interpreted as the long-run fraction of time that server i spends at station j. We have the following:

Proposition 5.1

Let p* = min{i ∈ {1, … , M} : ∑_{k=1}^{i} μ_k2 ≥ ∑_{k=i+1}^{M} μ_k1}. Then one optimal solution to (P) is given as

$\alpha_{i1}^{\ast} = \left\{\matrix{0\hfill & \hbox{for } i = 1, \ldots, p^{\ast} - 1,\hfill\cr {\displaystyle\sum\limits_{i=1}^{p^{\ast}} \mu_{i2} - \displaystyle\sum\limits_{i=p^{\ast}+1}^{M} \mu_{i1} \over \mu_{p^{\ast}1} + \mu_{p^{\ast}2}} & \hbox{for } i = p^{\ast},\hfill\cr 1\hfill & \hbox{for } i = p^{\ast} + 1, \ldots, M,\hfill\cr } \right.$

$\eqalign{\alpha^{\ast}_{i2} &= 1 - \alpha^{\ast}_{i1} \ \hbox{for}\ i = 1, \ldots, M, \ \hbox{and} \cr \lambda^{\ast} &= {\mu_{p^{\ast}1} \displaystyle\sum\limits_{i=1}^{p^{\ast}} \mu_{i2} + \mu_{p^{\ast}2} \displaystyle\sum\limits_{i=p^{\ast}+1}^{M} \mu_{i1} \over \mu_{p^{\ast}1} + \mu_{p^{\ast}2}}.}$

Proof

For j = 1, 2, let P_j be the set of servers with μ_ij = 0 and let P = {1, … , M}∖(P₁ ∪ P₂) be the set of servers with μ_i1 and μ_i2 positive. Without loss of generality, we can let α_ij = 0 for j = 1, 2 and all i ∈ P_j. For all i ∈ P and j = 1, 2, let β_ij = α_ijμ_i2. Then (P) is equivalent to

$\matrix{\hbox{Maximize} &\lambda\hfill\cr \hbox{s.t.} &\cr &\displaystyle\sum\limits_{i\in P} \beta_{i1}\rho_{i} \ge \lambda - \displaystyle\sum\limits_{i\in P_{2}} \mu_{i1},\hfill\cr &\displaystyle\sum\limits_{i\in P} \beta_{i2} \ge \lambda - \sum\limits_{i \in P_{1}} \mu_{i2},\hfill\cr &\beta_{i1} + \beta_{i2} \le \mu_{i2}\, \hbox{for all}\, i \in P,\hfill \cr &\beta_{i1} \ge 0, \beta_{i2} \ge 0,\, \hbox{for all}\, i \in P.\hfill \cr}$

It is now clear that we can restrict our attention to solutions β_ij, i ∈ P and j = 1, 2, such that β_i1 > 0 implies β_k1 = μ_k2 for all k > i and such that β_i2 = μ_i2 − β_i1 (because ρ₁ ≤ ρ₂ ≤ ··· ≤ ρ_M). This implies that there exist p ∈ {1, … , M} and a solution to (P) with α_i1 = 0 for i < p, α_i1 = 1 for i > p, and α_i2 = 1 − α_i1 for all i ∈ P. It only remains to show that p = p* and that α_p1 = α^*_p*1. Note that α^*_p*1 ∈ [0, 1) by the definition of p* and that

(13)

$\sum\limits_{i=1}^{M} \alpha_{i1}^{\ast} \mu_{i1} = {\mu_{p^{\ast}1} \displaystyle\sum\limits_{i=1}^{p^{\ast}} \mu_{i2} - \mu_{p^{\ast}1} \sum\limits_{i=p^{\ast}+1}^{M} \mu_{i1} \over \mu_{p^{\ast}1} + \mu_{p^{\ast}2}} + \sum\limits_{i=p^{\ast}+1}^{M} \mu_{i1} = \lambda^{\ast}$

and

(14)

$\sum\limits_{i=1}^{M} \alpha_{i2}^{\ast} \mu_{i2} = \mu_{p^{\ast}2} - \displaystyle{{\mu_{p^{\ast}2} \displaystyle\sum\limits_{i=1}^{p^{\ast}} \mu_{i2} - \mu_{p^{\ast}2} \sum\limits_{i=p^{\ast}+1}^{M} \mu_{i1} \over \mu_{p^{\ast}1}+ \mu_{p^{\ast}2}} } + \sum\limits_{i=1}^{p^{\ast}-1} \mu_{i2} = \lambda^{\ast}.$

If p < p*, then we must have

$\lambda \le \sum\limits_{i=1}^{p-1} \mu_{i2} + \lpar 1 - \alpha_{p1}\rpar \mu_{p2} = \sum\limits_{i=1}^{p} \mu_{i2} - \alpha_{p1} \mu_{p2} \le \sum\limits_{i=1}^{p^{\ast}-1} \mu_{i2} \le \lambda^{\ast}.$

Similarly, if p > p*, then

$\lambda \le \alpha_{p1}\mu_{p1} + \sum\limits_{i=p+1}^{M} \mu_{i1} \le \sum\limits_{i=p^{\ast}+1}^{M} \mu_{i1} \le \lambda^{\ast}.$

This shows that p = p*. Finally, the optimality of the choice α_p1 = α^*_p*1 now follows from (13) and (14).■

Proposition 5.1 shows that the servers are ordered in the same manner for the infinite-buffered system as for the finite-buffered system (i.e., according to the magnitude of ρ₁, … , ρ_M) and illustrates that when B = ∞, even though all M servers are flexible, there is an optimal policy with only one server working at both stations (in other words, optimality can be achieved with only one flexible server; see also Proposition 2 of Andradóttir et al. [Reference Andradóttir, Ayhan and Down7]). Note that this is different from the finite-buffered case, where all servers work at both stations in the optimal policy.
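The closed form in Proposition 5.1 is straightforward to evaluate. The sketch below is one possible Python transcription, under the assumption that each server has at least one positive rate (as assumed throughout); the function name `optimal_lp_solution` and its return convention are illustrative choices and do not come from the article.

```python
import math


def optimal_lp_solution(rates):
    """Evaluate the closed form of Proposition 5.1.

    rates: list of (mu_i1, mu_i2) pairs, one pair per server, with
    mu_i1 + mu_i2 > 0 for every server.
    Returns (lam, alpha1, order), where order[k] is the original index of the
    server in position k after sorting by rho_i = mu_i1 / mu_i2, and alpha1[k]
    is the fraction of time that server spends at station 1.
    """
    if not rates:
        raise ValueError("at least one server is required")

    def rho(r):
        # Convention from the text: a positive number divided by zero is infinity.
        return math.inf if r[1] == 0 else r[0] / r[1]

    order = sorted(range(len(rates)), key=lambda i: rho(rates[i]))
    mu = [rates[i] for i in order]
    M = len(mu)

    # p* is the smallest p with sum_{k<=p} mu_k2 >= sum_{k>p} mu_k1.
    # The condition always holds for p = M because the second sum is empty.
    for p in range(1, M + 1):
        head = sum(m2 for _, m2 in mu[:p])
        tail = sum(m1 for m1, _ in mu[p:])
        if head >= tail:
            break

    mu_p1, mu_p2 = mu[p - 1]
    alpha_p = (head - tail) / (mu_p1 + mu_p2)
    alpha1 = [0.0] * (p - 1) + [alpha_p] + [1.0] * (M - p)
    lam = (mu_p1 * head + mu_p2 * tail) / (mu_p1 + mu_p2)
    return lam, alpha1, order
```

For instance, with rates (1, 3) and (2, 1) the sorted order keeps the first server in position 1, p* = 1, the first server spends a quarter of its time at station 1, and both stations then process jobs at rate 9/4.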

Corollaries 5.1 and 5.2 provide an optimal solution to (P) for systems with M = 2 and M = 3 flexible servers, respectively.

Corollary 5.1

For a two-station tandem queue with M = 2 flexible servers, relabel the servers so that ρ₁ ≤ ρ₂. Then an optimal solution to (P) is given as follows:

(i) If μ₁₂ ≥ μ₂₁, then
$\alpha_{11}^{\ast}= {\mu _{12} - \mu _{\,21} \over \mu _{11} + \mu_{12}}{ \comma}$
$\alpha_{21}^{\ast} = 1{ \comma}$
$\alpha_{12}^{\ast} = {\mu _{11} + \mu _{\,21} \over \mu _{11} + \mu _{12}}{ \comma}$
$\alpha_{22}^{\ast} = 0,$
$\qquad \,\,\lambda^{\ast} = {\mu _{12}\lpar \mu _{11} + \mu _{21}\rpar \over \mu _{11} + \mu _{12}}.$
(ii) If μ₁₂ < μ₂₁, then
$\eqalign {\alpha ^*_{11} &= 0{ \comma} \cr \alpha_{21}^{\ast} &= {\mu _{12}+\mu _{\,22} \over \mu _{21}+\mu _{\,22}}{ \comma}}$
$\alpha_{12}^{\ast} = 1{ \comma}$
$\alpha_{22}^{\ast} = {\mu _{\,21} - \mu_{12} \over \mu _{\,21}+\mu _{22}}{ \comma}$
$\qquad \ \,\lambda^{\ast} = {\mu _{21}\lpar \mu _{12}+\mu _{\,22}\rpar \over \mu _{21} + \mu _{\,22}}.$

Corollary 5.2

For a two-station tandem queue with M = 3 flexible servers, define d, m, and u as in Section 4.3. Then an optimal solution to (P) is given as:

(i) If μ_d2 ≥ μ_m1 + μ_u1, then α^*_u2 = α^*_m2 = 0, α^*_u1 = α^*_m1 = 1,
$\alpha_{d1}^{\ast} = {\mu _{d2} - \mu_{m1} - \mu_{u1} \over \mu _{d1}+\mu _{d2}}{ \comma}$
$\alpha_{d2}^{\ast} = {\mu _{d1}+\mu _{m1} + \mu _{u1} \over \mu _{d1}+\mu _{d2}}{ \comma}$
$\lambda^{\ast} = {\mu _{d2}\lpar \mu _{d1}+\mu _{m1}+\mu _{u1}\rpar \over \mu _{d1} + \mu _{d2}}.$
(ii) If μ_d2 < μ_m1 + μ_u1 and μ_d2 + μ_m2 ≥ μ_u1, then α_u2^* = α_d1^* = 0, α_u1^* = α_d2^* = 1,
$\alpha_{m1}^{\ast} = {\mu _{d2} + \mu _{m2} - \mu_{u1} \over \mu _{m1} + \mu _{m2}}{ \comma}$
$\alpha_{m2}^{\ast} = {\mu _{m1} + \mu _{u1} - \mu_{d2} \over \mu _{m1} + \mu _{m2}}{ \comma}$
$\lambda^{\ast} = {\mu _{u1} \mu _{m2}+\mu _{m1}\mu _{d2}+\mu _{m1}\mu _{m2} \over \mu _{m1}+\mu _{m2}}.$
(iii) If μ_d2 + μ_m2 < μ_u1, then α_m1^* = α_d1^* = 0, α_m2^* = α_d2^* = 1,
$\alpha_{u1}^{\ast} = {\mu _{d2}+\mu _{m2}+\mu _{u2} \over \mu _{u1}+\mu _{u2}}{ \comma}$
$\alpha_{u2}^{\ast} = {\mu _{u1} - \mu_{m2} - \mu_{d2} \over \mu _{u1}+\mu _{u2}}{ \comma}$
$\lambda^{\ast} = {\mu _{u1}\lpar \mu _{u2}+\mu _{m2}+\mu _{d2}\rpar \over \mu _{u1}+\mu _{u2}}.$

We are now ready to show that the throughput of the optimal policy for the finite-buffered systems considered in Sections 3 and 4 converges to the optimal throughput given in Corollaries 5.1 and 5.2 as the buffer size gets large. In order to prove this, it is sufficient to focus on the system with three flexible servers discussed in Section 4.3 because the other systems can be obtained from this one by setting appropriate service rates equal to zero. Let T_s(B) be the throughput of the policy (δ^s)^∞ (see Section 4.3). Lemma 5.1 shows that T_s(B) approaches T*(∞) as B and s get large.

Lemma 5.1

Suppose that Assumption E holds and that servers d, m, and u are chosen as in Section 4.3. Then

$\mathop{\lim}\limits_{s\to \infty} \mathop{\lim}\limits_{B\to \infty} T_{s} \lpar B\rpar = T^{\ast} \lpar \infty\rpar .$

Proof

It is shown in Eq. (13) of Andradóttir and Ayhan [Reference Andradóttir and Ayhan5] that

$T_{s}\lpar B\rpar = \left\{\matrix{\displaystyle{\Theta_{1} \over \Theta_{2}} &\hbox{if}\; \mu _{d2} \neq \mu _{u1}+\mu _{m1}\, \hbox{and}\, \mu _{u1} \neq \mu _{m2}+\mu _{d2}\cr \displaystyle{\Theta_{3} \over \Theta_{4}} &\hbox{if}\; \mu _{d2}=\mu _{u1}+\mu _{m1}\, \hbox{and}\, \mu _{u1} \neq \mu _{m2}+\mu _{d2}\hfill\cr \displaystyle{\Theta_{5} \over \Theta_{6}} &\hbox{if}\; \mu _{d2} \neq \mu _{u1}+\mu _{m1}\, \hbox{and}\, \mu _{u1}=\mu _{m2}+\mu _{d2},\cr} \right.$

where

$\eqalignno{\Theta _{1} &= \lpar \mu _{u1}+\mu _{m1}+\mu _{d1}\rpar \left({\lpar \mu_{d2}^{s} - \lpar \mu _{u1}+\mu _{m1}\rpar ^{s}\rpar \over \mu _{d2}^{s - 1} \lpar \mu _{d2} - \lpar \mu _{u1}+\mu _{m1}\rpar \rpar } \right. &\cr &\quad \left. + {\lpar \mu _{u1}+\mu _{m1}\rpar ^{s - 1} \mu _{u1}\lpar \lpar \mu _{m2}+\mu _{d2}\rpar ^{B+2 - s} - \mu _{u1}^{B+2 - s}\rpar \over \lpar \mu _{m2}+\mu _{d2}\rpar ^{B+2 - s} \mu _{d2}^{s - 1} \lpar \lpar \mu _{m2}+\mu _{d2}\rpar - \mu_{u1}\rpar }\right) ,}$

$\eqalignno{\Theta_{2} &= 1 + {\lpar \mu _{u1}+\mu _{m1}+\mu _{d1}\rpar \lpar \mu_{d2}^{s - 1} - \lpar \mu _{u1}+\mu _{m1}\rpar ^{s - 1}\rpar \over \mu _{d2}^{s - 1} \lpar \mu _{d2} - \lpar \mu _{u1}+\mu _{m1}\rpar \rpar } &\cr &\quad + {\lpar \mu _{u1}+\mu _{m1}+\mu _{d1}\rpar \lpar \mu _{u1}+\mu _{m1}\rpar ^{s - 1} \over \lpar \mu _{m2}+\mu _{d2}\rpar ^{B+2 - s} \mu _{d2}^{s - 1}} &\cr &\quad \times \Bigg( {\lpar \mu _{m2}+\mu _{d2}\rpar ^{B+2 - s} - \mu _{u1}^{B+2 - s} \over \lpar \mu _{m2}+\mu _{d2}\rpar - \mu_{u1}} + {\mu_{u1}^{B+2 - s} \over \mu _{u2}+\mu _{m2}+\mu _{d2}}\Bigg) ,}$

$\Theta_3 = \lpar \mu _{u1}+\mu _{m1}+\mu _{d1}\rpar \Bigg( s + {\mu _{u1}\lpar \lpar \mu _{m2}+\mu _{d2}\rpar ^{B+2 - s} - \mu_{u1}^{B+2 - s}\rpar \over \lpar \mu _{m2}+\mu _{d2}\rpar ^{B+2 - s} \lpar \lpar \mu _{m2}+\mu _{d2}\rpar -\mu_{u1}\rpar }\Bigg),$

$\fleqalignno{\Theta_4 &= 1 + {\lpar \mu _{u1}+\mu _{m1}+\mu _{d1}\rpar \lpar s - 1\rpar \over \mu _{d2}} + {\lpar \mu _{u1}+\mu _{m1}+\mu _{d1}\rpar \over \lpar \mu _{m2}+\mu _{d2}\rpar ^{B+2 - s}} &\cr &\quad\!\! \times\Bigg( {\lpar \mu _{m2}+\mu _{d2}\rpar ^{B+2 - s} - \mu _{u1}^{B+2 - s} \over \lpar \mu _{m2}+\mu _{d2}\rpar - \mu_{u1}} + {\mu_{u1}^{B+2 - s} \over \mu _{u2} + \mu _{m2}+\mu _{d2}}\Bigg) ,}$

$\eqalignno{&\Theta_5 = \lpar \mu _{u1}+\mu _{m1}+\mu _{d1}\rpar \Bigg( {\lpar \mu _{d2}^{s} - \lpar \mu _{u1}+\mu _{m1}\rpar ^{s}\rpar \over \mu_ {d2}^{s - 1} \lpar \mu _{d2} - \lpar \mu _{u1} + \mu _{m1}\rpar \rpar } &\cr &\qquad + {\lpar \mu _{u1} + \mu _{m1}\rpar ^{s - 1} \lpar B+2-s\rpar \over \mu _{d2}^{s - 1}}\Bigg) ,}$

and

$\eqalignno{&\Theta_{6} = 1 + {\lpar \mu _{u1}+\mu _{m1}+\mu _{d1}\rpar \lpar \mu _{d2}^{s - 1} - \lpar \mu _{u1}+\mu _{m1}\rpar ^{s - 1}\rpar \over \mu _{d2}^{s - 1} \lpar \mu _{d2} - \lpar \mu _{u1}+\mu _{m1}\rpar \rpar } &\cr &\qquad + {\lpar \mu _{u1}+\mu _{m1}+\mu _{d1}\rpar \lpar \mu _{u1}+\mu _{m1}\rpar ^{s - 1} \over \lpar \mu _{m2}+\mu _{d2}\rpar \mu _{d2}^{s - 1}} \Bigg( B + 2 - s + {\lpar \mu _{m2}+\mu _{d2}\rpar \over \mu _{u2}+\mu _{m2}+\mu _{d2}}\Bigg) .}$

Note that the case with μ_d2 = μ_m1 + μ_u1 and μ_u1 = μ_d2 + μ_m2 is not possible because this implies that μ_m1 = μ_m2 = 0.

We will consider the three cases listed in Corollary 5.2. First, assume that μ_d2 ≥ μ_u1 + μ_m1. Consider the case when μ_d2 > μ_u1 + μ_m1 (if μ_d2 = μ_u1 + μ_m1, then one can carry out a similar analysis using the expression T _s(B) = Θ₃/Θ₄). We have T _s(B) = Θ₁/Θ₂ and

$\eqalignno{\mathop{\lim}\limits_{s\to \infty} \mathop{\lim}\limits_{B\to \infty} T_{s}\lpar B\rpar &= \mathop{\lim}\limits_{s\to \infty} \Bigg( \lpar \mu _{u1} + \mu _{m1} + \mu _{d1}\rpar \ \Bigg({\mu _{d2}\left( 1 - \left( {\mu _{u1} + \mu _{m1} \over \mu _{d2}}\right) ^{s}\right) \over \mu _{d2} - \mu_{u1} - \mu_{m1}} &\cr &\qquad + {\mu _{u1}\lpar \mu _{u1} + \mu _{m1}\rpar ^{s - 1} \over \mu _{d2}^{s - 1} \lpar \mu _{m2} + \mu _{d2} - \mu_{u1}\rpar }\Bigg) \Bigg[ 1 + \lpar \mu _{u1} + \mu _{m1} + \mu _{d1}\rpar &\cr &\qquad \times\Bigg( {1 - \left( {\mu _{u1} + \mu _{m1} \over \mu _{d2}}\right) ^{s - 1} \over \mu _{d2} - \mu_{u1} - \mu_{m1}} + \left( {\mu _{u1} + \mu _{m1} \over \mu _{d2}}\right) ^{s - 1} {1 \over \mu _{m2} + \mu _{d2} - \mu_{u1}}\Bigg) \Bigg]^{-1}\Bigg) &\cr & =\,{\mu _{d2}\lpar \mu _{u1} + \mu _{m1} + \mu _{d1}\rpar \over \mu _{d1} + \mu _{d2}} \cr & =\, T^{\ast}\lpar \infty\rpar. }$

Next, assume that μ_d2 < μ_m1 + μ_u1 and μ_d2 +μ_m2 ≥ μ_u1. Consider the case when μ_d2 + μ_m2 > μ_u1 (if μ_u1 = μ_d2 + μ_m2, then one can carry out a similar analysis using the expression T _s(B) = Θ₅/Θ₆). We have T _s(B) = Θ₁/Θ₂ and

$\eqalignno{\mathop{\lim}\limits_{s\to \infty} \mathop{\lim}\limits_{B\to \infty} T_{s}\lpar B\rpar & = \mathop{\lim}\limits_{s\to \infty} \Bigg( \lpar \mu _{u1}+\mu _{m1}+\mu _{d1}\rpar \Bigg( {\mu _{d2} \left( {\mu _{d2}\over \mu _{u1}+\mu _{m1} } \right) ^{s - 1} - \lpar \mu _{u1}+\mu _{m1}\rpar \over \mu _{d2} - \mu_{u1} - \mu_{m1}} &\cr &\quad + {\mu _{u1} \over \mu _{m2}+\mu _{d2} - \mu_{u1}}\Bigg) \Bigg[ \bigg( {\mu _{d2} \over \mu _{u1}+\mu _{m1}} \bigg) ^{s - 1} + \lpar \mu _{u1}+\mu _{m1}+\mu _{d1}\rpar &\cr &\quad \times\Bigg( {\left( {\mu _{d2} \over \mu _{u1}+\mu _{m1}}\right) ^{s - 1} - 1 \over \mu _{d2} - \mu_{u1} - \mu_{m1}} + {1 \over \mu _{m2}+\mu _{d2} - \mu_{u1}}\Bigg) \Bigg]^{-1}\Bigg) &\cr &= {\mu _{u1}\mu _{m2}+\mu _{m1}\mu _{d2}+\mu _{m1}\mu _{m2} \over \mu _{m1}+\mu _{m2}} \cr &= T^{\ast}\lpar \infty\rpar. }$

Finally, assume that μ_d2 + μ_m2 < μ_u1. Then T _s(B) = Θ₁/Θ₂ and

$\eqalignno{\lim\limits_{s \rightarrow \infty} \lim\limits_{B \rightarrow \infty} T_s \lpar B\rpar &= \lim\limits_{s \rightarrow \infty} \left[{-\mu_{u1} \over \mu_{m2} + \mu_{d2} - \mu_{u1}} \left( {-1 \over \mu_{m2} + \mu_{d2} - \mu_{u1}} \right. \right. \cr &\quad \left. \left. + {1 \over \mu_{u2} + \mu_{m2} + \mu_{d2}} \right)^{-1} \right] &\cr &= {\mu_{u1} \lpar \mu_{u2} + \mu_{m2} + \mu_{d2}\rpar \over \mu_{u1} + \mu_{u2}} = T^{\ast} \lpar \infty\rpar }$

(note that s becomes irrelevant in this case as B → ∞ because the system is not stable).■
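The convergence in Lemma 5.1 can also be observed numerically. The sketch below does not evaluate Θ₁/Θ₂ directly; instead it assumes, as the sums of rates appearing in Θ₁ and Θ₂ suggest, that the state under a threshold policy evolves as a birth-death chain on {0, … , B + 2} in which the servers at a station pool their service rates. The function name, the argument layout, and the requirement that all death rates used are positive are assumptions of this sketch.

```python
def threshold_policy_throughput(mu_d, mu_m, mu_u, threshold, B):
    """Throughput of the threshold policy with switch point `threshold`.

    Each mu_x is a pair (rate at station 1, rate at station 2). Assumes the
    pooled station-2 rates used in states 1, ..., B + 2 are positive.
    """
    def rates(s):
        # (birth rate, death rate) in state s under the threshold policy.
        if s == 0:
            return mu_d[0] + mu_m[0] + mu_u[0], 0.0
        if s <= threshold - 1:
            return mu_m[0] + mu_u[0], mu_d[1]
        if s <= B + 1:
            return mu_u[0], mu_d[1] + mu_m[1]
        return 0.0, mu_d[1] + mu_m[1] + mu_u[1]

    # Unnormalized stationary weights from the detailed balance equations.
    weights = [1.0]
    for s in range(1, B + 3):
        weights.append(weights[-1] * rates(s - 1)[0] / rates(s)[1])
    total = sum(weights)
    # Long-run departure rate from station 2.
    return sum(w * rates(s)[1] for s, w in enumerate(weights)) / total


if __name__ == "__main__":
    # Hypothetical rates with rho_d <= rho_m <= rho_u, case (ii) of Corollary 5.2,
    # for which lambda* = (3*2 + 2*4 + 2*2) / (2 + 2) = 4.5.
    for threshold, B in [(2, 5), (10, 30), (100, 300)]:
        print(threshold, B, threshold_policy_throughput((1, 4), (2, 2), (3, 1), threshold, B))
```

With the hypothetical rates in the example, the computed throughput should approach 4.5 as the switch point and the buffer size grow, in line with the second case of the proof.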

Let T* (B) = T _s*(B). We can now prove the following result.

Proposition 5.2

Suppose that Assumption E holds and d, m, and u are chosen as in Section 4.3. Then

$\lim\limits_{B \rightarrow \infty} T^{\ast} \lpar B\rpar = T^{\ast} \lpar \infty\rpar .$

Proof

Let π_B^* be an optimal policy when the buffer size is given by B ≥ 0. Now consider a system with buffer size B′, where B ≤ B′ ≤ ∞, and let π_B′ be the policy that chooses the same actions as π_B^* in states 0, 1, … , B + 2 and assigns all the servers to station 2 in states B + 3, B + 4, … , B′ + 2. Then

$T^{\ast} \lpar B\rpar = T_{\pi_{B}^{\ast}} \lpar B\rpar = T_{\pi_{B^{\prime}}} \lpar B^{\prime}\rpar \le T^{\ast} \lpar B^{\prime}\rpar \quad \hbox{for all}\,B^{\prime} \ge B,$

where the second equality follows since states B + 3, … , B′ + 2 are transient under policy π_B′. Consequently, we have that T ^*(B) ≤ T ^*(B + 1) and T ^*(B) ≤ T ^*(∞) for all B ≥ 0. This implies that lim_B→∞T ^*(B) exists and is bounded above by T ^*(∞). The result now follows from the fact that

$T^{\ast} \lpar \infty\rpar = \lim\limits_{s \rightarrow \infty}\,\lim\limits_{B \rightarrow \infty} T_s \lpar B\rpar \le \lim\limits_{s \rightarrow \infty}\,\lim_{B \rightarrow \infty} T^{\ast} \lpar B\rpar = \lim_{B \rightarrow \infty} T^{\ast} \lpar B\rpar ,$

where the first equality follows from Lemma 5.1.■

Note that the result in Proposition 5.2 also holds for systems with M = 2 servers (let μ_m1 = μ_m2 = 0 and select the switch point s arbitrarily in {1, … , B + 2}). Thus, Proposition 5.2 demonstrates that the throughput of the optimal policy for the finite-buffered system with M = 2 or M = 3 servers converges to the throughput of the optimal policy for the corresponding infinite-buffered system as the buffer size becomes large. Moreover, Proposition 5.2 and Corollaries 5.1 and 5.2 show that when the buffer size is large, the throughput of the best policy with a single moving server is close to the throughput of the optimal policy where all servers are flexible for systems with M = 2 and 3 servers. (When M = 1, the throughput of the optimal policy is the same for all buffer sizes B ≥ 0; see Theorem 2.1). Our numerical examples in Section 7 indicate that this assertion also holds for systems with large buffers and M > 3 servers who are generalists. One can now achieve near-optimal throughput with just one flexible server by selecting the dedicated servers and the stations to which they are assigned as in Proposition 5.1 and assigning the flexible server appropriately to stations for systems with large (but finite) buffers. However, when the buffer size is small, there could be a significant difference between the throughput of the optimal policy and the throughput of the best policy with a single moving server, as Figures 1 and 2 in Section 7 indicate.

Figure 1. Throughput values as a function of the number of servers when B = 0.

Figure 2. Throughput values as a function of the number of servers when B = 5.

6. COUNTERINTUITIVE EXAMPLES

In this section we provide examples with a dedicated server at each station and also a flexible server to illustrate the fact that both the selection of the downstream, upstream, and flexible servers and the assignment of the dedicated servers to the stations can depend on the buffer size B. Consequently, these examples suggest that obtaining structural results beyond the ones provided in this article that specify the optimal choice of the flexible server and to what stations the dedicated servers should be assigned is difficult.

Consider the case with μ₁₁ = 1.0, μ₁₂ = 1.099, μ₂₁ = 1.1, μ₂₂ = 1.21, μ₃₁ = 3.0, and μ₃₂ = 3.3. If B = 0, then the policy that assigns server 1 to station 1 and server 2 to station 2 and has server 3 as the flexible server is optimal among all the policies with only one flexible server (although the servers are not generalists in this example, this is consistent with Proposition 4.2). The same policy is also optimal for systems with B = ∞. One might then expect that this policy is also optimal for all 0 < B < ∞. However, this statement is not correct. It turns out that the optimal policy with one flexible server for B = 1 is the one that assigns server 2 to station 1 and server 1 to station 2 and has server 3 as the flexible one. Even though the policy with server 1 at station 1, server 2 at station 2, and server 3 moving is optimal for all B ≥ 6, for B < 6 the optimal policy alternates between the two policies mentioned earlier. Note that the counterintuitive behavior described in this paragraph can also occur when the servers are all generalists. For example, similar results hold for a system with generalist servers where γ₁ = 1.0, γ₂ = 1.1, μ₁ = 1.0, μ₂ = 1.1, and μ₃ = 3.0.

The examples in the previous paragraph also show that if a policy is optimal for two systems with buffer size B₁ and B₂, respectively, where B₁ < B₂ < ∞, it is not necessarily optimal for all B₁ ≤ B ≤ B₂. Moreover, they indicate that even if the optimal choice of which servers are dedicated does not depend on B, the assignment of the dedicated servers to stations might nevertheless depend on the buffer size.

With the next example we demonstrate that the choice of the flexible server can also depend on the buffer size. Suppose that μ₁₁ = 6.0, μ₁₂ = 5.0, μ₂₁ = 4.1, μ₂₂ = 4.01, and μ₃₁ = μ₃₂ = 5.0. For B = 0, the policy that assigns server 1 to station 1 and server 2 to station 2 and has server 3 as the flexible one is optimal. On the other hand, when B = 1, the optimal policy involves assigning server 2 to station 1 and server 3 to station 2 and having server 1 as the flexible server. Finally, when the buffer size is large, the optimal policy assigns server 1 to station 1 and server 3 to station 2 and has server 2 as the flexible one. Hence, depending on the buffer size, the optimal policy with one flexible server can have any one of the three servers as the flexible one.
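Comparisons of this kind can be explored by brute force. The following Python sketch enumerates every way of choosing a dedicated server for station 1, a dedicated server for station 2, and a flexible server, together with every switch point of the flexible server. It is not the authors' code: it assumes a threshold-type rule for the flexible server, pooled service rates at each station, and positive station-2 rates for all servers, and all function names are hypothetical.

```python
from itertools import permutations


def birth_death_throughput(births, deaths):
    """Long-run departure rate of a birth-death chain.

    births[s] and deaths[s] are the rates out of state s; deaths[s] must be
    positive for every s >= 1.
    """
    weights = [1.0]
    for s in range(1, len(births)):
        weights.append(weights[-1] * births[s - 1] / deaths[s])
    return sum(w * d for w, d in zip(weights, deaths)) / sum(weights)


def best_one_flexible_policy(rates, B):
    """Search all one-flexible-server threshold policies for three servers.

    rates[k] = (rate at station 1, rate at station 2) of server k + 1.
    Returns (throughput, server at station 1, server at station 2,
    flexible server, switch point), with servers numbered from 1.
    """
    best = None
    for a, b, c in permutations(range(3)):
        for t in range(1, B + 3):
            births, deaths = [], []
            for s in range(B + 3):
                flexible_at_station_1 = s < t
                # Station 1 is blocked in state B + 2; station 2 is empty in state 0.
                births.append(0.0 if s == B + 2 else
                              rates[a][0] + (rates[c][0] if flexible_at_station_1 else 0.0))
                deaths.append(0.0 if s == 0 else
                              rates[b][1] + (0.0 if flexible_at_station_1 else rates[c][1]))
            value = birth_death_throughput(births, deaths)
            if best is None or value > best[0]:
                best = (value, a + 1, b + 1, c + 1, t)
    return best


if __name__ == "__main__":
    # First example of this section, taking server 3's rate at station 2 to be 3.3.
    example = [(1.0, 1.099), (1.1, 1.21), (3.0, 3.3)]
    for B in range(0, 8):
        print(B, best_one_flexible_policy(example, B))
```

Running the loop over several buffer sizes gives a quick way to compare the output of this simplified model with the assignments reported in the text.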

7. NUMERICAL RESULTS

In this section we provide numerical results for systems with two stations, 1 ≤ M ≤ 10 servers, and exponentially distributed service requirements. Our objective with these numerical experiments is to see the effects of server flexibility on system throughput. Toward this end, we consider systems with a buffer of size B ∈ {0, 5, 10, 20} between the two stations and generalist servers (i.e., μ_ij = μ_iγ_j for all i ∈ {1, … , M} and j ∈ {1, 2}). We consider four different sets of numerical experiments for each buffer size B. In the first set of experiments, μ_i = 50 for all i ∈{1, … , 10} and γ₁ = γ₂ = 1. In the second set of experiments, the service rate μ_i of server i ∈ {1, … , M} is drawn independently from a uniform distribution with range [0, 100] and γ₁ = γ₂ = 1. In the third set of experiments, μ_i = 50 and γ₁ and γ₂ are drawn independently from a uniform distribution with range [0, 2]. Finally, in the fourth set of experiments, μ_i, for i ∈ {1, … , M}, is drawn independently from a uniform distribution with range [0, 100] and γ₁ and γ₂ are drawn independently from a uniform distribution with range [0, 2]. Note that in all four sets of numerical experiments, the mean of μ_i, for i = 1, … , M, is 50 and the common mean of γ₁ and γ₂ is 1, but the variance of μ_i, for i = 1, … , M, is either zero or 10⁴/12, and, similarly, the common variance of γ₁ and γ₂ is either zero or 1/3. Hence, the first set of numerical examples is concerned with situations in which the servers are all identical and the line is balanced; the second set focuses on situations with nonidentical servers and balanced lines; the third set addresses situations where servers are all identical and the line is unbalanced; and the fourth set of numerical examples studies situations with nonidentical servers and unbalanced lines.
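The four sampling schemes described above can be sketched as follows. This is only an illustration of how one replication's service rates might be generated; the function name `sample_rates`, the experiment numbering as an integer argument, and the use of Python's `random` module are assumptions of this sketch, not details taken from the article.

```python
import random


def sample_rates(experiment, M, rng):
    """Draw service rates mu_ij = mu_i * gamma_j for one replication.

    experiment: 1, 2, 3, or 4, following the order of the four sets above.
    rng: a random.Random instance (used only when something is random).
    Returns a list of M pairs (mu_i1, mu_i2).
    """
    if experiment not in (1, 2, 3, 4):
        raise ValueError("experiment must be 1, 2, 3, or 4")
    # Server speeds: identical in sets 1 and 3, uniform on [0, 100] in sets 2 and 4.
    if experiment in (1, 3):
        mu = [50.0] * M
    else:
        mu = [rng.uniform(0.0, 100.0) for _ in range(M)]
    # Station factors: balanced in sets 1 and 2, uniform on [0, 2] in sets 3 and 4.
    if experiment in (1, 2):
        gamma = (1.0, 1.0)
    else:
        gamma = (rng.uniform(0.0, 2.0), rng.uniform(0.0, 2.0))
    return [(m * gamma[0], m * gamma[1]) for m in mu]


if __name__ == "__main__":
    rng = random.Random(2007)  # arbitrary seed chosen for reproducibility
    for experiment in (1, 2, 3, 4):
        print(experiment, sample_rates(experiment, 3, rng))
```

Because γ₁ and γ₂ are shared by all servers within a replication, every sampled system has generalist servers, which is what allows Theorem 2.1 to be applied to the all-flexible case.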

For all four sets of numerical experiments, we consider five different policies:

• an arbitrary stationary policy where server 1 and all even-numbered servers except for server 2 work at station 1 and the remaining servers work at station 2;
• the best stationary policy (i.e., the policy with the largest throughput among those with l = 0);
• an arbitrary policy with only one flexible server (so that l = 1) assigned optimally to stations, which is any nonidling policy for M = 1, server 1 works at station 1 and server 2 is flexible for M = 2, and for M ≥ 3, server 3 moves, all remaining odd-numbered servers work at station 1, and all even-numbered servers work at station 2;
• the best policy with one flexible server and dedicated servers at both stations when M ≥ 3 and a dedicated server at one station when M = 2 (i.e., the policy with the largest throughput among those with l = 1 and dedicated servers at both stations when M ≥ 3 and a dedicated server optimally assigned to one station when M = 2);
• the optimal policy where all servers are allowed to move (l = M).

For each buffer size B, we compute the long-run average throughput of these five policies when the number of servers M varies from 1 to 10. Clearly, in the first set of experiments, the long-run average throughput of all five policies can be computed exactly. Moreover, in this case, since the throughputs of the arbitrary and best stationary policies are equal to each other and, similarly, the throughputs of the arbitrary and best policies with one flexible server are also equal to each other, we have only three different policies to compare. However, in the remaining three sets of numerical experiments, the long-run average throughputs of the five policies are estimated by finding the average throughput values of 1,000,000 replications for systems with 0 or 5 buffers, 500,000 replications for systems with 10 buffers, and 100,000 replications for systems with 20 buffers and for all choices of M (where each replication involves generating a different set of service rates μ_ij, for i ∈ {1, … , M} and j ∈ {1, 2}, and determining the throughput of the five policies under consideration for these service rates). Figures 1 through 4 display the throughputs of these policies for the four sets of numerical experiments as a function of the number of servers for the four different choices of B.

As expected, Figures 1–4 show that the long-run average throughput of the five policies increases as the number of servers increases. Moreover, the long-run average throughput of the optimal policy is a linear function of the number of servers (as predicted by Theorem 2.1). This assertion seems to hold for the average throughput of the best policy with one flexible server except for the first set of numerical experiments (depicted in plot (a) of Figs. 1–4). Similarly, it holds for the best stationary policy except for the first set of experiments when M > 1 (when M = 1, all policies with l = 0 have throughput equal to zero). Note that since all the servers are identical and the stations are balanced, an additional server does not improve the performance as much in the first set of numerical experiments if it leads to having an unequal number of servers dedicated to stations 1 and 2. Figures 1–4 also demonstrate that one can significantly improve the throughput by allowing servers to move. As Proposition 5.2 suggests, the performance of the best policy with one flexible server approaches that of the optimal policy as the buffer size increases. In fact, even for B = 5, the throughput of the best policy subject to one flexible server is close to the throughput of the optimal policy. Also, in numerical results not presented here in the interest of space, we found that when the optimal policy is considered as the baseline, the difference between the throughputs of the best policy subject to one flexible server and the optimal policy was always less than 17.2% for systems with B = 1 and less than 11.5% for systems with B = 3 (for all four sets of numerical experiments and all 1 ≤ M ≤ 10). This suggests that employing the best policy with one flexible server can yield near-optimal throughput even for systems with small to moderate buffer sizes. 
On the other hand, even though an arbitrary policy with one flexible server is better than an arbitrary stationary policy, one has to be careful when using an arbitrary policy with only one flexible server. As Figures 2–4 illustrate, the best stationary policy starts outperforming the arbitrary policy with one flexible server as the buffer size and the number of servers increase.

Figure 3. Throughput values as a function of the number of servers when B = 10.

Figure 4. Throughput values as a function of the number of servers when B = 20.

Finally, we comment on the improvement that we obtain by cross-training another server versus the improvement obtained by adding a resource (i.e., a new server or a buffer space). As Figures 1–4 illustrate, in all four sets of numerical experiments, the arbitrary policy with one flexible server among M (for M = 2, … , 9) servers yields a throughput very similar to that of the arbitrary stationary policy with M + 1 servers (considering the arbitrary stationary policy with M + 1 servers as the baseline, the difference between throughputs is less than 6% except when M is odd in the first set of numerical experiments, in which case the difference is less than 22%). The same assertion holds when comparing the best policy with one flexible server among M servers with the best stationary policy with M + 1 servers (in this case, the difference between throughputs is at most 24.1% considering the best stationary policy with M + 1 servers as the baseline, and this difference decreases as the number of servers increases). This suggests that, on average, increasing flexibility is almost as effective as increasing the number of servers in terms of improving throughput.

On the other hand, having all the servers flexible and assigning them optimally to the stations is more effective than adding buffer space. This is consistent with Theorem 2.1, which shows that when the servers are generalists, the optimal throughput (under the assumption that all servers are flexible) does not depend on the buffer size. Hence, for a fixed number of servers M, the throughput of the optimal policy for systems with B = 0 is larger than the throughput of all other policies for systems with B > 0. Similarly, in plots (b), (c), and (d) in Figures 1–4, for all M = 1, … ,10, the throughput of the best policy with one flexible server for systems with B = 5 outperforms the throughput of the best stationary policy for systems with B = 10 and the throughput of the best policy with one flexible server for systems with B = 10 outperforms the throughput of the best stationary policy for systems with B = 20. The same assertion holds for plot (a) (the first set of numerical experiments) when the number of servers is odd. When the number of servers is even, the throughput performance of the two policies for the corresponding systems is very similar (the throughput for the systems with one flexible server and the smaller buffer size is slightly smaller). Even though the throughput of the best policy with one flexible server for systems with B = 0 is, in general, less than the throughput of the best stationary policy for systems with B = 5, the difference between the throughputs is small in most cases. These observations suggest that, in general, it is more effective to add server flexibility rather than buffer space in order to increase the long-run average throughput.

8. CONCLUSIONS

For a tandem queuing network with N ≥ 2 stations, M ≥ 1 servers, an infinite supply of jobs in front of station 1, infinite room for completed jobs after station N, and either a finite or infinite buffer between consecutive stations, we studied the dynamic allocation of servers to stations with the goal of maximizing the long-run average throughput under the assumption that only a subset of the servers are flexible, with the remaining servers being dedicated to particular stations. When N = M = 2 and both servers are dedicated, we have specified which server should be assigned to which station in order to maximize the throughput. When N = 2, 1 ≤ M ≤ 3, and only a subset of the servers are flexible, we have shown that the allocation of the flexible servers is of threshold type and characterized the threshold values. However, the optimal selection of the dedicated and flexible servers, assignment of dedicated servers to stations, and the threshold(s) where the flexible server(s) switch from station 1 to station 2 can depend on the buffer size.

When the servers are generalists, we were able to completely characterize the optimal policy in almost all cases. In particular, when all servers are flexible, we proved that any nonidling policy is optimal for systems with finite buffers. When N = 2, B < ∞, service requirements are exponentially distributed, and all servers are assigned to two dedicated teams, we have shown that it is optimal to assign the faster team of servers to the slower station. Similarly, when N = 2, B < ∞, and the service requirements are exponentially distributed, we proved that the optimal policy should have the fastest l servers as the flexible ones if all of the dedicated servers are at the same station or if there is at least one dedicated server at each station and a single team of flexible servers and also that when all the dedicated servers are at the same station, they should be assigned to the slower station.

Finally, we showed that the throughput of the optimal policy for two-station Markovian tandem queues with M = 2 or M = 3 servers converges to the throughput of the optimal policy for the corresponding infinite-buffered systems as the buffer size becomes large. Moreover, we proved that for large buffer sizes, the throughput of the best policy with a single flexible server for these systems is close to the throughput of the optimal policy where all servers are flexible. Our numerical examples indicated that this assertion also holds for two-station tandem queues with M > 3 servers and moderate buffer size B when the servers are generalists with exponentially distributed service requirements. Furthermore, the numerical results illustrated that, in general, adding flexibility is almost as effective as adding a new server and more effective than adding a buffer space in improving the system throughput.

Acknowledgments

The research of the first author was supported by the National Science Foundation under grants DMI–0000135, DMI–0217860, and DMI–0400260. The research of the second author was supported by the National Science Foundation under grant DMI–9984352. The research of the third author was supported by the National Science Foundation under grant DMI–0000135 and by the Natural Sciences and Engineering Research Council of Canada.

APPENDIX

Proof of Lemma 4.1

If Assumption E holds and π ∈ Π, then it is clear that the stochastic process {X _π(t) : t ≥ 0} defined in Section 2.2 is a continuous-time Markov chain and that there exists a scalar q _π ≤ ∑_{i=1}^{M} max_{1≤j≤2} μ_{ij} < ∞ such that the transition rates {q _π(x, x′)} of {X _π(t)} satisfy ∑_{x′∈S, x′≠x} q _π(x, x′) ≤ q _π for all x ∈ S. Hence, {X _π(t)} is uniformizable for all π ∈ Π. Let {Y _π(k)} be the corresponding discrete-time Markov chain, so that {Y _π(k)} has state space S and transition probabilities p _π(x, x′) = q _π(x, x′)/q _π if x ≠ x′ and p _π(x, x) = 1 − ∑_{x′∈S, x′≠x} q _π(x, x′)/q _π for all x ∈ S. It has been shown by Andradóttir et al. [Reference Andradóttir, Ayhan and Down6] that the original optimization problem in (1) can be translated into an equivalent (discrete-time) Markov decision problem. More specifically, let

$R_{\pi} \lpar x\rpar = \left\{\matrix{q_{\pi}\, \lpar x, x - 1\rpar &\hbox{for}\,x \in \{1, \ldots, B + 2\} \cr 0\hfill &\hbox{for}\,x = 0,\hfill} \right.$

be the departure rate from state x under policy π, for all x ∈ S and π ∈ Π. Then the optimization problem (1) has the same solution as the Markov decision problem

(A1)

$\mathop{\max}\limits_{\pi \in \Pi} \lim_{K \rightarrow \infty } {\open E} \left[{1 \over K} \sum\limits_{k = 1}^{K} R_{\,\pi} \lpar Y_{\,\pi} \lpar k - 1\rpar \rpar \right].$
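The uniformization step that leads to (A1) can be illustrated with a minimal sketch. The function name `uniformize`, the dictionary input format, and the three-state numerical example are assumptions made purely for illustration; they are not taken from the paper. The sketch divides each off-diagonal rate by a constant q that dominates every total exit rate and places the leftover probability on the diagonal, and it records the departure rate R(x) = q(x, x − 1) used as the one-step reward.

```python
from fractions import Fraction

def uniformize(rates, states, q):
    """Turn CTMC transition rates into DTMC transition probabilities.

    rates: dict mapping (x, x_next) -> rate for x != x_next
           (a hypothetical input format chosen for this sketch).
    q:     uniformization constant, at least the largest total exit rate.
    """
    P = {x: {} for x in states}
    for x in states:
        out = sum(r for (a, b), r in rates.items() if a == x and b != x)
        if out > q:
            raise ValueError("q must dominate every total exit rate")
        for (a, b), r in rates.items():
            if a == x and b != x and r > 0:
                P[x][b] = Fraction(r) / q          # p(x, x') = q(x, x') / q
        P[x][x] = 1 - Fraction(out) / q            # leftover mass stays in x
    return P

# Illustrative three-state birth-death chain (the numbers are made up).
states = [0, 1, 2]
rates = {(0, 1): 3, (1, 0): 2, (1, 2): 1, (2, 1): 4}
P = uniformize(rates, states, q=5)

# Departure rate from each state, i.e. the reward R(x) = q(x, x - 1).
R = {x: rates.get((x, x - 1), 0) for x in states}
```

Exact rational arithmetic is used so that each row of the resulting matrix sums to one without rounding error.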

In what follows, we solve a more general problem that determines the optimal allocation of servers 2 and 3 subject to having at least one server at station 2 at all times (but this server could depend on the state) and server 1 at station 1 at all times. Since (as we show below) the optimal policy for this less restrictive problem has server 2 dedicated to station 2, server 1 dedicated to station 1, and server 3 flexible, we can conclude that the faster server should be the flexible one when there is a dedicated server at each station.

It follows from Proposition 2.1 of Andradóttir and Ayhan [Reference Andradóttir and Ayhan5] that it is sufficient to consider actions that do not allow the flexible servers to idle. Then the set A _s of allowable actions in state s ∈ S is given by A _s = {d ₀, d ₁, d ₂} for s ∈ {0, … , B + 1} and A _{B+2} = {d ₁}, where

$\eqalign{d_0 &= \hbox{servers 1 and 3 work at station 1, server 2 works at station 2}, \cr d_{1} &= \hbox{server 1 works at station 1, servers 2 and 3 work at station 2}, \cr d_{2} &= \hbox{servers 1 and 2 work at station 1, server 3 works at station 2.}}$

From our assumptions on the service rates, we have a recurrent Markov decision process and we can use the policy iteration algorithm for unichain models (see Puterman [Reference Puterman25, p. 378]) to prove the optimality of the policy described in Lemma 4.1.

In the policy iteration algorithm, we start by choosing

$\delta_0 \lpar s\rpar = \delta^{\ast} \lpar s\rpar =\left\{\matrix{d_0 &\hbox{for}\ 0 \le s \le s^{\ast} - 1\hfill \cr d_1 \hfill &\hbox{for}\ s^{\ast} \le s \le B + 2,} \right.$

corresponding to the policy described in Proposition 4.1. Then the reward vector r _δ₀ and the transition probability matrix P _δ₀ corresponding to the decision rule δ₀ are given as

$r_{\delta_0} \lpar s\rpar =\left\{\matrix{0\hfill &\hbox{for}\, s = 0 \hfill \cr \mu_{2} \gamma_{2}\hfill &\hbox{for}\,1 \le s \le s^{\ast} - 1\hfill \cr \mu_{2} \gamma_{2} + \mu_{3} \gamma_{2} \hfill &\hbox{for}\,s^{\ast}\le s \le B + 2,} \right.$

and

$P_{\delta_0} \lpar s, s^{\prime}\rpar = \left\{\matrix{\displaystyle{\lpar \mu_{1} + \mu_{3}\rpar \gamma_{1} \over q}\hfill &\quad \hbox{for}\,s = 0, s^{\prime} = 1 \hfill \cr \displaystyle{q - \lpar \mu_{3} + \mu_{1}\rpar \gamma_{1} \over q} \hfill &\quad \hbox{for}\,s = s^{\prime} = 0 \hfill \cr \displaystyle{\mu_{2} \gamma_{2} \over q} \hfill &\quad \hbox{for}\,1 \le s \le s^{\ast} - 1, s^{\prime} = s - 1 \hfill \cr \displaystyle{q - \lpar \mu_{2} \gamma_{2} + \mu_{1} \gamma_{1} + \mu_{3} \gamma_{1}\rpar \over q} &\quad \hbox{for}\,1 \le s \le s^{\ast} - 1, s^{\prime} = s \hfill \cr \displaystyle{\lpar \mu_{1} + \mu_{3}\rpar \gamma_{1} \over q} \hfill &\quad \hbox{for}\,1 \le s \le s^{\ast} - 1, s^{\prime}= s + 1 \cr \displaystyle{\lpar \mu_{2} + \mu_{3}\rpar \gamma_{2} \over q} \hfill &\quad \hbox{for}\,s^{\ast} \le s \le B + 1, s^{\prime} = s - 1\cr \displaystyle{q - \lpar \mu_{2} \gamma_{2} + \mu_{3} \gamma_{2} + \mu_{1} \gamma_{1}\rpar \over q} \hfill &\quad \hbox{for}\,s^{\ast} \le s \le B + 1, s^{\prime} = s \hfill \cr \displaystyle{\mu_{1} \gamma_{1} \over q} \hfill &\quad \hbox{for}\,s^{\ast} \le s \le B + 1, s^{\prime} = s + 1 \hfill \cr \displaystyle{\lpar \mu_{2} + \mu_{3}\rpar \gamma_{2} \over q} \hfill &\quad \hbox{for}\,s = B + 2, s^{\prime} = B + 1 \hfill \cr \displaystyle{q - \lpar \mu_{2} + \mu_{3}\rpar \gamma_{2} \over q} \hfill &\quad \hbox{for}\,s = s^{\prime} = B + 2.\hfill} \right.$
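The reward vector and transition matrix above can be assembled mechanically, which may help when checking the case distinctions. The following is a minimal sketch, assuming (as the displayed formulas suggest) that server i works at rate μ_i γ_j at station j; the function name `threshold_model` and the numerical values in the usage lines are hypothetical.

```python
from fractions import Fraction

def threshold_model(mu, gamma, B, s_star, q):
    """Reward vector and transition matrix for the threshold rule delta_0.

    mu = (mu1, mu2, mu3), gamma = (gamma1, gamma2); states are 0, ..., B + 2.
    Assumes 1 <= s_star <= B + 2 and q at least the largest total rate.
    """
    mu1, mu2, mu3 = (Fraction(m) for m in mu)
    g1, g2 = (Fraction(g) for g in gamma)
    n = B + 3
    r = [Fraction(0)] * n
    P = [[Fraction(0)] * n for _ in range(n)]
    for s in range(n):
        if s < s_star:                  # action d0: servers 1 and 3 at station 1
            up, down = (mu1 + mu3) * g1, mu2 * g2
        else:                           # action d1: servers 2 and 3 at station 2
            up, down = mu1 * g1, (mu2 + mu3) * g2
        if s == 0:
            down = Fraction(0)          # no job available at station 2
        if s == B + 2:
            up = Fraction(0)            # station 1 cannot release a job
        r[s] = down                     # reward is the departure rate
        if s > 0:
            P[s][s - 1] = down / q
        if s < B + 2:
            P[s][s + 1] = up / q
        P[s][s] = 1 - (up + down) / q
    return r, P

# Hypothetical example: mu = (1, 2, 3), gamma = (2, 1), B = 1, s* = 2, q = 12.
r, P = threshold_model((1, 2, 3), (2, 1), B=1, s_star=2, q=12)
```

In this example the constant q = 12 equals the sum over servers of their largest station rate, in line with the bound on q _π stated at the start of the proof.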

Since the Markov chain under the policy (δ₀)^∞ is irreducible, we can find a scalar g ₀ and a vector h ₀ solving

(A2)

$r_{\delta_0} - g_0 e + \lpar P_{\delta_0} - I\rpar h_0 = 0,$

subject to h ₀(0) = 0. In (A2), e is a column vector of ones and I is the identity matrix. Define

$\eqalign{\Phi_0 &= \gamma_{1} \gamma_{2} \lpar \mu_{1} + \mu_{3}\rpar \lpar \mu_{2} + \mu_{3}\rpar \cr &\quad \! \! \times \left( \lpar \mu_{2} \gamma_{2} + \mu_{3} \gamma_{2}\rpar ^{B + 2 - s^{\ast}} \sum\limits_{j = 0}^{s^{\ast} - 1} \lpar \mu_{1} \gamma_{1} + \mu_{3} \gamma_{1}\rpar ^{s^{\ast} - 1 - j} \lpar \mu_{2} \gamma_{2}\rpar \,^j \right. \cr &\quad \left. + \,\, \lpar \mu_{1} \gamma_{1} + \mu_{3} \gamma_{1}\rpar ^{s^{\ast}-1} \sum\limits_{j = 1}^{B + 2 - s^{\ast}} \lpar \, \mu_{2} \gamma_{2} + \mu_{3} \gamma_{2}\rpar ^{B + 2 - s^{\ast} - j} \lpar \mu_{1} \gamma_{1}\rpar \,^j \right) }$

and

$\eqalign{\Phi_1 &= \lpar \mu_{2} \gamma_{2} + \mu_{3} \gamma_{2}\rpar ^{B + 3 - s^{\ast}} \sum\limits_{j = 0}^{s^{\ast}-1} \lpar \mu_{1} \gamma_{1} + \mu_{3} \gamma_{1}\rpar ^{s^{\ast} - 1 - j} \lpar \mu_{2} \gamma_{2}\rpar \,^j \cr &\quad + \lpar \mu_{1}\gamma_{1} + \mu_{3} \gamma _{1}\rpar ^{s^{\ast}} \sum_{j=0}^{B + 2 - s^{\ast}} \lpar \mu_{2} \gamma_{2} + \mu_{3} \gamma_{2}\rpar ^{B + 2 - s^{\ast}-j} \lpar \mu_{1}\gamma_{1}\rpar \,^j.}$

One can show that g ₀ = Φ₀/Φ₁, h ₀(0) = 0,

$h_0 \lpar s\rpar = {q\lpar g_{0} - \mu_1 \gamma_{1} - \mu_3 \gamma_{1}\rpar \over \lpar \mu_{1} \gamma_{1} + \mu_{3} \gamma_{1}\rpar ^s} \sum\limits_{j=0}^{s - 2} \lpar\, j + 1\rpar \lpar \mu_{2} \gamma_{2}\rpar ^{s-j-1} \lpar \mu_{1} \gamma_{1} + \mu_{3} \gamma_{1}\rpar ^{\,\,j} + {s\,qg_0 \over \mu_{1} \gamma_{1} + \mu_{3} \gamma_{1}}$

for 1 ≤ s ≤ s*, and

$\eqalign{h_0 \lpar s\rpar &= h_0 \lpar s^{\ast}\rpar + \displaystyle{q\lpar \mu_{2} \gamma_{2} + \mu_{3} \gamma_{2}\rpar \sum\limits_{j = 0}^{s-s^{\ast}-1} \lpar \mu_{1} \gamma_{1}\rpar ^{s-s^{\ast}-j-1} \lpar \mu_{2} \gamma_{2} + \mu_{3} \gamma_{2}\rpar \,^j \over \lpar \mu_{1} \gamma_{1}\rpar ^{s-s^{\ast}} \lpar \mu_{1} \gamma_{1} + \mu_{3} \gamma_{1}\rpar ^{s^{\ast}}}\cr &\quad \times \left( g_{0}\lpar \mu_{1} \gamma_{1} + \mu_{3} \gamma_{1}\rpar ^{s^{\ast}-1} + \lpar g_{0} - \mu_1 \gamma_{1} - \mu_3 \gamma_{1}\rpar \right. \cr &\quad \times \left. \sum\limits_{j=0}^{s^{\ast}-2}\lpar \mu_{2} \gamma_{2}\rpar ^{s^{\ast}-j-1} \lpar \mu_{1} \gamma_{1} + \mu_{3} \gamma_{1}\rpar \,^j \right) \cr &\quad + {q\lpar g_{0} - \mu_{2} \gamma_{2} - \mu_3 \gamma_{2}\rpar \over \lpar \mu_{1} \gamma_{1}\rpar ^{s-s^{\ast}}} \sum\limits_{j = 0}^{s-s^{\ast}-1}\lpar\, j + 1\rpar \lpar \mu_{1} \gamma_{1}\rpar ^{\,j} \lpar \mu_{2} \gamma_{2} + \mu_{3} \gamma_{2}\rpar ^{s-s^{\ast}-j-1}}$

for s* + 1 ≤ s ≤ B + 2, constitute a solution to (A2). As is discussed in Puterman [Reference Puterman25, pp. 338–339], g ₀ represents the long-run average reward under policy (δ₀)^∞ (which is the long-run average throughput under policy (δ₀)^∞) and h ₀(s), s = 1, … , B + 2, can be interpreted as the asymptotic relative difference in total reward that results from starting the process in state s versus state 0.
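The claim g ₀ = Φ₀/Φ₁ can be checked numerically for any particular parameter choice. The sketch below is one possible check, not the authors' derivation: it assumes that server i works at rate μ_i γ_j at station j, computes the gain from the stationary distribution of the birth-death chain, recovers h ₀ by solving the rows of (A2) one state at a time with h ₀(0) = 0, and compares the gain with the closed-form ratio. The function names and the example parameters are hypothetical.

```python
from fractions import Fraction

def threshold_rates(mu, gamma, B, s_star):
    """Up (station 1) and down (station 2) rates in states 0..B+2 under delta_0."""
    (m1, m2, m3), (g1, g2) = mu, gamma
    up, down = [], []
    for s in range(B + 3):
        d0 = s < s_star                 # d0 below the threshold, d1 from s* on
        up.append(Fraction(0 if s == B + 2 else ((m1 + m3) * g1 if d0 else m1 * g1)))
        down.append(Fraction(0 if s == 0 else (m2 * g2 if d0 else (m2 + m3) * g2)))
    return up, down

def evaluate(mu, gamma, B, s_star, q):
    """Return (g, h, residual) for r - g e + (P - I) h = 0 with h(0) = 0."""
    up, down = threshold_rates(mu, gamma, B, s_star)
    n = B + 2
    w = [Fraction(1)]                   # unnormalized stationary weights
    for s in range(1, n + 1):
        w.append(w[-1] * up[s - 1] / down[s])
    g = sum(d * x for d, x in zip(down, w)) / sum(w)
    h = [Fraction(0)]
    for s in range(n):                  # row s of (A2) determines h(s + 1)
        prev = h[s - 1] if s > 0 else Fraction(0)
        h.append(h[s] + (q * (g - down[s]) + down[s] * (h[s] - prev)) / up[s])
    residual = down[n] - g + down[n] / q * (h[n - 1] - h[n])  # last row of (A2)
    return g, h, residual

def phi_ratio(mu, gamma, B, s_star):
    """Closed-form ratio Phi_0 / Phi_1 from the definitions above."""
    (m1, m2, m3), (g1, g2) = mu, gamma
    lam, alpha = (m1 + m3) * g1, m1 * g1    # station-1 rates under d0 and d1
    low, nu = m2 * g2, (m2 + m3) * g2       # station-2 rates under d0 and d1
    n = B + 2 - s_star
    a = sum(lam ** (s_star - 1 - j) * low ** j for j in range(s_star))
    phi0 = lam * nu * (nu ** n * a + lam ** (s_star - 1)
                       * sum(nu ** (n - j) * alpha ** j for j in range(1, n + 1)))
    phi1 = nu ** (n + 1) * a + lam ** s_star * sum(
        nu ** (n - j) * alpha ** j for j in range(n + 1))
    return Fraction(phi0) / Fraction(phi1)

# Hypothetical example: mu = (1, 2, 3), gamma = (2, 1), B = 1, s* = 2, q = 12.
g, h, residual = evaluate((1, 2, 3), (2, 1), B=1, s_star=2, q=12)
ratio = phi_ratio((1, 2, 3), (2, 1), B=1, s_star=2)
```

Because the arithmetic is exact, agreement between `g` and `ratio` and a zero `residual` confirm, for these parameters only, that the pair (g ₀, h ₀) solves every row of (A2).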

As the next step of the policy iteration algorithm, we choose

$\delta _1\lpar s\rpar \in \hbox{arg} \mathop{\max}\limits_{a \in A_s} \left\{r\,\lpar s, a\rpar +\sum\limits_{j \in S} p\lpar \,j \vert s, a\rpar h_0 \lpar \,j\rpar \right \}, \quad \forall s \in S,$

setting δ₁(s) = δ₀(s) if possible (here r(s, a) is the reward gained when action a is chosen in state s and p(j|s, a) is the transition probability from state s to state j when action a is chosen in state s). We now show that if s* ∈ S*, then δ₁(s) = δ₀(s), for all s ∈ S. In particular, for all s ∈ S and a ∈ A _s, we will show that the differences

$r\,\lpar s, a\rpar + \sum\limits_{j \in S} p\lpar\, j \vert s, a\rpar h_0 \, \lpar\, j\rpar - \left( r\,\lpar s, \delta_0 \lpar s\rpar \rpar + \sum\limits_{j \in S} p\lpar\, j \vert s, \delta_0\lpar s\rpar \rpar h_0 \lpar\, j\rpar \right)$

are nonpositive. Thus, we will prove that the decision rule corresponding to (δ₀)^∞ (which is the policy described in Lemma 4.1) satisfies the optimality equations and, hence, it is gain (long-run average reward) optimal.

For 0 ≤ s ≤ s* − 1, we have that δ₀(s) = d ₀. Define

$\eqalign{\Phi_2 &= \sum\limits_{j = 0}^{s^{\ast}-s-2} \lpar \mu_{1} \gamma_{1} + \mu_{3} \gamma_{1}\rpar ^{s^{\ast}-s-2-j} \lpar \mu_{2} \gamma_{2}\rpar ^{s + j}\cr &\quad \times \left( \lpar \mu_{1} \gamma_{1}\rpar ^{B + 3-s^{\ast}} + \mu_{3} \gamma_{1} \sum\limits_{j = 0}^{B+2-s^{\ast}} \lpar \mu_{2} \gamma_{2} + \mu_{3} \gamma_{2}\rpar ^{B+2-s^{\ast}-j} \lpar \mu_{1} \gamma_{1}\rpar ^{\,j} \right) \cr &\quad \times \lpar \mu_{1} + \mu_{2} + \mu_{3}\rpar .}$

We have

$\eqalign{& r\,\lpar s, d_1\rpar +\sum\limits_{j \in S} p\lpar \,j \vert s, d_1\rpar h_0 \lpar \,j\rpar - \left( r\lpar s, d_0\rpar + \sum\limits_{j \in S} p\lpar \, j \vert s, d_0\rpar h_0 \lpar \, j\rpar \right) \cr & \quad = - {\mu_{3} \gamma_{1} \gamma_{2} \lpar \Phi_2 + f\lpar s^{\ast}\rpar \rpar \over \Phi_1} \le 0}$

and

$\eqalign{& r\,\lpar s, d_2\rpar + \sum\limits_{j \in S} p\lpar \,j \vert s, d_2\rpar h_0\lpar\, j\rpar - \left( r\,\lpar s, d_0\rpar + \sum\limits_{j \in S} p\lpar\, j \vert s, d_0\rpar h_0 \lpar \,j\rpar \right) \cr & \qquad =- {\lpar \mu_{3} - \mu_{2}\rpar \gamma_{1} \gamma_{2} \lpar \Phi_2 + f\lpar s^{\ast}\rpar \rpar \over \Phi_1} \le 0.}$

Similarly, for s* ≤ s ≤ B + 1, we have that δ₀(s) = d ₁. Define

$\eqalign{\Phi_3 &= \sum\limits_{j = 0}^{s-s^{\ast}-1} \lpar \mu_{2} \gamma_{2} + \mu_{3} \gamma_{2}\rpar ^{s-s^{\ast}-1-j} \lpar \mu_{1} \gamma_{1}\rpar ^{B-s+2+j} \cr &\quad \times \left( \lpar \mu_{2} \gamma_{2}\rpar ^{s^{\ast}} + \mu_{3} \gamma_{2} \sum\limits_{j = 0}^{s^{\ast}-1} \lpar \mu_{1} \gamma_{1} + \mu_{3} \gamma_{1}\rpar ^{s^{\ast}-1-j} \lpar \mu_{2} \gamma_{2}\rpar ^{\,j} \right) \cr &\quad \times \lpar \mu_{1}+ \mu_{2} + \mu_{3}\rpar .}$

We have

$\eqalign{& r\,\lpar s, d_0\rpar + \sum\limits_{j \in S} p\lpar \,j \vert s, d_0\rpar h_0\lpar \,j\rpar - \left( r\,\lpar s, d_1\rpar + \sum\limits_{j \in S} p\lpar \,j \vert s, d_1\rpar h_0 \lpar \,j\rpar \right) \cr &= -{\mu_{3} \gamma_{1} \gamma_{2} \lpar \Phi_3 - f\lpar s^{\ast}+1\rpar \rpar \over \Phi_1} \le 0}$

and

$\eqalign{& r\,\lpar s, d_2\rpar + \sum\limits_{j \in S} p\lpar \,j\vert s, d_2\rpar h_0 \lpar \,j\rpar - \left( r\,\lpar s, d_1\rpar + \sum\limits_{j \in S} p\lpar \, j\vert s, d_1\rpar h_0 \lpar \,j\rpar \right) \cr &= -{\mu_{2} \gamma_{1} \gamma_{2} \lpar \Phi_3 - f\lpar s^{\ast} + 1\rpar \rpar \over \Phi_1} \le 0.}$

Since A _{B+2} = {d ₁}, we have shown that δ₁(s) = δ₀(s) for all s ∈ S. By Theorem 8.6.2 of Puterman [Reference Puterman25], this proves that the policy described in Lemma 4.1 is optimal. ■

References

1. Ahn, H.-S., Duenyas, I., & Lewis, M.E. (2002). The optimal control of a two-stage tandem queueing system with flexible servers. Probability in the Engineering and Informational Sciences 16: 453–469.

2. Ahn, H.-S., Duenyas, I., & Zhang, R. (1999). Optimal scheduling of a 2-stage tandem queue with parallel servers. Advances in Applied Probability 31: 1095–1117.

3. Ahn, H.-S., Duenyas, I., & Zhang, R. (2004). Optimal control of a flexible server. Advances in Applied Probability 36: 139–170.

4. Ahn, H.-S. & Righter, R. (2006). Dynamic load balancing with flexible workers. Advances in Applied Probability 38: 621–642.

5. Andradóttir, S. & Ayhan, H. (2005). Throughput maximization for tandem lines with two stations and flexible servers. Operations Research 53: 516–531.

6. Andradóttir, S., Ayhan, H., & Down, D.G. (2001). Server assignment policies for maximizing the steady-state throughput of finite queueing systems. Management Science 47: 1421–1439.

7. Andradóttir, S., Ayhan, H., & Down, D.G. (2003). Dynamic server allocation for queueing networks with flexible servers. Operations Research 51: 952–968.

8. Andradóttir, S., Ayhan, H., & Down, D.G. (2007). Compensating for failures with flexible servers. Operations Research (to appear).

9. Bartholdi, J.J. & Eisenstein, D.D. (1996). A production line that balances itself. Operations Research 44(1): 21–34.

10. Bartholdi, J.J., Eisenstein, D.D., & Foley, R.D. (2001). Performance of bucket brigades when work is stochastic. Operations Research 49: 710–719.

11. Bell, S.L. & Williams, R.J. (2001). Dynamic scheduling of a system with two parallel servers in heavy traffic with complete resource pooling: Asymptotic optimality of a continuous review threshold policy. Annals of Applied Probability 11: 608–649.

12. Bell, S.L. & Williams, R.J. (2005). Dynamic scheduling of a parallel server system in heavy traffic with complete resource pooling: Asymptotic optimality of a threshold policy. Electronic Journal of Probability 10: 1044–1115.

13. Farrar, T.M. (1993). Optimal use of extra server in a two station tandem queueing network. IEEE Transactions on Automatic Control 38(8): 196–199.

14. Gurumurthi, S. & Benjaafar, S. (2004). Modeling and analysis of flexible queueing systems. Naval Research Logistics 51: 755–782.

15. Hajek, B. (1984). Optimal control of interacting service stations. IEEE Transactions on Automatic Control 29(6): 491–499.

16. Harrison, J.M. & López, M.J. (1999). Heavy traffic resource pooling in parallel-server systems. Queueing Systems 33: 339–368.

17. Hopp, W.J., Tekin, E., & van Oyen, M.P. (2004). Benefits of skill chaining in serial production lines with cross-trained workers. Management Science 50: 83–98.

18. Hopp, W.J. & van Oyen, M.P. (2004). Agile workforce evaluation: A framework for cross-training and coordination. IIE Transactions 36: 919–940.

19. Jordan, W.J. & Graves, S.C. (1995). Principles on the benefits of manufacturing process flexibility. Management Science 41: 577–594.

20. Kaufman, D.L., Ahn, H.-S., & Lewis, M.E. (2005). On the introduction of an agile, temporary workforce into a tandem queueing system. Queueing Systems: Theory and Applications 51: 135–171.

21. Mandelbaum, A. & Stolyar, A.L. (2002). Scheduling flexible servers with convex delay costs: Heavy-traffic optimality of the generalized cμ-rule. Operations Research 52: 836–855.

22. McClain, J.O., Thomas, L.J., & Sox, C. (1992). On-the-fly line balancing with very little WIP. International Journal of Production Economics 27: 283–289.

23. Ostalaza, J., McClain, J., & Thomas, L.J. (1990). The use of dynamic programming (state-dependent) assembly line balancing to improve throughput. Journal of Manufacturing Operations Management 3: 105–133.

24. Pandelis, D.G. & Teneketsiz, D. (1994). Optimal multiserver stochastic scheduling of two interconnected priority queues. Advances in Applied Probability 26: 258–279.

25. Puterman, M.L. (1994). Markov decision processes. New York: Wiley.

26. Rosberg, Z., Varaiya, P.P., & Walrand, J.C. (1982). Optimal control of service in tandem queues. IEEE Transactions on Automatic Control 27(3): 600–609.

27. Sheikhzadeh, M., Benjaafar, S., & Gupta, D. (1998). Machine sharing in manufacturing systems: Flexibility versus chaining. International Journal of Flexible Manufacturing Systems 10: 351–378.

28. Squillante, M.S., Xia, C.H., Yao, D.D., & Zhang, L. (2001). Threshold based priority policies for parallel-server systems with affinity scheduling. In Proceedings of the 2001 American Control Conference, pp. 2992–2999.

29. Tassiulas, L. & Bhattacharya, L.L. (2000). Allocation of interdependent resources for maximal throughput. Stochastic Models 16: 27–48.

30. Wang, Y., Perkins, J.R., Vakili, P., & Khurana, A. (2001). Optimal allocation of heterogeneous resources: A control theoretic approach. In Proceedings of the 40th IEEE Conference on Decision and Control, pp. 1953–1958.

31. Wu, C.-H., Down, D.G., & Lewis, M.E. (2005). Heuristics for allocation of reconfigurable resources in a serial line with reliability considerations. Preprint.

32. Wu, C.-H., Lewis, M.E., & Veatch, M. (2006). Dynamic allocation of reconfigurable resources in a two-stage tandem queueing system with reliability considerations. IEEE Transactions on Automatic Control 51: 309–314.

33. Zavadlav, E., McClain, J.O., & Thomas, L.J. (1996). Self-buffering, self-balancing, self-flushing production lines. Management Science 42(8): 1151–1164.

Figure 1. Throughput values as a function of the number of servers when B = 0.

Figure 2. Throughput values as a function of the number of servers when B = 5.

Figure 3. Throughput values as a function of the number of servers when B = 10.

Figure 4. Throughput values as a function of the number of servers when B = 20.
