Flexible Queueing Architectures

We study a multiserver model with n flexible servers and n queues, connected through a bipartite graph, where the level of flexibility is captured by an upper bound on the graph's average degree, dn. Applications in content replication in data centers, skillbased routing in call centers, and fl...

Full description

Bibliographic Details
Main Authors: Tsitsiklis, John N. (Author), Xu, Kuang (Author)
Other Authors: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science (Contributor), Massachusetts Institute of Technology. Laboratory for Information and Decision Systems (Contributor)
Format: Article
Language:English
Published: Institute for Operations Research and the Management Sciences (INFORMS), 2020-04-24T17:38:43Z.
Subjects:
Online Access:Get fulltext
LEADER 02016 am a22001933u 4500
001 124853
042 |a dc 
100 1 0 |a Tsitsiklis, John N.  |e author 
100 1 0 |a Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science  |e contributor 
100 1 0 |a Massachusetts Institute of Technology. Laboratory for Information and Decision Systems  |e contributor 
700 1 0 |a Xu, Kuang  |e author 
245 0 0 |a Flexible Queueing Architectures 
260 |b Institute for Operations Research and the Management Sciences (INFORMS),   |c 2020-04-24T17:38:43Z. 
856 |z Get fulltext  |u https://hdl.handle.net/1721.1/124853 
520 |a We study a multiserver model with n flexible servers and n queues, connected through a bipartite graph, where the level of flexibility is captured by an upper bound on the graph's average degree, dn. Applications in content replication in data centers, skillbased routing in call centers, and flexible supply chains are among our main motivations. We focus on the scaling regime where the system size n tends to infinity, while the overall traffic intensity stays fixed. We show that a large capacity region and an asymptotically vanishing queueing delay are simultaneously achievable even under limited flexibility (d n ≪n). Our main results demonstrate that, when d n ≫ln n, a family of expander-graphbased flexibility architectures has a capacity region that is within a constant factor of the maximum possible, while simultaneously ensuring a diminishing queueing delay for all arrival rate vectors in the capacity region. Our analysis is centered around a new class of virtual-queue-based scheduling policies that rely on dynamically constructed job-toserver assignments on the connectivity graph. For comparison, we also analyze a natural family of modular architectures, which is simpler but has provably weaker performance. 
520 |a National Science Foundation (U.S.). (Grant CMMI-1234062) 
546 |a en 
655 7 |a Article 
773 |t Operations Research