TY - GEN
T1 - Distributed resource management across process boundaries
AU - Suresh, Lalith
AU - Bodik, Peter
AU - Menache, Ishai
AU - Canini, Marco
AU - Ciucu, Florin
N1 - Publisher Copyright:
© 2017 Association for Computing Machinery.
PY - 2017/9/24
Y1 - 2017/9/24
N2 - Multi-tenant distributed systems composed of small services, such as Service-oriented Architectures (SOAs) and Micro-services, raise new challenges in attaining high performance and efficient resource utilization. In these systems, a request execution spans tens to thousands of processes, and the execution paths and resource demands on different services are generally not known when a request first enters the system. In this paper, we highlight the fundamental challenges of regulating load and scheduling in SOAs while meeting end-to-end performance objectives on metrics of concern to both tenants and operators. We design Wisp, a framework for building SOAs that transparently adapts rate limiters and request schedulers system-wide according to operator policies to satisfy end-to-end goals while responding to changing system conditions. In evaluations against production as well as synthetic workloads, Wisp successfully enforces a range of end-to-end performance objectives, such as reducing average latencies, meeting deadlines, providing fairness and isolation, and avoiding system overload.
AB - Multi-tenant distributed systems composed of small services, such as Service-oriented Architectures (SOAs) and Micro-services, raise new challenges in attaining high performance and efficient resource utilization. In these systems, a request execution spans tens to thousands of processes, and the execution paths and resource demands on different services are generally not known when a request first enters the system. In this paper, we highlight the fundamental challenges of regulating load and scheduling in SOAs while meeting end-to-end performance objectives on metrics of concern to both tenants and operators. We design Wisp, a framework for building SOAs that transparently adapts rate limiters and request schedulers system-wide according to operator policies to satisfy end-to-end goals while responding to changing system conditions. In evaluations against production as well as synthetic workloads, Wisp successfully enforces a range of end-to-end performance objectives, such as reducing average latencies, meeting deadlines, providing fairness and isolation, and avoiding system overload.
KW - Microservices
KW - Rate limiting
KW - Resource management
KW - Scheduling
KW - Service-oriented architectures
UR - http://www.scopus.com/inward/record.url?scp=85032445578&partnerID=8YFLogxK
U2 - 10.1145/3127479.3132020
DO - 10.1145/3127479.3132020
M3 - Conference contribution
AN - SCOPUS:85032445578
T3 - SoCC 2017 - Proceedings of the 2017 Symposium on Cloud Computing
SP - 611
EP - 623
BT - SoCC 2017 - Proceedings of the 2017 Symposium on Cloud Computing
PB - Association for Computing Machinery, Inc
T2 - 2017 Symposium on Cloud Computing, SoCC 2017
Y2 - 24 September 2017 through 27 September 2017
ER -