TY - GEN
T1 - Fast and Accurate Load Balancing for Geo-Distributed Storage Systems
AU - Bogdanov, Kirill L.
AU - Reda, Waleed
AU - Maguire, Gerald Q.
AU - Kostić, Dejan
AU - Canini, Marco
N1 - KAUST Repository Item: Exported on 2020-10-01
PY - 2018/9/28
Y1 - 2018/9/28
N2 - The increasing density of globally distributed datacenters reduces the network latency between neighboring datacenters and allows replicated services deployed across neighboring locations to share workload when necessary, without violating strict Service Level Objectives (SLOs).
We present Kurma, a practical implementation of a fast and accurate load balancer for geo-distributed storage systems. At run-time, Kurma integrates network latency and service time distributions to accurately estimate the rate of SLO violations for requests redirected across geo-distributed datacenters. Using these estimates, Kurma solves a decentralized rate-based performance model enabling fast load balancing (in the order of seconds) while taming global SLO violations. We integrate Kurma with Cassandra, a popular storage system. Using real-world traces along with a geo-distributed deployment across Amazon EC2, we demonstrate Kurma’s ability to effectively share load among datacenters while reducing SLO violations by up to a factor of 3 in high load settings or reducing the cost of running the service by up to 17%.
AB - The increasing density of globally distributed datacenters reduces the network latency between neighboring datacenters and allows replicated services deployed across neighboring locations to share workload when necessary, without violating strict Service Level Objectives (SLOs).
We present Kurma, a practical implementation of a fast and accurate load balancer for geo-distributed storage systems. At run-time, Kurma integrates network latency and service time distributions to accurately estimate the rate of SLO violations for requests redirected across geo-distributed datacenters. Using these estimates, Kurma solves a decentralized rate-based performance model enabling fast load balancing (in the order of seconds) while taming global SLO violations. We integrate Kurma with Cassandra, a popular storage system. Using real-world traces along with a geo-distributed deployment across Amazon EC2, we demonstrate Kurma’s ability to effectively share load among datacenters while reducing SLO violations by up to a factor of 3 in high load settings or reducing the cost of running the service by up to 17%.
UR - http://hdl.handle.net/10754/630817
UR - https://dl.acm.org/doi/10.1145/3267809.3267820
UR - http://www.scopus.com/inward/record.url?scp=85059006718&partnerID=8YFLogxK
U2 - 10.1145/3267809.3267820
DO - 10.1145/3267809.3267820
M3 - Conference contribution
SN - 9781450360111
SP - 386
EP - 400
BT - Proceedings of the ACM Symposium on Cloud Computing - SoCC '18
PB - Association for Computing Machinery (ACM)
ER -