Fast and Accurate Load Balancing for Geo-Distributed Storage Systems

Kirill L. Bogdanov, Waleed Reda, Gerald Q. Maguire, Dejan Kostić, Marco Canini

Research output: Chapter in Book/Report/Conference proceedingConference contribution

14 Scopus citations

Abstract

The increasing density of globally distributed datacenters reduces the network latency between neighboring datacenters and allows replicated services deployed across neighboring locations to share workload when necessary, without violating strict Service Level Objectives (SLOs). We present Kurma, a practical implementation of a fast and accurate load balancer for geo-distributed storage systems. At run-time, Kurma integrates network latency and service time distributions to accurately estimate the rate of SLO violations for requests redirected across geo-distributed datacenters. Using these estimates, Kurma solves a decentralized rate-based performance model enabling fast load balancing (in the order of seconds) while taming global SLO violations. We integrate Kurma with Cassandra, a popular storage system. Using real-world traces along with a geo-distributed deployment across Amazon EC2, we demonstrate Kurma’s ability to effectively share load among datacenters while reducing SLO violations by up to a factor of 3 in high load settings or reducing the cost of running the service by up to 17%.
Original languageEnglish (US)
Title of host publicationProceedings of the ACM Symposium on Cloud Computing - SoCC '18
PublisherAssociation for Computing Machinery (ACM)
Pages386-400
Number of pages15
ISBN (Print)9781450360111
DOIs
StatePublished - Sep 28 2018

Fingerprint

Dive into the research topics of 'Fast and Accurate Load Balancing for Geo-Distributed Storage Systems'. Together they form a unique fingerprint.

Cite this