TY - GEN
T1 - Multithreaded Asynchronous Graph Traversal for In-Memory and Semi-External Memory
AU - Pearce, Roger
AU - Gokhale, Maya
AU - Amato, Nancy M.
N1 - KAUST Repository Item: Exported on 2020-10-01
Acknowledged KAUST grant number(s): KUS-C1–016-04
Acknowledgements: This work was partially performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52–07NA27344 (LLNL-CONF-427572). Funding partially provided by LDRD 07-ERD-063. Portions of experiments were performed at the Livermore Computing facility resources. This research supported in part by NSF awards CRI-0551685, CCF-0833199, CCF-0830753, IIS-096053, IIS-0317266, by NSF/DNDO award 2008-DN-077-ARI018–02, by the DOE NNSA under the Predictive Science Academic Alliances Program by grant DE-FC52–08NA28616, by THECB NHARP grant 000512–0097-2009, by Chevron, IBM, Intel, HP, Oracle/Sun and by King Abdullah University of Science and Technology (KAUST) Award KUS-C1–016-04 Pearce is supported in part by a Lawrence Scholar Fellowship and a Dept. of Education Graduate Fellowship (GAANN).
This publication acknowledges KAUST support, but has no KAUST affiliated authors.
PY - 2010/11
Y1 - 2010/11
N2 - Processing large graphs is becoming increasingly important for many domains such as social networks, bioinformatics, etc. Unfortunately, many algorithms and implementations do not scale with increasing graph sizes. As a result, researchers have attempted to meet the growing data demands using parallel and external memory techniques. We present a novel asynchronous approach to compute Breadth-First-Search (BFS), Single-Source-Shortest-Paths, and Connected Components for large graphs in shared memory. Our highly parallel asynchronous approach hides data latency due to both poor locality and delays in the underlying graph data storage. We present an experimental study applying our technique to both In-Memory and Semi-External Memory graphs utilizing multi-core processors and solid-state memory devices. Our experiments using synthetic and real-world datasets show that our asynchronous approach is able to overcome data latencies and provide significant speedup over alternative approaches. For example, on billion vertex graphs our asynchronous BFS scales up to 14x on 16-cores. © 2010 IEEE.
AB - Processing large graphs is becoming increasingly important for many domains such as social networks, bioinformatics, etc. Unfortunately, many algorithms and implementations do not scale with increasing graph sizes. As a result, researchers have attempted to meet the growing data demands using parallel and external memory techniques. We present a novel asynchronous approach to compute Breadth-First-Search (BFS), Single-Source-Shortest-Paths, and Connected Components for large graphs in shared memory. Our highly parallel asynchronous approach hides data latency due to both poor locality and delays in the underlying graph data storage. We present an experimental study applying our technique to both In-Memory and Semi-External Memory graphs utilizing multi-core processors and solid-state memory devices. Our experiments using synthetic and real-world datasets show that our asynchronous approach is able to overcome data latencies and provide significant speedup over alternative approaches. For example, on billion vertex graphs our asynchronous BFS scales up to 14x on 16-cores. © 2010 IEEE.
UR - http://hdl.handle.net/10754/598920
UR - http://ieeexplore.ieee.org/document/5644845/
UR - http://www.scopus.com/inward/record.url?scp=78650808887&partnerID=8YFLogxK
U2 - 10.1109/sc.2010.34
DO - 10.1109/sc.2010.34
M3 - Conference contribution
SN - 9781424475575
BT - 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
PB - Institute of Electrical and Electronics Engineers (IEEE)
ER -