TY - GEN
T1 - Multi-scale real-time grid monitoring with job stream mining
AU - Zhang, Xiangliang
AU - Sebag, Michèle
AU - Cécile, Germain Renaud
N1 - Generated from Scopus record by KAUST IRTS on 2023-09-21
PY - 2009
Y1 - 2009
N2 - The ever increasing scale and complexity of large computational systems ask for sophisticated management tools, paving the way toward Autonomic Computing. A first step toward Autonomic Grids is presented in this paper; the interactions between the grid middleware and the stream of computational queries are modeled using statistical learning. The approach is implemented and validated in the context of the EGEE grid. The GSTRAP system, embedding the STRAP Data Streaming algorithm, provides manageable and understandable views of the computational workload based on gLite reporting services. An online monitoring module shows the instant distribution of the jobs in real-time and its dynamics, enabling anomaly detection. An offline monitoring module provides the administrator with a consolidated view of the workload, enabling the visual inspection of its long-term trends.
AB - The ever increasing scale and complexity of large computational systems ask for sophisticated management tools, paving the way toward Autonomic Computing. A first step toward Autonomic Grids is presented in this paper; the interactions between the grid middleware and the stream of computational queries are modeled using statistical learning. The approach is implemented and validated in the context of the EGEE grid. The GSTRAP system, embedding the STRAP Data Streaming algorithm, provides manageable and understandable views of the computational workload based on gLite reporting services. An online monitoring module shows the instant distribution of the jobs in real-time and its dynamics, enabling anomaly detection. An offline monitoring module provides the administrator with a consolidated view of the workload, enabling the visual inspection of its long-term trends.
UR - http://www.scopus.com/inward/record.url?scp=70349743748&partnerID=8YFLogxK
U2 - 10.1109/CCGRID.2009.20
DO - 10.1109/CCGRID.2009.20
M3 - Conference contribution
AN - SCOPUS:70349743748
SN - 9780769536224
T3 - 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, CCGRID 2009
SP - 420
EP - 427
BT - 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, CCGRID 2009
T2 - 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, CCGRID 2009
Y2 - 18 May 2009 through 21 May 2009
ER -