High Performance Computing (HPC) is becoming a common tool in many research
areas. Social influence (e.g., project collaboration) among increasing users of HPC
systems creates bursty behavior in underlying workloads. This bursty behavior is
increasingly common with the advent of grid computing and cloud computing. Mining
the user bursty behavior is important for HPC workloads prediction and scheduling,
which has direct impact on overall HPC computing performance.
A representative work in this area is the Mixed User Group Model (MUGM),
which clusters users according to the resource demand features of their submissions,
such as duration time and parallelism. However, MUGM has some difficulties when
implemented in real-world system. First, representing user behaviors by the features
of their resource demand is usually difficult. Second, these features are not always
available. Third, measuring the similarities among users is not a well-defined problem.
In this work, we propose a Social Influence Model (SIM) to identify, analyze,
and quantify the level of social influence across HPC users. The advantage of the
SIM model is that it finds HPC communities by analyzing user job submission time, thereby avoiding the difficulties of MUGM. An offline algorithm and a fast-converging,
computationally-efficient online learning algorithm for identifying social groups are
proposed. Both offline and online algorithms are applied on several HPC and grid
workloads, including Grid 5000, EGEE 2005 and 2007, and KAUST Supercomputing
Lab (KSL) BGP data. From the experimental results, we show the existence of a social
graph, which is characterized by a pattern of dominant users and followers. In order
to evaluate the effectiveness of identified user groups, we show the pattern discovered
by the offline algorithm follows a power-law distribution, which is consistent with
those observed in mainstream social networks. We finally conclude the thesis and
discuss future directions of our work.
Date of Award | Jun 2011 |
---|
Original language | English (US) |
---|
Awarding Institution | - Computer, Electrical and Mathematical Sciences and Engineering
|
---|
Supervisor | David Keyes (Supervisor) |
---|