TY - GEN

T1 - Common Association Rules for Dispersed Information Systems

AU - Moshkov, Mikhail

AU - Zielosko, Beata

AU - Tetteh, Evans Teiko

N1 - KAUST Repository Item: Exported on 2022-12-14
Acknowledgements: Research reported in this publication was supported by King Abdullah University of Science and Technology (KAUST).

PY - 2022/10/19

Y1 - 2022/10/19

N2 - Association rules are popular form for knowledge discovery domain. They are used for finding interesting relationships and patterns hidden in large data sets and in the area of associative classification, where usually rules with one item in the right-hand side are considered. There are many different approaches and algorithms for mining association rules. One of the most popular group are methods which are based on mining frequent itemsets, usually applied for data in transaction format. Such data can be transformed to binary information system which corresponds to matrix data format. Technological development means that we are dealing with an increasing amount of data that can be heterogeneous, taking into account their format and location. In this paper, we assume that dispersed data is represented by a finite set S of information systems with equal sets of attributes. We discuss one of the possible ways to the study association rules common to all information systems from the set S: building a joint information system for which the set of true association rules that are realizable for a given row r and have given attribute f on the right-hand side coincides with the set of association rules that are true for all information systems from S, are realizable for the row r, and have the attribute f on the right-hand side. We show how to build a joint information system in a polynomial time. When we build such an information system, we can apply to it various association rule learning algorithms.

AB - Association rules are popular form for knowledge discovery domain. They are used for finding interesting relationships and patterns hidden in large data sets and in the area of associative classification, where usually rules with one item in the right-hand side are considered. There are many different approaches and algorithms for mining association rules. One of the most popular group are methods which are based on mining frequent itemsets, usually applied for data in transaction format. Such data can be transformed to binary information system which corresponds to matrix data format. Technological development means that we are dealing with an increasing amount of data that can be heterogeneous, taking into account their format and location. In this paper, we assume that dispersed data is represented by a finite set S of information systems with equal sets of attributes. We discuss one of the possible ways to the study association rules common to all information systems from the set S: building a joint information system for which the set of true association rules that are realizable for a given row r and have given attribute f on the right-hand side coincides with the set of association rules that are true for all information systems from S, are realizable for the row r, and have the attribute f on the right-hand side. We show how to build a joint information system in a polynomial time. When we build such an information system, we can apply to it various association rule learning algorithms.

UR - http://hdl.handle.net/10754/686399

UR - https://linkinghub.elsevier.com/retrieve/pii/S1877050922014223

UR - http://www.scopus.com/inward/record.url?scp=85143348717&partnerID=8YFLogxK

U2 - 10.1016/j.procs.2022.09.525

DO - 10.1016/j.procs.2022.09.525

M3 - Conference contribution

SP - 4613

EP - 4620

BT - Procedia Computer Science

PB - Elsevier BV

ER -