A Method For Generating Aggregated Associations Between Discrete Data Features
Price
Free (open access)
Volume
35
Pages
10
Published
2005
Size
464 kb
Paper DOI
10.2495/DATA050281
Copyright
WIT Press
Author(s)
E. Titova, D. Sitnikov, O. Ryabov, B. D’Cruz
Abstract
The existing algorithms for discovering association rules in databases have been developed mainly for binary features (i.e. a feature is either present in a transaction or not). Nevertheless, in real databases it often happens that a feature can take on values from an arbitrary finite set. Such features are considered to be categorical values. In many cases continuous data is binned to obtain a set of discrete values. We suggest an algorithm for discovering generalised association rules in databases containing categorical data. This algorithm is based on building a tree of covers which allows us to generate associations with necessary support, and also to take into account updated records. To discover generalised associations we use a method of merging different branches of the cover tree for given values of a feature. The resulting rules look as follows: if the feature X takes on values from the set {a1,a2,...ai}, then feature Y takes on values from the set {b1,...,bk}. This is beneficial for small discrete feature domains. Keywords: data mining, generalized association rules, finite predicate. 1 Introduction Discovering association rules is a typical task for Data Mining. The process of generating associations consists of obtaining rules in the implicative form. Originally this task appeared in the marketing area (so called \“market basket
Keywords
data mining, generalized association rules, finite predicate.