Mining Binary Attributes
Price
Free (open access)
Volume
22
Pages
15
Published
1998
Size
1,407 kb
Paper DOI
10.2495/DATA980091
Copyright
WIT Press
Author(s)
Joachim P.P. Costa
Abstract
In this work, we consider the case where we have categorical attributes with a huge number of values in a prediction context. In particular, the method- ology introduced here concerns the use of these attributes in binary decision trees; nevertheless, it is applicable to other prediction methods. The main idea consists in extracting ("mining") the most predictive binary attributes, from the set of initial attributes ("mine"). In order to do this, we consider three different operations: hierarchical clustering, multiplication, and factorization. The first operation, hierarchical clustering, serves for reducing the number of values of a categorical attribute. In fact, by ap- plying the hierarchical clustering method AVL [6, 7] to the set of values of a categorical attribute, we can group these values into clusters. Then, we can
Keywords