We consider the problem of discovering
attributes, or properties, accounting for the a-priori stated abnormality of a
group of anomalous individuals (the outliers) with respect to an overall given
population (the inliers). To this aim, we introduce the notion of exceptional
property and define the concept of exceptionality score, which measures the
significance of a property. In particular, in order to single out exceptional
properties, we resort to a form of minimum distance estimation for evaluating
the badness of fit of the values assumed by the outliers compared to the
probability distribution associated with the values assumed by the inliers.
Suitable exceptionality scores are introduced for both numeric and categorical
attributes. These scores are, both from the analytical and the empirical point
of view, designed to be effective for small samples, as it is the case for
outliers. We present an algorithm, called EXPREX, for efficiently discovering
exceptional properties. The algorithm is able to reduce the needed
computational effort by exploring only relevant numerical intervals and by exploiting
suitable pruning rules. The experimental results confirm that our technique is
able to provide knowledge characterizing outliers in a natural manner.
Keywords:- IEEE Project Titles 2012, Data Mining Titles, Cloud Computing Titles, Networking Titles.
No comments:
Post a Comment