Eamonn Keogh
Information and Computer Science
University of California, Irvine
Irvine, CA 92697-3425
E-mail: eamonn@ics.uci.edu
Phone: 949-824-9630
Fax: 949-824-4056
Michael J. Pazzani
Information and Computer Science
University of California, Irvine
Irvine, CA 92697-3425
E-mail: pazzani@ics.uci.edu
Phone: 949-824-5888
Fax: 949-824-4056
The naïve Bayes classifier is built on the assumption of conditional independence between the attributes given the class. The algorithm has been shown to be surprisingly robust to obvious violations of this condition, but it is natural to ask whether its accuracy can be improved further by relaxing this assumption. We examine an approach in which naïve Bayes is augmented by the addition of correlation arcs between attributes. We explore two methods for finding the set of augmenting arcs: a greedy hill-climbing search, and a novel, more computationally efficient algorithm that we call SuperParent. We compare these methods to TAN, a state-of-the-art distribution-based approach to finding the augmenting arcs.
Keywords: Bayesian networks, Search, Classification
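
For orientation, the conditional independence assumption named in the abstract, and the way an augmenting arc relaxes it, can be written out as follows. This is a sketch in assumed notation, not notation taken from the paper: C is the class, A_1, ..., A_n are the attributes, and \pi(i) is a hypothetical index for the attribute that an augmenting arc makes a parent of A_i. The restriction to at most one attribute parent per attribute is likewise an assumption made here for simplicity.

```latex
% Naive Bayes: every attribute depends only on the class.
P(C \mid A_1, \dots, A_n) \;\propto\; P(C) \prod_{i=1}^{n} P(A_i \mid C)

% Augmented naive Bayes (assumed form): an arc A_{\pi(i)} \to A_i adds one
% attribute parent to A_i. When A_i has no augmenting arc, its factor
% reduces to the naive Bayes term P(A_i \mid C).
P(C \mid A_1, \dots, A_n) \;\propto\; P(C) \prod_{i=1}^{n} P(A_i \mid C, A_{\pi(i)})
```

Under this reading, finding the set of augmenting arcs amounts to choosing \pi, and with no arcs at all the second expression is identical to the first.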