AutoClass
Autoclass - A Bayesian Approach to Classification. We describe a Bayesian approach to the unsupervised discovery of classes in a set of cases, sometimes called finite mixture separation or clustering. The main difference between clustering and our approach is that we search for the “best” set of class descriptions rather than grouping the cases themselves. We describe our classes in terms of probability distribution or density functions, and the locally maximal posterior probability parameters. We rate our classifications with an approximate posterior probability of the distribution function w.r.t. the data, obtained by marginalizing over all the parameters. Approximation is necessitated by the computational complexity of the joint probability, and our marginalization is w.r.t. a local maxima in the parameter space. This posterior probability rating allows direct comparison of alternate density functions that differ in number of classes and/or individual class density functions. We discuss the rationale behind our approach to classification. We give the mathematical development for the basic mixture model, describe the approximations needed for computational tractability, give some specifics of models for several common attribute types, and describe some of the results achieved by the AutoClass program..
Keywords for this software
References in zbMATH (referenced in 70 articles )
Showing results 61 to 70 of 70.
Sorted by year (- Mastroianni, Carlo; Talia, Domenico; Trunfio, Paolo: Metadata for managing grid resources in data mining applications (2004) ioport
- Zhang, Nevin L.: Hierarchical latent class models for cluster analysis (2004)
- Sebastiani, Paola; Gussoni, Emanuela; Kohane, Isaac S.; Ramoni, Marco F.: Statistical challenges in functional genomics. (With comments and a rejoinder). (2003)
- Fan, Jianhua; Li, Deyi: An overview of data mining and knowledge discovery (1998)
- Gyllenberg, Mats; Koski, Timo; Verlaan, Martin: Classification of binary vectors by stochastic complexity. (1997)
- Stutz, John; Cheeseman, Peter: Autoclass -- a Bayesian approach to classification (1996)
- Cook, Diane J.; Holder, Lawrence B.; Djoko, Surnjani: Knowledge discovery from structural data. (1995) ioport
- Zhong, Ning; Ohsuga, Setsuo: KOSI --- an integrated system for discovering functional relations from databases. (1995) ioport
- Neal, Radford M.: Connectionist learning of belief networks (1992)
- Fisher, Douglas H.; Chan, Philip K.: Statistical guidance in symbolic learning. (1990) ioport