ERIC Number: EJ789912
Record Type: Journal
Publication Date: 2008-Mar
Reference Count: 0
Selection of Variables in Cluster Analysis: An Empirical Comparison of Eight Procedures
Steinley, Douglas; Brusco, Michael J.
Psychometrika, v73 n1 p125-144 Mar 2008
Eight different variable selection techniques for model-based and non-model-based clustering are evaluated across a wide range of cluster structures. It is shown that several methods have difficulties when non-informative variables (i.e., random noise) are included in the model. Furthermore, the distribution of the random noise greatly impacts the performance of nearly all of the variable selection procedures. Overall, a variable selection technique based on a variance-to-range weighting procedure coupled with the largest decreases in within-cluster sums of squares error performed the best. On the other hand, variable selection methods used in conjunction with finite mixture models performed the worst.
Springer. 233 Spring Street, New York, NY 10013. Tel: 800-777-4643; Tel: 212-460-1500; Fax: 212-348-4505; e-mail: email@example.com; Web site: http://www.springerlink.com
Publication Type: Journal Articles; Reports - Evaluative
Education Level: N/A
Authoring Institution: N/A