Showing all 10 results
Peer reviewed
Brusco, Michael J.; Singh, Renu; Steinley, Douglas – Psychometrika, 2009
The selection of a subset of variables from a pool of candidates is an important problem in several areas of multivariate statistics. Within the context of principal component analysis (PCA), a number of authors have argued that subset selection is crucial for identifying those variables that are required for correct interpretation of the…
Descriptors: Heuristics, Multivariate Analysis, Mathematics, School Personnel
Peer reviewed
Brusco, Michael J.; Kohn, Hans-Friedrich; Stahl, Stephanie – Psychometrika, 2008
Dynamic programming methods for matrix permutation problems in combinatorial data analysis can produce globally-optimal solutions for matrices up to size 30x30, but are computationally infeasible for larger matrices because of enormous computer memory requirements. Branch-and-bound methods also guarantee globally-optimal solutions, but computation…
Descriptors: Heuristics, Programming, Data Analysis, Matrices
Peer reviewed
Brusco, Michael J.; Steinley, Douglas – Psychometrika, 2007
Perhaps the most common criterion for partitioning a data set is the minimization of the within-cluster sums of squared deviations from cluster centroids. Although optimal solution procedures for within-cluster sums of squares (WCSS) partitioning are computationally feasible for small data sets, heuristic procedures are required for most practical…
Descriptors: Heuristics, Behavioral Sciences, Mathematics, Item Response Theory
Peer reviewed
Brusco, Michael J.; Kohn, Hans-Friedrich – Psychometrika, 2009
The clique partitioning problem (CPP) requires the establishment of an equivalence relation for the vertices of a graph such that the sum of the edge costs associated with the relation is minimized. The CPP has important applications for the social sciences because it provides a framework for clustering objects measured on a collection of nominal…
Descriptors: Evaluation, Heuristics, Social Sciences, Problem Solving
Peer reviewed
Brusco, Michael J.; Kohn, Hans-Friedrich – Psychometrika, 2009
Several authors have touted the p-median model as a plausible alternative to within-cluster sums of squares (i.e., K-means) partitioning. Purported advantages of the p-median model include the provision of "exemplars" as cluster centers, robustness with respect to outliers, and the accommodation of a diverse range of similarity data. We developed…
Descriptors: Teaching Methods, Experiments, Computational Linguistics, Simulation
Peer reviewed
Brusco, Michael J.; Cradit, J. Dennis – Psychometrika, 2001
Presents a variable selection heuristic for nonhierarchical (K-means) cluster analysis based on the adjusted Rand index for measuring cluster recovery. Subjected the heuristic to Monte Carlo testing across more than 2,200 datasets. Results indicate that the heuristic is extremely effective at eliminating masking variables. (SLD)
Descriptors: Cluster Analysis, Heuristics, Monte Carlo Methods, Selection
Peer reviewed
Brusco, Michael J.; Kohn, Hans-Friedrich – Psychometrika, 2008
Although the K-means algorithm for minimizing the within-cluster sums of squared deviations from cluster centroids is perhaps the most common method for applied cluster analyses, a variety of other criteria are available. The p-median model is an especially well-studied clustering problem that requires the selection of p objects to serve as…
Descriptors: Telecommunications, Item Response Theory, Multivariate Analysis, Heuristics
Peer reviewed
Brusco, Michael J. – Psychometrika, 2006
Minimization of the within-cluster sums of squares (WCSS) is one of the most important optimization criteria in cluster analysis. Although cluster analysis modules in commercial software packages typically use heuristic methods for this criterion, optimal approaches can be computationally feasible for problems of modest size. This paper presents a…
Descriptors: Multivariate Analysis, Evaluation Criteria, Heuristics, Problem Solving
Peer reviewed
Brusco, Michael J.; Stahl, Stephanie – Psychometrika, 2001
Describes an interactive procedure for multiobjective asymmetric unidimensional seriation problems that uses a dynamic-programming algorithm to generate partially the efficient set of sequences for small to medium-sized problems and a multioperational heuristic to estimate the efficient set for larger problems. Applies the procedure to an…
Descriptors: Algorithms, Data Analysis, Estimation (Mathematics), Heuristics
Peer reviewed
Brusco, Michael J. – Psychological Methods, 2004
A number of important applications require the clustering of binary data sets. Traditional nonhierarchical cluster analysis techniques, such as the popular K-means algorithm, can often be successfully applied to these data sets. However, the presence of masking variables in a data set can impede the ability of the K-means algorithm to recover the…
Descriptors: Mathematics, Multivariate Analysis, Statistical Data, Statistical Analysis