Publication Date

In 2022 | 0 |

Since 2021 | 0 |

Since 2018 (last 5 years) | 0 |

Since 2013 (last 10 years) | 1 |

Since 2003 (last 20 years) | 4 |

Descriptor

Source

Educational and Psychological… | 20 |

Author

Wilcox, Rand R. | 20 |

Serang, Sarfaraz | 1 |

Publication Type

Journal Articles | 20 |

Reports - Research | 14 |

Reports - Evaluative | 5 |

Reports - Descriptive | 2 |

Numerical/Quantitative Data | 1 |

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Wilcox, Rand R.; Serang, Sarfaraz – Educational and Psychological Measurement, 2017

The article provides perspectives on p values, null hypothesis testing, and alternative techniques in light of modern robust statistical methods. Null hypothesis testing and "p" values can provide useful information provided they are interpreted in a sound manner, which includes taking into account insights and advances that have…

Descriptors: Hypothesis Testing, Bayesian Statistics, Computation, Effect Size

Wilcox, Rand R. – Educational and Psychological Measurement, 2006

Consider the nonparametric regression model Y = m(X)+ [tau](X)[epsilon], where X and [epsilon] are independent random variables, [epsilon] has a median of zero and variance [sigma][squared], [tau] is some unknown function used to model heteroscedasticity, and m(X) is an unknown function reflecting some conditional measure of location associated…

Descriptors: Nonparametric Statistics, Mathematical Models, Regression (Statistics), Probability

Wilcox, Rand R. – Educational and Psychological Measurement, 2006

For two random variables, X and Y, let D = X - Y, and let theta[subscript x], theta[subscript y], and theta[subscript d] be the corresponding medians. It is known that the Wilcoxon-Mann-Whitney test and its modern extensions do not test H[subscript o] : theta[subscript x] = theta[subscript y], but rather, they test H[subscript o] : theta[subscript…

Descriptors: Scores, Inferences, Comparative Analysis, Statistical Analysis

Wilcox, Rand R. – Educational and Psychological Measurement, 2005

It is known that nonnormality, a heteroscedastic error term, or a nonlinear association can create serious practical problems when using the conventional analysis of covariance (ANCOVA) method. This article describes a simple ANCOVA method that allows heteroscedasticity, nonnormality, nonlinearity, and multiple covariates. When standard…

Descriptors: Statistical Analysis, Error of Measurement, Measurement Techniques

Peer reviewed

Wilcox, Rand R. – Educational and Psychological Measurement, 1983

When comparing k normal populations an investigator might want to know the probability that the population with the largest population mean will have the largest sample mean. This paper describes and illustrates methods of approximating this probability when the variances are unknown and possibly unequal. (Author/BW)

Descriptors: Data Analysis, Hypothesis Testing, Mathematical Formulas, Probability

Peer reviewed

Wilcox, Rand R. – Educational and Psychological Measurement, 1982

Results in the engineering literature on "k out of n system reliability" can be used to characterize tests based on estimates of the probability of correctly determining whether the examinee knows the correct response. In particular, the minimum number of distractors required for multiple-choice tests can be empirically determined.…

Descriptors: Achievement Tests, Mathematical Models, Multiple Choice Tests, Test Format

Peer reviewed

Wilcox, Rand R. – Educational and Psychological Measurement, 1983

This article provides unbiased estimates of the proportion of items in an item domain that an examinee would answer correctly if every item were attempted, when a closed sequential testing procedure is used. (Author)

Descriptors: Estimation (Mathematics), Psychometrics, Scores, Sequential Approach

Peer reviewed

Wilcox, Rand R. – Educational and Psychological Measurement, 1982

When determining criterion-referenced test length, problems of guessing are shown to be more serious than expected. A new method of scoring is presented that corrects for guessing without assuming that guessing is random. Empirical investigations of the procedure are examined. Test length can be substantially reduced. (Author/CM)

Descriptors: Criterion Referenced Tests, Guessing (Tests), Multiple Choice Tests, Scoring

Peer reviewed

Wilcox, Rand R. – Educational and Psychological Measurement, 1981

A formal framework is presented for determining which of the distractors of multiple-choice test items has a small probability of being chosen by a typical examinee. The framework is based on a procedure similar to an indifference zone formulation of a ranking and election problem. (Author/BW)

Descriptors: Mathematical Models, Multiple Choice Tests, Probability, Test Items

Peer reviewed

Wilcox, Rand R. – Educational and Psychological Measurement, 1997

Some results on how the Alexander-Govern heteroscedastic analysis of variance (ANOVA) procedure (R. Alexander and D. Govern, 1994) performs under nonnormality are presented. This method can provide poor control of Type I errors in some cases, and in some situations power decreases as differences among the means get large. (SLD)

Descriptors: Analysis of Variance, Error of Measurement, Power (Statistics), Statistical Distributions

Peer reviewed

Wilcox, Rand R. – Educational and Psychological Measurement, 1981

This paper describes and compares procedures for estimating the reliability of proficiency tests that are scored with latent structure models. Results suggest that the predictive estimate is the most accurate of the procedures. (Author/BW)

Descriptors: Criterion Referenced Tests, Scoring, Test Reliability

Peer reviewed

Wilcox, Rand R. – Educational and Psychological Measurement, 1980

Using three sets of real data, a comparison of four discrete discriminate analysis procedures is made using the actual versus the optimal error rate. The kernel method gives the most accurate results in all three cases. (Author/RL)

Descriptors: Achievement Tests, Comparative Analysis, Discriminant Analysis, Error of Measurement

Peer reviewed

Wilcox, Rand R. – Educational and Psychological Measurement, 1981

The paper considers the problem of selecting the t best of k normal populations and simultaneously determining whether the selected populations have a mean larger than a known standard. Illustrations are given for selecting the t best of k examinees when the binomial error model applies. (Author)

Descriptors: Competitive Selection, Criterion Referenced Tests, Decision Making, Mathematical Models

Peer reviewed

Wilcox, Rand R. – Educational and Psychological Measurement, 1979

In many situations in education and psychology it is desired to select from k binomial populations the one having the largest probability of success. This paper describes a two-stage procedure for accomplishing this goal. (Author/CTM)

Descriptors: Probability, Sampling, Statistical Analysis

Peer reviewed

Wilcox, Rand R. – Educational and Psychological Measurement, 1979

The classical estimate of a binomial probability function is to estimate its mean in the usual manner and to substitute the results in the appropriate expression. Two alternative estimation procedures are described and examined. Emphasis is given to the single administration estimate of the mastery test reliability. (Author/CTM)

Descriptors: Cutting Scores, Mastery Tests, Probability, Scores

Previous Page | Next Page ยป

Pages: 1 | **2**