Showing 1 to 15 of 221 results
Peer reviewed
Widaman, Keith F. – Educational and Psychological Measurement, 2023
The import or force of the result of a statistical test has long been portrayed as consistent with deductive reasoning. The simplest form of deductive argument has a first premise with conditional form, such as p → q, which means that "if p is true, then q must be true." Given the first premise, one can either affirm or deny…
Descriptors: Hypothesis Testing, Statistical Analysis, Logical Thinking, Probability
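The conditional form the abstract describes can be checked mechanically. The sketch below (a generic illustration, not code from the article) enumerates all truth assignments to verify that modus tollens — from p → q and not-q, conclude not-p — has no counterexample, which is the deductive pattern often invoked as an analogy for rejecting a null hypothesis:

```python
from itertools import product

def implies(p: bool, q: bool) -> bool:
    """Material conditional: 'if p then q' is false only when p is true and q is false."""
    return (not p) or q

# Modus tollens is valid iff no truth assignment makes both premises
# (p -> q, not q) true while the conclusion (not p) is false.
counterexamples = [
    (p, q)
    for p, q in product([True, False], repeat=2)
    if implies(p, q) and (not q) and p  # premises hold, conclusion fails
]
print(counterexamples)  # an empty list means the argument form is valid
```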
Peer reviewed
Cooperman, Allison W.; Weiss, David J.; Wang, Chun – Educational and Psychological Measurement, 2022
Adaptive measurement of change (AMC) is a psychometric method for measuring intra-individual change on one or more latent traits across testing occasions. Three hypothesis tests--a Z test, likelihood ratio test, and score ratio index--have demonstrated desirable statistical properties in this context, including low false positive rates and high…
Descriptors: Error of Measurement, Psychometrics, Hypothesis Testing, Simulation
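A Z test for intra-individual change can take a simple textbook form: the difference between two trait estimates divided by the standard error of that difference. The sketch below assumes independent measurement errors across occasions and is only a generic illustration — the AMC statistics studied in the article may differ in detail:

```python
import math

def change_z(theta1: float, se1: float, theta2: float, se2: float) -> float:
    """Z statistic for change between two latent-trait estimates,
    assuming independent errors across testing occasions
    (a textbook form, not necessarily the article's exact statistic)."""
    return (theta2 - theta1) / math.sqrt(se1**2 + se2**2)

# Example: an examinee's estimate rises from 0.0 to 0.9 with SE 0.3 each time.
z = change_z(theta1=0.0, se1=0.3, theta2=0.9, se2=0.3)
print(round(z, 3))  # about 2.121, significant at the conventional 1.96 cutoff
```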
Peer reviewed
Guastadisegni, Lucia; Cagnone, Silvia; Moustaki, Irini; Vasdekis, Vassilis – Educational and Psychological Measurement, 2022
This article studies the Type I error, false positive rates, and power of four versions of the Lagrange multiplier test to detect measurement noninvariance in item response theory (IRT) models for binary data under model misspecification. The tests considered are the Lagrange multiplier test computed with the Hessian and cross-product approach,…
Descriptors: Measurement, Statistical Analysis, Item Response Theory, Test Items
Peer reviewed
Elliott, Mark; Buttery, Paula – Educational and Psychological Measurement, 2022
We investigate two non-iterative estimation procedures for Rasch models, the pair-wise estimation procedure (PAIR) and the Eigenvector method (EVM), and identify theoretical issues with EVM for rating scale model (RSM) threshold estimation. We develop a new procedure to resolve these issues--the conditional pairwise adjacent thresholds procedure…
Descriptors: Item Response Theory, Rating Scales, Computation, Simulation
Peer reviewed
Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2019
M-fluctuation tests are a recently proposed method for detecting differential item functioning in Rasch models. This article discusses a generalization of this method to two additional item response theory models: the two-parametric logistic model and the three-parametric logistic model with a common guessing parameter. The Type I error rate and…
Descriptors: Test Bias, Item Response Theory, Statistical Analysis, Maximum Likelihood Statistics
Peer reviewed
Xia, Yan; Green, Samuel B.; Xu, Yuning; Thompson, Marilyn S. – Educational and Psychological Measurement, 2019
Past research suggests revised parallel analysis (R-PA) tends to yield relatively accurate results in determining the number of factors in exploratory factor analysis. R-PA can be interpreted as a series of hypothesis tests. At each step in the series, a null hypothesis is tested that an additional factor accounts for zero common variance among…
Descriptors: Effect Size, Factor Analysis, Hypothesis Testing, Psychometrics
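Revised parallel analysis builds on the classic Horn procedure: retain a factor only while the observed eigenvalue exceeds the corresponding eigenvalue from random data of the same dimensions. The sketch below implements only the classic baseline, not R-PA itself, as a point of reference for the series-of-tests framing in the abstract:

```python
import numpy as np

def parallel_analysis(data: np.ndarray, n_sims: int = 200, seed: int = 0) -> int:
    """Classic Horn-style parallel analysis: count factors whose observed
    correlation-matrix eigenvalues exceed the mean eigenvalues of random
    normal data of the same shape. (R-PA, as studied in the article,
    revises this baseline; this is only the classic variant.)"""
    rng = np.random.default_rng(seed)
    n, p = data.shape
    obs = np.sort(np.linalg.eigvalsh(np.corrcoef(data, rowvar=False)))[::-1]
    sim = np.zeros(p)
    for _ in range(n_sims):
        r = rng.standard_normal((n, p))
        sim += np.sort(np.linalg.eigvalsh(np.corrcoef(r, rowvar=False)))[::-1]
    sim /= n_sims
    return int(np.sum(obs > sim))

# Synthetic data: 6 indicators loading ~0.8 on a single common factor.
rng = np.random.default_rng(42)
f = rng.standard_normal((500, 1))
x = 0.8 * f + 0.6 * rng.standard_normal((500, 6))
k = parallel_analysis(x)  # should recover the single factor
```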
Peer reviewed
Chang, Mark – Educational and Psychological Measurement, 2017
We briefly discuss the philosophical basis of science, causality, and scientific evidence, by introducing the hidden but most fundamental principle of science: the similarity principle. The principle's use in scientific discovery is illustrated with Simpson's paradox and other examples. In discussing the value of null hypothesis statistical…
Descriptors: Hypothesis Testing, Evidence, Sciences, Scientific Principles
Peer reviewed
Patriota, Alexandre Galvão – Educational and Psychological Measurement, 2017
Bayesian and classical statistical approaches are based on different types of logical principles. In order to avoid mistaken inferences and misguided interpretations, the practitioner must respect the inference rules embedded into each statistical method. Ignoring these principles leads to the paradoxical conclusions that the hypothesis…
Descriptors: Hypothesis Testing, Bayesian Statistics, Statistical Inference, Statistical Analysis
Peer reviewed
Haig, Brian D. – Educational and Psychological Measurement, 2017
This article considers the nature and place of tests of statistical significance (ToSS) in science, with particular reference to psychology. Despite the enormous amount of attention given to this topic, psychology's understanding of ToSS remains deficient. The major problem stems from a widespread and uncritical acceptance of null hypothesis…
Descriptors: Statistical Significance, Statistical Analysis, Hypothesis Testing, Psychology
Peer reviewed
Häggström, Olle – Educational and Psychological Measurement, 2017
Null hypothesis significance testing (NHST) provides an important statistical toolbox, but there are a number of ways in which it is often abused and misinterpreted, with bad consequences for the reliability and progress of science. Parts of the contemporary NHST debate, especially in the psychological sciences, are reviewed, and a suggestion is made…
Descriptors: Hypothesis Testing, Statistical Analysis, Psychological Studies, Taxonomy
Peer reviewed
Wilcox, Rand R.; Serang, Sarfaraz – Educational and Psychological Measurement, 2017
The article provides perspectives on p values, null hypothesis testing, and alternative techniques in light of modern robust statistical methods. Null hypothesis testing and "p" values can provide useful information provided they are interpreted in a sound manner, which includes taking into account insights and advances that have…
Descriptors: Hypothesis Testing, Bayesian Statistics, Computation, Effect Size
Peer reviewed
Wiens, Stefan; Nilsson, Mats E. – Educational and Psychological Measurement, 2017
Because of the continuing debates about statistics, many researchers may feel confused about how to analyze and interpret data. Current guidelines in psychology advocate the use of effect sizes and confidence intervals (CIs). However, researchers may be unsure about how to extract effect sizes from factorial designs. Contrast analysis is helpful…
Descriptors: Data Analysis, Effect Size, Computation, Statistical Analysis
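Contrast analysis reduces a factorial comparison to a single weighted combination of group means, from which an effect size follows directly. The sketch below uses a common textbook form — the contrast value divided by the pooled within-group standard deviation — and is only an illustration; the article's recommended formulas may differ:

```python
import numpy as np

def contrast_effect(groups, weights):
    """Contrast estimate L = sum(w_j * mean_j) and a standardized effect
    size d = L / pooled within-group SD, for a one-way design.
    Weights must sum to zero. (A textbook form, not necessarily the
    article's exact recommendation.)"""
    assert abs(sum(weights)) < 1e-12, "contrast weights must sum to zero"
    means = np.array([np.mean(g) for g in groups])
    ss = sum(np.sum((np.asarray(g) - np.mean(g)) ** 2) for g in groups)
    df = sum(len(g) - 1 for g in groups)
    sp = np.sqrt(ss / df)                # pooled within-group SD
    L = float(np.dot(weights, means))    # contrast value
    return L, L / sp

# Example: simple two-group comparison with weights (-1, +1).
L, d = contrast_effect([[1, 2, 3], [3, 4, 5]], [-1, 1])
print(L, d)  # contrast of 2.0, standardized effect of 2.0
```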
Peer reviewed
Miller, Jeff – Educational and Psychological Measurement, 2017
Critics of null hypothesis significance testing suggest that (a) its basic logic is invalid and (b) it addresses a question that is of no interest. In contrast to (a), I argue that the underlying logic of hypothesis testing is actually extremely straightforward and compelling. To substantiate that, I present examples showing that hypothesis…
Descriptors: Hypothesis Testing, Testing Problems, Test Validity, Relevance (Education)
Peer reviewed
García-Pérez, Miguel A. – Educational and Psychological Measurement, 2017
Null hypothesis significance testing (NHST) has been the subject of debate for decades and alternative approaches to data analysis have been proposed. This article addresses this debate from the perspective of scientific inquiry and inference. Inference is an inverse problem and application of statistical methods cannot reveal whether effects…
Descriptors: Hypothesis Testing, Statistical Inference, Effect Size, Bayesian Statistics
Peer reviewed
Huggins-Manley, Anne Corinne – Educational and Psychological Measurement, 2017
This study defines subpopulation item parameter drift (SIPD) as a change in item parameters over time that is dependent on subpopulations of examinees, and hypothesizes that the presence of SIPD in anchor items is associated with bias and/or lack of invariance in three psychometric outcomes. Results show that SIPD in anchor items is associated…
Descriptors: Psychometrics, Test Items, Item Response Theory, Hypothesis Testing