Showing 1 to 15 of 215 results
Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2015
A direct approach to point and interval estimation of Cronbach's coefficient alpha for multiple component measuring instruments is outlined. The procedure is based on a latent variable modeling application with widely circulated software. As a by-product, using sample data the method permits ascertaining whether the population discrepancy…
Descriptors: Computation, Statistical Analysis, Reliability, Models
France, Stephen L.; Batchelder, William H. – Educational and Psychological Measurement, 2015
Cultural consensus theory (CCT) is a data aggregation technique with many applications in the social and behavioral sciences. We describe the intuition and theory behind a set of CCT models for continuous type data using maximum likelihood inference methodology. We describe how bias parameters can be incorporated into these models. We introduce…
Descriptors: Maximum Likelihood Statistics, Test Items, Difficulty Level, Test Theory
Dowdy, Erin; Nylund-Gibson, Karen; Felix, Erika D.; Morovati, Diane; Carnazzo, Katherine W.; Dever, Bridget V. – Educational and Psychological Measurement, 2014
The practice of screening students to identify behavioral and emotional risk is gaining momentum, with limited guidance regarding the frequency with which screenings should occur. Screening frequency decisions are influenced by the stability of the constructs assessed and changes in risk status over time. This study investigated the 4-year…
Descriptors: Screening Tests, Risk, Behavior Disorders, Emotional Disturbances
Mashburn, Andrew J.; Meyer, J. Patrick; Allen, Joseph P.; Pianta, Robert C. – Educational and Psychological Measurement, 2014
Observational methods are increasingly being used in classrooms to evaluate the quality of teaching. Operational procedures for observing teachers are somewhat arbitrary in existing measures and vary across different instruments. To study the effect of different observation procedures on score reliability and validity, we conducted an experimental…
Descriptors: Observation, Teacher Evaluation, Reliability, Validity
Attali, Yigal – Educational and Psychological Measurement, 2014
This article presents a comparative judgment approach for holistically scored constructed response tasks. In this approach, the grader rank orders (rather than rates) the quality of a small set of responses. A prior automated evaluation of responses guides both set formation and scaling of rankings. Sets are formed to have similar prior scores and…
Descriptors: Responses, Item Response Theory, Scores, Rating Scales
Meyer, J. Patrick; Liu, Xiang; Mashburn, Andrew J. – Educational and Psychological Measurement, 2014
Researchers often use generalizability theory to estimate relative error variance and reliability in teaching observation measures. They also use it to plan future studies and design the best possible measurement procedures. However, designing the best possible measurement procedure comes at a cost, and researchers must stay within their budget…
Descriptors: Reliability, Classroom Observation Techniques, Generalizability Theory, Error of Measurement
Williams, Ryan T.; Swanlund, Andrew; Miller, Shazia; Konstantopoulos, Spyros; Eno, Jared; van der Ploeg, Arie; Meyers, Coby – Educational and Psychological Measurement, 2014
This study operationalizes four measures of instructional differentiation: one for Grade 2 English language arts (ELA), one for Grade 2 mathematics, one for Grade 5 ELA, and one for Grade 5 mathematics. Our study evaluates the measurement properties of each measure in a large field experiment: the Indiana Diagnostic Assessment Tools Study, which…
Descriptors: Individualized Instruction, Grade 2, Grade 5, English Instruction
Casabianca, Jodi M.; McCaffrey, Daniel F.; Gitomer, Drew H.; Bell, Courtney A.; Hamre, Bridget K.; Pianta, Robert C. – Educational and Psychological Measurement, 2013
Classroom observation of teachers is a significant part of educational measurement; measurements of teacher practice are being used in teacher evaluation systems across the country. This research investigated whether observations made live in the classroom and from video recording of the same lessons yielded similar inferences about teaching.…
Descriptors: Secondary School Mathematics, Mathematics Instruction, Classroom Observation Techniques, Algebra
Padilla, Miguel A.; Divers, Jasmin – Educational and Psychological Measurement, 2013
The performance of the normal theory bootstrap (NTB), the percentile bootstrap (PB), and the bias-corrected and accelerated (BCa) bootstrap confidence intervals (CIs) for coefficient omega was assessed through a Monte Carlo simulation under conditions not previously investigated. Of particular interest were nonnormal Likert-type and binary items.…
Descriptors: Sampling, Statistical Inference, Computation, Statistical Analysis
Shear, Benjamin R.; Zumbo, Bruno D. – Educational and Psychological Measurement, 2013
Type I error rates in multiple regression, and hence the chance for false positive research findings, can be drastically inflated when multiple regression models are used to analyze data that contain random measurement error. This article shows the potential for inflated Type I error rates in commonly encountered scenarios and provides new…
Descriptors: Error of Measurement, Multiple Regression Analysis, Data Analysis, Computer Simulation
Davison, Mark L.; Semmes, Robert; Huang, Lan; Close, Catherine N. – Educational and Psychological Measurement, 2012
Data from 181 college students were used to assess whether math reasoning item response times in computerized testing can provide valid and reliable measures of a speed dimension. The alternate forms reliability of the speed dimension was .85. A two-dimensional structural equation model suggests that the speed dimension is related to the accuracy…
Descriptors: Computer Assisted Testing, Reaction Time, Reliability, Validity
Thomas, D. Roland; Zumbo, Bruno D. – Educational and Psychological Measurement, 2012
There is such doubt in research practice about the reliability of difference scores that granting agencies, journal editors, reviewers, and committees of graduate students' theses have been known to deplore their use. This most maligned index can be used in studies of change, growth, or perhaps discrepancy between two measures taken on the same…
Descriptors: Statistical Analysis, Reliability, Scores, Change
Cheng, Ying; Yuan, Ke-Hai; Liu, Cheng – Educational and Psychological Measurement, 2012
Reliability of test scores is one of the most pervasive psychometric concepts in measurement. Reliability coefficients based on a unifactor model for continuous indicators include maximal reliability rho and an unweighted sum score-based omega, among many others. With increasing popularity of item response theory, a parallel reliability measure pi…
Descriptors: Reliability, Factor Analysis, Psychometrics, Item Response Theory
Wakita, Takafumi; Ueshima, Natsumi; Noguchi, Hiroyuki – Educational and Psychological Measurement, 2012
This study examined whether the number of options in the Likert scale influences the psychological distance between categories. The most important assumption when using the Likert scale is that the psychological distance between options is equal. The authors proposed a new algorithm for calculating the scale values of options by applying item…
Descriptors: Likert Scales, Test Items, Personality Measures, Item Response Theory
Padilla, Miguel A.; Veprinsky, Anna – Educational and Psychological Measurement, 2012
Issues with correlation attenuation due to measurement error are well documented. More than a century ago, Spearman proposed a correction for attenuation. However, this correction has seen very little use since it can potentially inflate the true correlation beyond one. In addition, very little confidence interval (CI) research has been done for…
Descriptors: Correlation, Error of Measurement, Sampling, Statistical Inference

