Publication Date
In 2023 | 4 |
Since 2022 | 17 |
Since 2019 (last 5 years) | 65 |
Since 2014 (last 10 years) | 167 |
Since 2004 (last 20 years) | 393 |
Descriptor
Measurement | 132 |
Models | 107 |
Item Response Theory | 106 |
Psychometrics | 79 |
Measurement Techniques | 73 |
Evaluation Methods | 71 |
Educational Assessment | 56 |
Statistical Analysis | 56 |
Test Validity | 54 |
Scores | 51 |
Validity | 50 |
More ▼ |
Source
Measurement:… | 393 |
Author
Engelhard, George, Jr. | 15 |
Borsboom, Denny | 8 |
Raykov, Tenko | 7 |
Kane, Michael | 6 |
Kane, Michael T. | 6 |
Marcoulides, George A. | 6 |
Markus, Keith A. | 6 |
Wind, Stefanie A. | 6 |
Briggs, Derek C. | 5 |
Hill, Heather C. | 5 |
Wang, Jue | 5 |
More ▼ |
Publication Type
Education Level
Audience
Practitioners | 1 |
Researchers | 1 |
Location
United States | 10 |
Australia | 4 |
Germany | 4 |
United Kingdom | 4 |
United Kingdom (England) | 4 |
California | 3 |
Canada | 3 |
Asia | 2 |
China | 2 |
Italy | 2 |
Kentucky | 2 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 7 |
Race to the Top | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Peabody, Michael R. – Measurement: Interdisciplinary Research and Perspectives, 2023
Many organizations utilize some form of automation in the test assembly process; either fully algorithmic or heuristically constructed. However, one issue with heuristic models is that when the test assembly problem changes the entire model may need to be re-conceptualized and recoded. In contrast, mixed-integer programming (MIP) is a mathematical…
Descriptors: Programming Languages, Algorithms, Heuristics, Mathematical Models
Novak, Josip; Rebernjak, Blaž – Measurement: Interdisciplinary Research and Perspectives, 2023
A Monte Carlo simulation study was conducted to examine the performance of [alpha], [lambda]2, [lambda][subscript 4], [lambda][subscript 2], [omega][subscript T], GLB[subscript MRFA], and GLB[subscript Algebraic] coefficients. Population reliability, distribution shape, sample size, test length, and number of response categories were varied…
Descriptors: Monte Carlo Methods, Evaluation Methods, Reliability, Simulation
Raykov, Tenko; Marcoulides, George A. – Measurement: Interdisciplinary Research and Perspectives, 2023
This article outlines a readily applicable procedure for point and interval estimation of the population discrepancy between reliability and the popular Cronbach's coefficient alpha for unidimensional multi-component measuring instruments with uncorrelated errors, which are widely used in behavioral and social research. The method is developed…
Descriptors: Measurement, Test Reliability, Measurement Techniques, Error of Measurement
Gan, Zhengdong; He, Jinbo; Zhang, Lawrence Jun; Schumacker, Randall – Measurement: Interdisciplinary Research and Perspectives, 2023
While classroom feedback has been shown to be a key mediating factor in students' learning process and performance, the bulk of current research on feedback in the field of foreign language education has largely focused on how teachers respond to students' linguistic errors. Published research on how students in a foreign language context respond…
Descriptors: Learning Motivation, English (Second Language), Second Language Learning, Scaffolding (Teaching Technique)
Cui, Zhongmin – Measurement: Interdisciplinary Research and Perspectives, 2022
Although many educational and psychological tests are labeled as computerized adaptive test (CAT), not all tests show the same level of adaptivity -- some tests might not have much adaptation because of various constraints imposed by test developers. Researchers have proposed some indices to measure the amount of adaption for an adaptive test.…
Descriptors: Adaptive Testing, Computer Assisted Testing, Measurement Techniques
Cole, Ki; Paek, Insu – Measurement: Interdisciplinary Research and Perspectives, 2022
Statistical Analysis Software (SAS) is a widely used tool for data management analysis across a variety of fields. The procedure for item response theory (PROC IRT) is one to perform unidimensional and multidimensional item response theory (IRT) analysis for dichotomous and polytomous data. This review provides a summary of the features of PROC…
Descriptors: Item Response Theory, Computer Software, Item Analysis, Statistical Analysis
Wyse, Adam E.; McBride, James R. – Measurement: Interdisciplinary Research and Perspectives, 2022
A common practical challenge is how to assign ability estimates to all incorrect and all correct response patterns when using item response theory (IRT) models and maximum likelihood estimation (MLE) since ability estimates for these types of responses equal -8 or +8. This article uses a simulation study and data from an operational K-12…
Descriptors: Scores, Adaptive Testing, Computer Assisted Testing, Test Length
Wheeler, Jordan M.; Engelhard, George; Wang, Jue – Measurement: Interdisciplinary Research and Perspectives, 2022
Objectively scoring constructed-response items on educational assessments has long been a challenge due to the use of human raters. Even well-trained raters using a rubric can inaccurately assess essays. Unfolding models measure rater's scoring accuracy by capturing the discrepancy between criterion and operational ratings by placing essays on an…
Descriptors: Accuracy, Scoring, Statistical Analysis, Models
Jin, Kuan-Yu; Eckes, Thomas – Measurement: Interdisciplinary Research and Perspectives, 2022
Recent research on rater effects in performance assessments has increasingly focused on rater centrality, the tendency to assign scores clustering around the rating scale's middle categories. In the present paper, we adopted Jin and Wang's (2018) extended facets modeling approach and constructed a centrality continuum, ranging from raters…
Descriptors: Performance Based Assessment, Evaluators, Scoring, Sample Size
Levy, Roy – Measurement: Interdisciplinary Research and Perspectives, 2022
Obtaining values for latent variables in factor analysis models, also referred to as factor scores, has long been of interest to researchers. However, many treatments of factor analysis do not focus on inference about the latent variables, and even fewer do so from a Bayesian perspective. Researchers may therefore be ill-acquainted with Bayesian…
Descriptors: Factor Analysis, Bayesian Statistics, Inferences, Decision Making
Raykov, Tenko; Doebler, Philipp; Marcoulides, George A. – Measurement: Interdisciplinary Research and Perspectives, 2022
This article is concerned with the large-sample parameter estimator behavior in applications of Bayesian confirmatory factor analysis in behavioral measurement. The property of strong convergence of the popular Bayesian posterior median estimator is discussed, which states numerical convergence with probability 1 of the resulting estimates to the…
Descriptors: Bayesian Statistics, Measurement Techniques, Correlation, Factor Analysis
Wind, Stefanie A. – Measurement: Interdisciplinary Research and Perspectives, 2022
In many performance assessments, one or two raters from the complete rater pool scores each performance, resulting in a sparse rating design, where there are limited observations of each rater relative to the complete sample of students. Although sparse rating designs can be constructed to facilitate estimation of student achievement, the…
Descriptors: Evaluators, Bias, Identification, Performance Based Assessment
Thompson, James J. – Measurement: Interdisciplinary Research and Perspectives, 2022
With the use of computerized testing, ordinary assessments can capture both answer accuracy and answer response time. For the Canadian Programme for the International Assessment of Adult Competencies (PIAAC) numeracy and literacy subtests, person ability, person speed, question difficulty, question time intensity, fluency (rate), person fluency…
Descriptors: Foreign Countries, Adults, Computer Assisted Testing, Network Analysis
Leventhal, Brian C.; Gregg, Nikole; Ames, Allison J. – Measurement: Interdisciplinary Research and Perspectives, 2022
Response styles introduce construct-irrelevant variance as a result of respondents systematically responding to Likert-type items regardless of content. Methods to account for response styles through data analysis as well as approaches to mitigating the effects of response styles during data collection have been well-documented. Recent approaches…
Descriptors: Response Style (Tests), Item Response Theory, Test Items, Likert Scales
Roduta Roberts, Mary; Gotch, Chad M.; Cook, Megan; Werther, Karin; Chao, Iris C. I. – Measurement: Interdisciplinary Research and Perspectives, 2022
Performance-based assessment is a common approach to assess the development and acquisition of practice competencies among health professions students. Judgments related to the quality of performance are typically operationalized as ratings against success criteria specified within a rubric. The extent to which the rubric is understood,…
Descriptors: Protocol Analysis, Scoring Rubrics, Interviews, Performance Based Assessment