ERIC - Search Results

Showing all 14 results

Item Selection Algorithm Based on Collaborative Filtering for Item Exposure Control

Peer reviewed

Pan, Yiqin; Livne, Oren; Wollack, James A.; Sinharay, Sandip – Educational Measurement: Issues and Practice, 2023

In computerized adaptive testing, overexposure of items in the bank is a serious problem and might result in item compromise. We develop an item selection algorithm that utilizes the entire bank well and reduces the overexposure of items. The algorithm is based on collaborative filtering and selects an item in two stages. In the first stage, a set…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Algorithms

Are There Distinctive Profiles in Examinee Essay-Writing Processes?

Peer reviewed

Bennett, Randy E.; Zhang, Mo; Sinharay, Sandip; Guo, Hongwen; Deane, Paul – Educational Measurement: Issues and Practice, 2022

Grouping individuals according to a set of measured characteristics, or profiling, is frequently used in describing, understanding, and acting on a phenomenon. The advent of computer-based assessment offers new possibilities for profiling writing because aspects can be captured that were not heretofore observable. We explored whether writing…

Descriptors: Computer Assisted Testing, Adults, High School Equivalency Programs, Tests

Reporting Pass-Fail Decisions to Examinees with Incomplete Data: A Commentary on Feinberg (2021)

Peer reviewed

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2022

Administrative problems such as computer malfunction and power outage occasionally lead to missing item scores, and hence to incomplete data, on credentialing tests such as the United States Medical Licensing examination. Feinberg compared four approaches for reporting pass-fail decisions to the examinees with incomplete data on credentialing…

Descriptors: Testing Problems, High Stakes Tests, Credentials, Test Items

An Investigation of the Nature and Consequence of the Relationship between IRT Difficulty and Discrimination

Peer reviewed

Sweeney, Sandra M.; Sinharay, Sandip; Johnson, Matthew S.; Steinhauer, Eric W. – Educational Measurement: Issues and Practice, 2022

The focus of this paper is on the empirical relationship between item difficulty and item discrimination. Two studies--an empirical investigation and a simulation study--were conducted to examine the association between item difficulty and item discrimination under classical test theory and item response theory (IRT), and the effects of the…

Descriptors: Correlation, Item Response Theory, Item Analysis, Difficulty Level

Score Reporting for Examinees with Incomplete Data on Large-Scale Educational Assessments

Peer reviewed

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2021

Technical difficulties occasionally lead to missing item scores and hence to incomplete data on computerized tests. It is not straightforward to report scores to the examinees whose data are incomplete due to technical difficulties. Such reporting essentially involves imputation of missing scores. In this paper, a simulation study based on data…

Descriptors: Data Analysis, Scores, Educational Assessment, Educational Testing

Digital Module 07: Subscores--Evaluation and Reporting https://ncme.elevate.commpartners.com

Peer reviewed

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2019

Test score users often demand the reporting of subscores due to their potential diagnostic, remedial, and instructional benefits. Therefore, there is substantial pressure on testing programs to report subscores. However, professional standards require that subscores have to satisfy minimum quality standards before they can be reported. In this…

Descriptors: Testing, Scores, Item Response Theory, Evaluation Methods

On the Choice of Anchor Tests in Equating

Peer reviewed

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2018

The choice of anchor tests is crucial in applications of the nonequivalent groups with anchor test design of equating. Sinharay and Holland (2006, 2007) suggested "miditests," which are anchor tests that are content-representative and have the same mean item difficulty as the total test but have a smaller spread of item difficulties.…

Descriptors: Test Content, Difficulty Level, Test Items, Test Construction

An NCME Instructional Module on Data Mining Methods for Classification and Regression

Peer reviewed

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2016

Data mining methods for classification and regression are becoming increasingly popular in various scientific fields. However, these methods have not been explored much in educational measurement. This module first provides a review, which should be accessible to a wide audience in education measurement, of some of these methods. The module then…

Descriptors: Data Collection, Information Retrieval, Classification, Regression (Statistics)

Too Simple to Be Useful: A Comment on Feinberg and Wainer (2014)

Peer reviewed

Sinharay, Sandip; Haberman, Shelby; Boughton, Keith – Educational Measurement: Issues and Practice, 2015

Feinberg and Wainer (2014) provided a simple equation to approximate/predict a subscore's value. The purpose of this note is to point out that their equation is often inaccurate in that it does not always predict a subscore's value correctly. Therefore, the utility of their simple equation is not clear.

Descriptors: Equations (Mathematics), Scores, Prediction, Accuracy

How Often Is the Misfit of Item Response Theory Models Practically Significant?

Peer reviewed

Sinharay, Sandip; Haberman, Shelby J. – Educational Measurement: Issues and Practice, 2014

Standard 3.9 of the Standards for Educational and Psychological Testing (1999) demands evidence of model fit when item response theory (IRT) models are employed to data from tests. Hambleton and Han (2005) and Sinharay (2005) recommended the assessment of practical significance of misfit of IRT models, but…

Descriptors: Item Response Theory, Goodness of Fit, Models, Tests

A Note on Assessing the Added Value of Subscores

Peer reviewed

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2014

Brennan (2012) noted that users of test scores often want (indeed, demand) that subscores be reported, along with total test scores, for diagnostic purposes. Haberman (2008) suggested a method based on classical test theory (CTT) to determine if subscores have added value over the total score. According to this…

Descriptors: Scores, Test Theory, Test Interpretation

First Language of Test Takers and Fairness Assessment Procedures

Peer reviewed

Sinharay, Sandip; Dorans, Neil J.; Liang, Longjuan – Educational Measurement: Issues and Practice, 2011

Over the past few decades, those who take tests in the United States have exhibited increasing diversity with respect to native language. Standard psychometric procedures for ensuring item and test fairness that have existed for some time were developed when test-taking groups were predominantly native English speakers. A better understanding of…

Descriptors: Test Bias, Testing Programs, Psychometrics, Language Proficiency

An NCME Instructional Module on Subscores

Peer reviewed

Sinharay, Sandip; Puhan, Gautam; Haberman, Shelby J. – Educational Measurement: Issues and Practice, 2011

The purpose of this ITEMS module is to provide an introduction to subscores. First, examples of subscores from an operational test are provided. Then, a review of methods that can be used to examine if subscores have adequate psychometric quality is provided. It is demonstrated, using results from operational and simulated data, that subscores…

Descriptors: Scores, Psychometrics, Tests, Data

Subscores Based on Classical Test Theory: To Report or Not to Report

Peer reviewed

Sinharay, Sandip; Haberman, Shelby; Puhan, Gautam – Educational Measurement: Issues and Practice, 2007

There is an increasing interest in reporting subscores, both at examinee level and at aggregate levels. However, it is important to ensure reasonable subscore performance in terms of high reliability and validity to minimize incorrect instructional and remediation decisions. This article employs a statistical measure based on classical test theory…

Descriptors: Test Reliability, Test Theory, Test Validity, Statistical Analysis
