Publication Date
| In 2015 | 0 |
| Since 2014 | 1 |
| Since 2011 (last 5 years) | 2 |
| Since 2006 (last 10 years) | 11 |
| Since 1996 (last 20 years) | 21 |
Descriptor
| Test Validity | 77 |
| Testing Problems | 27 |
| Test Use | 25 |
| Test Construction | 23 |
| Elementary Secondary Education | 22 |
| Achievement Tests | 14 |
| Standardized Tests | 14 |
| Test Interpretation | 14 |
| Educational Assessment | 13 |
| Standards | 12 |
Source
| Educational Measurement:… | 77 |
Author
| Linn, Robert L. | 4 |
| Mehrens, William A. | 4 |
| Frisbie, David A. | 3 |
| Cizek, Gregory J. | 2 |
| Madaus, George F. | 2 |
| Popham, W. James | 2 |
| Rudner, Lawrence M. | 2 |
| Shepard, Lorrie A. | 2 |
| Sireci, Stephen G. | 2 |
| Bejar, Isaac I. | 1 |
Publication Type
| Journal Articles | 77 |
| Opinion Papers | 31 |
| Reports - Evaluative | 23 |
| Reports - Descriptive | 16 |
| Reports - Research | 12 |
| Information Analyses | 10 |
| Speeches/Meeting Papers | 5 |
| Guides - Non-Classroom | 1 |
Education Level
| Elementary Secondary Education | 2 |
| Higher Education | 1 |
Audience
| Researchers | 1 |
Showing 1 to 15 of 77 results
Buzick, Heather; Stone, Elizabeth – Educational Measurement: Issues and Practice, 2014
Read aloud is a testing accommodation that has been studied by many researchers, and its use on K-12 assessments continues to be debated because of its potential to change the measured construct or unfairly increase test scores. This study is a summary of quantitative research on the read aloud accommodation. Previous studies contributed…
Descriptors: Meta Analysis, Reading Aloud to Others, Educational Research, Statistical Analysis
Bejar, Isaac I. – Educational Measurement: Issues and Practice, 2012
The scoring process is critical in the validation of tests that rely on constructed responses. Documenting that readers carry out the scoring in ways consistent with the construct and measurement goals is an important aspect of score validity. In this article, rater cognition is approached as a source of support for a validity argument for scores…
Descriptors: Scores, Inferences, Validity, Scoring
Nichols, Paul D.; Williams, Natasha – Educational Measurement: Issues and Practice, 2009
This article has three goals. The first goal is to clarify the role that the consequences of test score use play in validity judgments by reviewing the role that modern writers on validity have ascribed to consequences in supporting validity judgments. The second goal is to summarize current views on who is responsible for collecting evidence of…
Descriptors: Tests, Test Validity, Scores, Data Collection
Johnstone, Christopher J.; Thompson, Sandra J.; Bottsford-Miller, Nicole A.; Thurlow, Martha L. – Educational Measurement: Issues and Practice, 2008
Test items undergo multiple iterations of review before states and vendors deem them acceptable to be placed in a live statewide assessment. This article reviews three approaches that can add validity evidence to states' item review processes. The first process is a structured sensitivity review process that focuses on universal design…
Descriptors: Test Items, Disabilities, Test Construction, Testing Programs
Elliott, Stephen N.; Compton, Elizabeth; Roach, Andrew T. – Educational Measurement: Issues and Practice, 2007
The relationships between ratings on the Idaho Alternate Assessment (IAA) for 116 students with significant disabilities and corresponding ratings for the same students on two norm-referenced teacher rating scales were examined to gain evidence about the validity of resulting IAA scores. To contextualize these findings, another group of 54…
Descriptors: Inferences, Disabilities, Rating Scales, Eligibility
Lu, Ying; Sireci, Stephen G. – Educational Measurement: Issues and Practice, 2007
Speededness refers to the situation where the time limits on a standardized test do not allow substantial numbers of examinees to fully consider all test items. When tests are not intended to measure speed of responding, speededness introduces a severe threat to the validity of interpretations based on test scores. In this article, we describe…
Descriptors: Test Items, Timed Tests, Standardized Tests, Test Validity
Sinharay, Sandip; Haberman, Shelby; Puhan, Gautam – Educational Measurement: Issues and Practice, 2007
There is an increasing interest in reporting subscores, both at examinee level and at aggregate levels. However, it is important to ensure reasonable subscore performance in terms of high reliability and validity to minimize incorrect instructional and remediation decisions. This article employs a statistical measure based on classical test theory…
Descriptors: Test Reliability, Test Theory, Test Validity, Statistical Analysis
Cizek, Gregory J.; Crocker, Linda; Frisbie, David A.; Mehrens, William A.; Stiggins, Richard J. – Educational Measurement: Issues and Practice, 2006
The authors describe the significant contributions of Robert Ebel to educational measurement theory and its applications. A biographical sketch details Ebel's roots and professional resume. His influence on classroom assessment views and procedures is explored. Classic publications associated with validity, reliability, and score interpretation…
Descriptors: Test Theory, Educational Assessment, Psychometrics, Test Reliability
Gorin, Joanna S. – Educational Measurement: Issues and Practice, 2006
One of the primary themes of the National Research Council's 2001 book "Knowing What Students Know" was the importance of cognition as a component of assessment design and measurement theory (NRC, 2001). One reaction to the book has been an increased use of sophisticated statistical methods to model cognitive information available in test data.…
Descriptors: Test Construction, Student Evaluation, Academic Ability, Evaluation Methods
Sireci, Stephen G.; Parker, Polly – Educational Measurement: Issues and Practice, 2006
The psychometric literature is replete with comprehensive discussions of test validity, test validation, and the characteristics of quality assessment programs. The most authoritative source for guidance regarding sound test development and evaluation practices is the Standards for Educational and Psychological Testing. However, the Standards are…
Descriptors: Psychometrics, Test Validity, Educational Testing, Psychological Testing
Wise, Steven L.; Bhola, Dennison S.; Yang, Sheng-Ta – Educational Measurement: Issues and Practice, 2006
The attractiveness of computer-based tests (CBTs) is due largely to their capability to expand the ways we conduct testing. A relatively unexplored application, however, is actively using the computer to reduce construct-irrelevant variance while a test is being administered. This investigation introduces the effort-monitoring CBT, in which the…
Descriptors: Computer Assisted Testing, Test Validity, Reaction Time, Guessing (Tests)
Chester, Mitchell D. – Educational Measurement: Issues and Practice, 2005
This study explores the use of multiple measures to enhance the validity and reliability of inferences about school and district effectiveness. Using data from the state of Ohio, a framework for combining measures is applied to examine the individual and collective impact of multiple measures on both the federal AYP designations and state ratings.…
Descriptors: Inferences, Educational Improvement, School Effectiveness, Accountability
Wang, Ning; Schnipke, Deborah; Witt, Elizabeth A. – Educational Measurement: Issues and Practice, 2005
The task inventory approach is commonly used in job analysis for establishing content validity evidence supporting the use and interpretation of licensure and certification examinations. Although the results of a task inventory survey provide job task-related information that can be used as a reliable and valid source for test development, it is…
Descriptors: Nursing, Test Construction, Job Skills, Knowledge Level
Haladyna, Thomas M.; Downing, Steven M. – Educational Measurement: Issues and Practice, 2004
There are many threats to validity in high-stakes achievement testing. One major threat is construct-irrelevant variance (CIV). This article defines CIV in the context of the contemporary, unitary view of validity and presents logical arguments, hypotheses, and documentation for a variety of CIV sources that commonly threaten interpretations of…
Descriptors: Student Evaluation, Evaluation Methods, High Stakes Tests, Construct Validity
Zwick, Rebecca; Schlemer, Lizabeth – Educational Measurement: Issues and Practice, 2004
The validity of the SAT as an admissions criterion for Latinos and Asian Americans who are not native English speakers was examined. The analyses, based on 1997 and 1998 UCSB freshmen, focused on the effectiveness of SAT scores and high school grade-point average (HSGPA) in predicting college freshman grade-point average (FGPA). When regression…
Descriptors: Test Validity, Language Minorities, Asian American Students, Hispanic American Students

