Publication Date
| In 2015 | 0 |
| Since 2014 | 18 |
| Since 2011 (last 5 years) | 65 |
| Since 2006 (last 10 years) | 157 |
| Since 1996 (last 20 years) | 288 |
Descriptor
| Elementary Secondary Education | 176 |
| Educational Assessment | 133 |
| Test Use | 128 |
| Test Construction | 117 |
| Testing Problems | 98 |
| Testing Programs | 80 |
| Scores | 79 |
| Test Validity | 77 |
| Educational Testing | 76 |
| Achievement Tests | 75 |
| More ▼ | |
Source
| Educational Measurement:… | 582 |
Author
| Mehrens, William A. | 12 |
| Plake, Barbara S. | 11 |
| Hills, John R. | 9 |
| Linn, Robert L. | 9 |
| Popham, W. James | 9 |
| Sireci, Stephen G. | 9 |
| Brennan, Robert L. | 8 |
| Cizek, Gregory J. | 8 |
| Frisbie, David A. | 8 |
| Stiggins, Richard J. | 8 |
| More ▼ | |
Publication Type
Education Level
| Elementary Secondary Education | 39 |
| Higher Education | 13 |
| Elementary Education | 9 |
| Postsecondary Education | 9 |
| Secondary Education | 8 |
| Grade 3 | 7 |
| Grade 4 | 7 |
| Grade 5 | 7 |
| High Schools | 7 |
| Grade 6 | 3 |
| More ▼ | |
Audience
| Researchers | 9 |
| Teachers | 6 |
| Practitioners | 3 |
| Counselors | 1 |
Showing 16 to 30 of 582 results
Feinberg, Richard A.; Wainer, Howard – Educational Measurement: Issues and Practice, 2014
Subscores can be of diagnostic value for tests that cover multiple underlying traits. Some items require knowledge or ability that spans more than a single trait. It is thus natural for such items to be included on more than a single subscore. Subscores only have value if they are reliable enough to justify conclusions drawn from them and if they…
Descriptors: Scores, Test Items, Reliability
Buzick, Heather; Stone, Elizabeth – Educational Measurement: Issues and Practice, 2014
Read aloud is a testing accommodation that has been studied by many researchers, and its use on K-12 assessments continues to be debated because of its potential to change the measured construct or unfairly increase test scores. This study is a summary of quantitative research on the read aloud accommodation. Previous studies contributed…
Descriptors: Meta Analysis, Reading Aloud to Others, Educational Research, Statistical Analysis
Plake, Barbara S.; Wise, Lauress L. – Educational Measurement: Issues and Practice, 2014
With the 2014 publication of the 5th revision of the "Standards for Educational and Psychological Testing," the cochairs of the Joint Committee for the revision process were asked to consider the role and importance of the "Standards" for the educational testing community, and in particular for members of the National Council…
Descriptors: Standards, Educational Testing, Psychological Testing, Role
Kingston, Neal M.; Anderson, Gretchen – Educational Measurement: Issues and Practice, 2013
Scores on state standards-based assessments are readily available and may be an appropriate alternative to traditional placement tests for assigning or accepting students into particular courses. Many community colleges do not require test scores for admissions purposes but do require some kind of placement scores for first-year English and math…
Descriptors: Dual Enrollment, Student Placement, High School Students, Scores
Lakin, Joni M.; Young, John W. – Educational Measurement: Issues and Practice, 2013
In recent years, many U.S. states have introduced growth models as part of their educational accountability systems. Although the validity of growth-based accountability models has been evaluated for the general population, the impact of those models for English language learner (ELL) students, a growing segment of the student population, has not…
Descriptors: English Language Learners, Accountability, Educational Policy, Models
Mee, Janet; Clauser, Brian E.; Margolis, Melissa J. – Educational Measurement: Issues and Practice, 2013
Despite being widely used and frequently studied, the Angoff standard setting procedure has received little attention with respect to an integral part of the process: how judges incorporate examinee performance data in the decision-making process. Without performance data, subject matter experts have considerable difficulty accurately making the…
Descriptors: Standard Setting (Scoring), Judges, Data, Decision Making
Gierl, Mark J.; Lai, Hollis – Educational Measurement: Issues and Practice, 2013
Changes to the design and development of our educational assessments are resulting in the unprecedented demand for a large and continuous supply of content-specific test items. One way to address this growing demand is with automatic item generation (AIG). AIG is the process of using item models to generate test items with the aid of computer…
Descriptors: Educational Assessment, Test Items, Automation, Computer Assisted Testing
Arffman, Inga – Educational Measurement: Issues and Practice, 2013
The article reviews research and findings on problems and issues faced when translating international academic achievement tests. The purpose is to draw attention to the problems, to help to develop the procedures followed when translating the tests, and to provide suggestions for further research. The problems concentrate on the following: the…
Descriptors: Achievement Tests, Translation, Testing Problems, Test Construction
Welsh, Megan E.; D'Agostino, Jerome V.; Kaniskan, Burcu – Educational Measurement: Issues and Practice, 2013
Standards-based progress reports (SBPRs) require teachers to grade students using the performance levels reported by state tests and are an increasingly popular report card format. They may help to increase teacher familiarity with state standards, encourage teachers to exclude nonacademic factors from grades, and/or improve communication with…
Descriptors: Grades (Scholastic), Grading, Report Cards, State Standards
Templin, Jonathan; Hoffman, Lesa – Educational Measurement: Issues and Practice, 2013
Diagnostic classification models (aka cognitive or skills diagnosis models) have shown great promise for evaluating mastery on a multidimensional profile of skills as assessed through examinee responses, but continued development and application of these models has been hindered by a lack of readily available software. In this article we…
Descriptors: Classification, Models, Language Tests, English (Second Language)
Tiffin-Richards, Simon P.; Pant, Hans Anand; Koller, Olaf – Educational Measurement: Issues and Practice, 2013
Cut-scores were set by expert judges on assessments of reading and listening comprehension of English as a foreign language (EFL), using the bookmark standard-setting method to differentiate proficiency levels defined by the Common European Framework of Reference (CEFR). Assessments contained stratified item samples drawn from extensive item…
Descriptors: Foreign Countries, English (Second Language), Language Tests, Standard Setting (Scoring)
Banks, Kathleen – Educational Measurement: Issues and Practice, 2013
The purpose of this article was to present a synthesis of the peer-reviewed differential bundle functioning (DBF) research that has been conducted to date. A total of 16 studies were synthesized according to the following characteristics: tests used and learner groups, organizing principles used for developing bundles, DBF detection methods used,…
Descriptors: Test Bias, Research, Tests, Student Characteristics
Taherbhai, Husein; Seo, Daeryong – Educational Measurement: Issues and Practice, 2013
Calibration and equating is the quintessential necessity for most large-scale educational assessments. However, there are instances when no consideration is given to the equating process in terms of context and substantive realization, and the methods used in its execution. In the view of the authors, equating is not merely an exhibit of the…
Descriptors: Item Response Theory, Equated Scores, Measurement, Educational Assessment
Liu, Jinghua; Dorans, Neil J. – Educational Measurement: Issues and Practice, 2013
We make a distinction between two types of test changes: inevitable deviations from specifications versus planned modifications of specifications. We describe how score equity assessment (SEA) can be used as a tool to assess a critical aspect of construct continuity, the equivalence of scores, whenever planned changes are introduced to testing…
Descriptors: Tests, Test Construction, Test Format, Change
Cui, Ying; Roberts, Mary Roduta – Educational Measurement: Issues and Practice, 2013
The goal of this study was to investigate the usefulness of person-fit analysis in validating student score inferences in a cognitive diagnostic assessment. In this study, a two-stage procedure was used to evaluate person fit for a diagnostic test in the domain of statistical hypothesis testing. In the first stage, the person-fit statistic, the…
Descriptors: Scores, Validity, Cognitive Tests, Diagnostic Tests

Peer reviewed
Direct link
