Publication Date
| In 2015 | 0 |
| Since 2014 | 2 |
| Since 2011 (last 5 years) | 5 |
| Since 2006 (last 10 years) | 17 |
| Since 1996 (last 20 years) | 29 |
Descriptor
| Psychometrics | 45 |
| Test Items | 18 |
| Test Construction | 12 |
| Item Response Theory | 10 |
| Computer Assisted Testing | 9 |
| Difficulty Level | 6 |
| Educational Assessment | 6 |
| Models | 6 |
| Standardized Tests | 6 |
| Comparative Analysis | 5 |
| More ▼ | |
Source
| Applied Measurement in… | 45 |
Author
| Angoff, William H. | 2 |
| Beck, Michael D. | 2 |
| Brennan, Robert L. | 2 |
| Hambleton, Ronald K. | 2 |
| Moshinsky, Avital | 2 |
| Puhan, Gautam | 2 |
| Alves, Cecilia B. | 1 |
| Anastasi, Anne | 1 |
| Antal, Judit | 1 |
| Bahry, Louise M. | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 45 |
| Reports - Evaluative | 19 |
| Reports - Research | 19 |
| Speeches/Meeting Papers | 5 |
| Information Analyses | 4 |
| Reports - Descriptive | 4 |
| Opinion Papers | 3 |
| Collected Works - General | 1 |
Education Level
| Elementary Education | 2 |
| Grade 10 | 2 |
| High Schools | 2 |
| Higher Education | 2 |
| Elementary Secondary Education | 1 |
| Grade 3 | 1 |
| Grade 5 | 1 |
| Grade 7 | 1 |
| Primary Education | 1 |
| Secondary Education | 1 |
| More ▼ | |
Audience
| Researchers | 1 |
Showing 1 to 15 of 45 results
Antal, Judit; Proctor, Thomas P.; Melican, Gerald J. – Applied Measurement in Education, 2014
In common-item equating the anchor block is generally built to represent a miniature form of the total test in terms of content and statistical specifications. The statistical properties frequently reflect equal mean and spread of item difficulty. Sinharay and Holland (2007) suggested that the requirement for equal spread of difficulty may be too…
Descriptors: Test Items, Equated Scores, Difficulty Level, Item Response Theory
Roduta Roberts, Mary; Alves, Cecilia B.; Chu, Man-Wai; Thompson, Margaret; Bahry, Louise M.; Gotzmann, Andrea – Applied Measurement in Education, 2014
The purpose of this study was to evaluate the adequacy of three cognitive models, one developed by content experts and two generated from student verbal reports for explaining examinee performance on a grade 3 diagnostic mathematics test. For this study, the items were developed to directly measure the attributes in the cognitive model. The…
Descriptors: Foreign Countries, Mathematics Tests, Cognitive Processes, Models
Boyd, Aimee M.; Dodd, Barbara; Fitzpatrick, Steven – Applied Measurement in Education, 2013
This study compared several exposure control procedures for CAT systems based on the three-parameter logistic testlet response theory model (Wang, Bradlow, & Wainer, 2002) and Masters' (1982) partial credit model when applied to a pool consisting entirely of testlets. The exposure control procedures studied were the modified within 0.10 logits…
Descriptors: Computer Assisted Testing, Item Response Theory, Test Construction, Models
Edwards, Michael C.; Flora, David B.; Thissen, David – Applied Measurement in Education, 2012
This article describes a computerized adaptive test (CAT) based on the uniform item exposure multi-form structure (uMFS). The uMFS is a specialization of the multi-form structure (MFS) idea described by Armstrong, Jones, Berliner, and Pashley (1998). In an MFS CAT, the examinee first responds to a small fixed block of items. The items comprising…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Format, Test Items
Kahraman, Nilufer; De Champlain, Andre; Raymond, Mark – Applied Measurement in Education, 2012
Item-level information, such as difficulty and discrimination are invaluable to the test assembly, equating, and scoring practices. Estimating these parameters within the context of large-scale performance assessments is often hindered by the use of unbalanced designs for assigning examinees to tasks and raters because such designs result in very…
Descriptors: Performance Based Assessment, Medicine, Factor Analysis, Test Items
Randall, Jennifer; Engelhard, George, Jr. – Applied Measurement in Education, 2010
The psychometric properties and multigroup measurement invariance of scores across subgroups, items, and persons on the "Reading for Meaning" items from the Georgia Criterion Referenced Competency Test (CRCT) were assessed in a sample of 778 seventh-grade students. Specifically, we sought to determine the extent to which score-based inferences on…
Descriptors: Testing Accommodations, Test Items, Learning Disabilities, Factor Analysis
Brennan, Robert L. – Applied Measurement in Education, 2010
This paper provides an overview of evidence-centered assessment design (ECD) and some general information about of the Advanced Placement (AP[R]) Program. Then the papers in this special issue are discussed, as they relate to the use of ECD in the revision of various AP tests. This paper concludes with some observations about the need to validate…
Descriptors: Advanced Placement Programs, Equivalency Tests, Evidence, Test Construction
Hein, Serge F.; Skaggs, Gary E. – Applied Measurement in Education, 2009
Only a small number of qualitative studies have investigated panelists' experiences during standard-setting activities or the thought processes associated with panelists' actions. This qualitative study involved an examination of the experiences of 11 panelists who participated in a prior, one-day standard-setting meeting in which either the…
Descriptors: Focus Groups, Standard Setting, Cutting Scores, Cognitive Processes
Leighton, Jacqueline P.; Cui, Ying; Cor, M. Ken – Applied Measurement in Education, 2009
The objective of the present investigation was to compare the adequacy of two cognitive models for predicting examinee performance on a sample of algebra I and II items from the March 2005 administration of the SAT[TM]. The two models included one generated from verbal reports provided by 21 examinees as they solved the SAT[TM] items, and the…
Descriptors: Test Items, Inferences, Cognitive Ability, Prediction
Meyers, Jason L.; Miller, G. Edward; Way, Walter D. – Applied Measurement in Education, 2009
In operational testing programs using item response theory (IRT), item parameter invariance is threatened when an item appears in a different location on the live test than it did when it was field tested. This study utilizes data from a large state's assessments to model change in Rasch item difficulty (RID) as a function of item position change,…
Descriptors: Test Items, Test Content, Testing Programs, Simulation
Puhan, Gautam – Applied Measurement in Education, 2009
The purpose of this study is to determine the extent of scale drift on a test that employs cut scores. It was essential to examine scale drift for this testing program because new forms in this testing program are often put on scale through a series of intermediate equatings (known as equating chains). This process may cause equating error to…
Descriptors: Testing Programs, Testing, Measurement Techniques, Item Response Theory
Briggs, Derek C. – Applied Measurement in Education, 2008
This article illustrates the use of an explanatory item response modeling (EIRM) approach in the context of measuring group differences in science achievement. The distinction between item response models and EIRMs, recently elaborated by De Boeck and Wilson (2004), is presented within the statistical framework of generalized linear mixed models.…
Descriptors: Science Achievement, Science Tests, Measurement, Error of Measurement
Beck, Michael D. – Applied Measurement in Education, 2007
This article addresses the set of articles in this special issue of "Applied Measurement in Education" and reflects on the issues that underlie the articles. The authors, as a set, represent many of the professionals who have developed and studied the methodological procedures related to instruction-assessment alignment over the past decade. This…
Descriptors: Psychometrics, Research Methodology, Review (Reexamination), Measurement
Elliott, Stephen N.; Roach, Andrew T. – Applied Measurement in Education, 2007
This article examines three typical approaches to alternate assessment for students with significant cognitive disabilities--portfolios, performance assessments, and rating scales. A detailed analysis of common and unique design features of these approaches is provided, including features of each approach that influence the psychometric quality of…
Descriptors: Psychometrics, Validity, Rating Scales, Alternative Assessment
Beck, Michael D. – Applied Measurement in Education, 2007
This article addresses the set of articles in this special issue of "Applied Measurement in Education" and reflects on the issues that underlie the articles. The authors, as a set, represent many of the professionals who have developed and studied the methodological procedures related to instruction-assessment alignment over the past decade.…
Descriptors: Educational Research, Psychometrics, Journal Articles, Review (Reexamination)

Peer reviewed
Direct link
