Showing all 8 results
Zu, Jiyun; Puhan, Gautam – Journal of Educational Measurement, 2014
Preequating is in demand because it reduces score reporting time. In this article, we evaluated an observed-score preequating method: the empirical item characteristic curve (EICC) method, which makes preequating without item response theory (IRT) possible. EICC preequating results were compared with a criterion equating and with IRT true-score…
Descriptors: Item Response Theory, Equated Scores, Item Analysis, Item Sampling
Guo, Hongwen; Puhan, Gautam – Journal of Educational Measurement, 2014
In this article, we introduce a section preequating (SPE) method (linear and nonlinear) under the randomly equivalent groups design. In this equating design, sections of Test X (a future new form) and another existing Test Y (an old form already on scale) are administered. The sections of Test X are equated to Test Y, after adjusting for the…
Descriptors: Equated Scores, Correlation, Simulation, Testing
Puhan, Gautam – Journal of Educational Measurement, 2013
When a constructed-response test form is reused, raw scores from the two administrations of the form may not be comparable. The solution to this problem requires a rescoring, at the current administration, of examinee responses from the previous administration. The scores from this "rescoring" can be used as an anchor for equating. In…
Descriptors: Scoring, Equated Scores, Testing, Correlation
Puhan, Gautam – Journal of Educational Measurement, 2012
Tucker and chained linear equatings were evaluated in two testing scenarios. In Scenario 1, referred to as rater comparability scoring and equating, the anchor-to-total correlation is often very high for the new form but moderate for the reference form. This may adversely affect the results of Tucker equating, especially if the new and reference…
Descriptors: Testing, Scoring, Equated Scores, Statistical Analysis
Puhan, Gautam – Journal of Educational Measurement, 2011
The impact of log-linear presmoothing on the accuracy of small-sample chained equipercentile equating was evaluated under two conditions. In the first condition the small samples differed randomly in ability from the target population. In the second condition the small samples were systematically different from the target population. Results…
Descriptors: Equated Scores, Data Analysis, Sample Size, Accuracy
Puhan, Gautam – Journal of Educational Measurement, 2010
In this study I compared results of chained linear, Tucker, and Levine observed-score equatings under conditions where the new- and old-form samples were similar in ability and also when they were different in ability. The length of the anchor test was also varied to examine its effect on the three different equating methods. The three equating…
Descriptors: Testing, Equated Scores, Comparative Analysis, Causal Models
Puhan, Gautam; Moses, Timothy P.; Grant, Mary C.; McHale, Frederick – Journal of Educational Measurement, 2009
A single-group (SG) equating with nearly equivalent test forms (SiGNET) design was developed by Grant to equate small-volume tests. Under this design, the scored items for the operational form are divided into testlets or mini tests. An additional testlet is created but not scored for the first form. If the scored testlets are testlets 1-6 and the…
Descriptors: Equated Scores, Test Construction, Measurement, Measures (Individuals)
Puhan, Gautam; Moses, Timothy P.; Yu, Lei; Dorans, Neil J. – Journal of Educational Measurement, 2009
This study examined the extent to which log-linear smoothing could improve the accuracy of differential item functioning (DIF) estimates in small samples of examinees. Examinee responses from a certification test were analyzed using White examinees in the reference group and African American examinees in the focal group. Using a simulation…
Descriptors: Test Items, Reference Groups, Testing Programs, Raw Scores

