50 Years of ERIC
The Education Resources Information Center (ERIC) is celebrating its 50th birthday! First opened on May 15th, 1964, ERIC continues its long tradition of ongoing innovation and enhancement.


Showing all 13 results
Peer reviewed
Direct link
Sanchez, Juan D. – Journal of Applied Measurement, 2011
The San Francisco Unified School District (SFUSD) uses the Language and Literacy Assessment Rubric (LALAR) as the secondary measurement required by the No Child Left Behind (NCLB) Act to measure English proficiency of English language learners (ELLs). In this analysis, the Rasch model is used to identify whether the LALAR is a valid measurement…
Descriptors: Validity, English (Second Language), Language Proficiency, English Language Learners
Peer reviewed
Direct link
Mat Daud, Nuraihan; Abu Kassim, Noor Lide – Journal of Applied Measurement, 2011
Students' evaluations of teaching staff can be considered high-stakes, as they are often used to determine promotion, reappointment, and merit pay for academics. Using Facets, the reliability and validity of one student rating questionnaire are analysed. A total of 13,940 respondents of the Human Science Division of International Islamic University…
Descriptors: Student Evaluation of Teacher Performance, Questionnaires, Validity, Reliability
Peer reviewed
Direct link
Babiar, Tasha Calvert – Journal of Applied Measurement, 2011
Traditionally, women and minorities have not been fully represented in science and engineering. Numerous studies have attributed these differences to gaps in science achievement as measured by various standardized tests. Rather than describe mean group differences in science achievement across multiple cultures, this study focused on an in-depth…
Descriptors: Test Bias, Science Achievement, Standardized Tests, Grade 8
Peer reviewed
Direct link
Weaver, Christopher – Journal of Applied Measurement, 2011
This study presents a systematic investigation concerning the performance of different rating scales used in the English section of a university entrance examination to assess 1,287 Japanese test takers' ability to write a third-person introduction speech. Although the rating scales did not conform to all of the expectations of the Rasch model,…
Descriptors: Rating Scales, English (Second Language), Language Tests, College Entrance Examinations
Peer reviewed
Direct link
Bassiri, Dina; Schulz, E. Mathew – Journal of Applied Measurement, 2011
In this study, the Rasch rating scale model (Andrich, 1978) was applied to college grades of four freshman cohorts from a large public university. After editing, the data represented approximately 34,000 students, 1,700 courses and 119 departments. The rating scale model analysis yielded measures of student achievement and course difficulty.…
Descriptors: Grade Point Average, Courses, Difficulty Level, Academic Achievement
Peer reviewed
Direct link
Lunz, Mary; Suanthong, Surintorn – Journal of Applied Measurement, 2011
The desirability of test equating to maintain the same criterion standard from test administration to test administration has long been accepted for multiple choice tests. The same consistency of expectations is desirable for performance tests, especially if they are part of a licensure or certification process or used for other high stakes…
Descriptors: Testing, Equated Scores, Performance Based Assessment
Peer reviewed
Seraphine, Anne E.; Algina, James J.; Miller, M. David – Journal of Applied Measurement, 2001
Examined the Type I error rate and the power of the Stout T procedure (DIMTEST) (W. Stout, 1987, 1990) and the Holland-Rosenbaum procedure (P. Holland and P. Rosenbaum, 1986) for normal and nonnormal data sets through a Monte Carlo study. Both procedures performed adequately under some conditions, but the Stout T procedure showed adequate power…
Descriptors: Evaluation Methods, Monte Carlo Methods, Nonparametric Statistics
Peer reviewed
Prieto, Luis; Roset, Montse; Badia, Xavier – Journal of Applied Measurement, 2001
Tested the metric properties of a Spanish version of the Assessment of Growth Hormone Deficiency in Adults (AGHDA) questionnaire through Rasch analysis with a sample of 356 adult patients in Spain. Results suggest that the Spanish AGHDA could be a useful complement to the clinical evaluation of growth hormone deficiency patients at group and…
Descriptors: Adults, Evaluation Methods, Foreign Countries, Individual Development
Peer reviewed
Wang, LihShing; Li, Chun-Shan – Journal of Applied Measurement, 2001
Used Monte Carlo simulation to compare the relative measurement efficiency of polytomous modeling and dichotomous modeling under different scoring schemes and termination criteria. Results suggest that polytomous computerized adaptive testing (CAT) yields marginal gains over dichotomous CAT when termination criteria are more stringent. Discusses…
Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Monte Carlo Methods
Peer reviewed
Karabatsos, George – Journal of Applied Measurement, 2000
Offers a critical analysis of the residual fit statistics of the Rasch model, demonstrating that Rasch fit analysis is not as simple as it appears to be. Calls for the use of residual-free Rasch fit statistics that are based on the number of Guttman response errors or indices that are optimal statistically for detecting measurement disturbances.…
Descriptors: Goodness of Fit, Item Response Theory
Peer reviewed
Wang, Wen-Chung – Journal of Applied Measurement, 2000
Proposes a factorial procedure for investigating differential distractor functioning in multiple choice items that models each distractor with a distinct distractibility parameter. Results of a simulation study show that the parameters of the proposed modeling were recovered very well. Analysis of 10 4-choice items from a college entrance…
Descriptors: College Entrance Examinations, Distractors (Tests), Factor Structure, Foreign Countries
Peer reviewed
Smith, Everett V., Jr. – Journal of Applied Measurement, 2000
Describes problems in score reporting with the True Score model, defines the Rasch measurement unit (the logit), reviews transformations of the logit metric, and provides examples of score reporting procedures. Uses dichotomous data from an examination taken by 126 Ph.D. students and polychotomous data from a self-efficacy assessment completed by…
Descriptors: Graduate Students, Graduate Study, Item Response Theory, Posttraumatic Stress Disorder
Peer reviewed
Dawson, Theo Linda – Journal of Applied Measurement, 2000
Re-examined the 13-year lifespan study of moral and evaluative reasoning conducted by C. Armon, who interviewed 23 females and 19 males ranging in age from 5 at first test time (1977) to 86 at the fourth interview (1989). Rasch analysis of Armon's data shows that the measures used tap a single underlying dimension of reasoning. Discusses results…
Descriptors: Adults, Age Differences, Children, Individual Development