Publication Date
| In 2015 | 0 |
| Since 2014 | 0 |
| Since 2011 (last 5 years) | 0 |
| Since 2006 (last 10 years) | 1 |
| Since 1996 (last 20 years) | 1 |
Descriptor
| Test Validity | 5 |
| Achievement Tests | 4 |
| Cutting Scores | 4 |
| Test Construction | 4 |
| Test Interpretation | 4 |
| Test Use | 4 |
| Evaluation Methods | 3 |
| Higher Education | 3 |
| Scores | 3 |
| Secondary Education | 3 |
| More ▼ | |
Source
| Journal of Educational… | 21 |
Author
| Cross, Lawrence H. | 2 |
| Bejar, Isaac I. | 1 |
| Bell, John F. | 1 |
| Beuk, Cees H. | 1 |
| Finch, Holmes | 1 |
| Fitzpatrick, Anne R. | 1 |
| Flynn, James R. | 1 |
| Hanna, Gila | 1 |
| Hardy, Roy A. | 1 |
| Hastings, C. Nicholas | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 21 |
| Reports - Research | 18 |
| Information Analyses | 2 |
| Opinion Papers | 1 |
| Reports - Evaluative | 1 |
| Tests/Questionnaires | 1 |
Education Level
Audience
| Researchers | 21 |
Showing 1 to 15 of 21 results
Finch, Holmes – Journal of Educational Measurement, 2008
Missing data are a common problem in a variety of measurement settings, including responses to items on both cognitive and affective assessments. Researchers have shown that such missing data may create problems in the estimation of item difficulty parameters in the Item Response Theory (IRT) context, particularly if they are ignored. At the same…
Descriptors: Simulation, Item Response Theory, Researchers, Computation
Peer reviewedShepard, Lorrie A.; And Others – Journal of Educational Measurement, 1985
The purpose of this research was to recommend an item bias procedure when the number of minority examinees is too small to use preferred three-parameter item response theory (IRT) methods. The chi-square, Angoff delta-plot, and pseudo-IRT indices were compared with both real and simulated data. (Author/DWH)
Descriptors: Estimation (Mathematics), Item Analysis, Latent Trait Theory, Minority Groups
Peer reviewedJohnson, Sandra; Bell, John F. – Journal of Educational Measurement, 1985
The assessment framework underlying a science performance monitoring program is process-oriented and intended to appeal to generalizability theory for a suitable estimation paradigm. Preliminary applications are described. Results suggest that computerized question-banking, domain-sampling of questions, and generalizablity theory together provide…
Descriptors: Academic Achievement, Computer Assisted Testing, Educational Assessment, Foreign Countries
Peer reviewedCross, Lawrence H.; And Others – Journal of Educational Measurement, 1985
This study evaluated procedures for establishing a minimum performance standard for the essay subtest of the National Teacher Examinations Communications Skills test. Results indicated the preferred procedure for setting standards on essays should involve a blind review followed by an informed review. (Author/DWH)
Descriptors: Beginning Teachers, Cutting Scores, Essay Tests, Evaluation Methods
Peer reviewedMcMorris, Robert F.; And Others – Journal of Educational Measurement, 1985
Two matched forms of a 50 item grammar test were developed. Twenty items designed to be humorous were included in one form. Inclusion of humorous items did not affect grammar scores on matched humorous/nonhumorous items, nor on commmon post-treatment items. Inclusion did not affect results of anxiety measures. (Author/DWH)
Descriptors: Grade 8, Humor, Junior High Schools, Performance Factors
Peer reviewedVan De Vijver, Fon J. R.; Poortinga, Ype H. – Journal of Educational Measurement, 1985
McCauley and Colberg described a theory of transportability and provided data to demonstrate the feasibility of their approach. It is argued that the transportability notion resembles earlier cross cultural work and does not add new insights into cross cultural comparison. Their statistical checks do not preclude the possibility of bias.…
Descriptors: Cross Cultural Studies, Cultural Interrelationships, Culture Fair Tests, Research Problems
Peer reviewedLord, Frederic M. – Journal of Educational Measurement, 1984
Four methods are outlined for estimating or approximating from a single test administration the standard error of measurement of number-right test score at specified ability levels or cutting scores. The methods are illustrated and compared on one set of real test data. (Author)
Descriptors: Academic Ability, Cutting Scores, Error of Measurement, Scoring Formulas
Peer reviewedLinn, Robert L.; Hastings, C. Nicholas – Journal of Educational Measurement, 1984
Using predictive validity studies of the Law School Admissions Test (LSAT) and the undergraduate grade-point average (UGPA), this study examined the large variation in the magnitude of the validity coefficients across schools. LSAT standard deviation and correlation between LSAT and UGPA accounted for 58.5 percent of the variability. (Author/EGS)
Descriptors: Academic Achievement, College Applicants, College Entrance Examinations, Grade Point Average
Peer reviewedJackson, Douglas N.; And Others – Journal of Educational Measurement, 1984
The Jackson Vocational Interest Survey was administered to 10,134 high school college-bound students to appraise the degree to which student profiles for different academic majors could be clustered in a meaningful way. The clustering approach showed definite advantages over the traditional item-based scoring, resulting in more reliable score.…
Descriptors: Career Choice, Classification, Cluster Analysis, College Students
Peer reviewedHughes, David C.; Keeling, Brian – Journal of Educational Measurement, 1984
Several studies have shown that essays receive higher marks when preceded by poor quality scripts than when preceded by good quality scripts. This study investigated the effectiveness of providing scorers with model essays to reduce the influence of context. Context effects persisted despite the scoring procedures used. (Author/EGS)
Descriptors: Context Effect, Essay Tests, Essays, High Schools
Peer reviewedFlynn, James R. – Journal of Educational Measurement, 1984
Thorndike's Stanford-Binet data suggest that from 1932 to 1971-72 preschool children enjoyed greater IQ gains than older children, possibly due to the rise of television. Additional analysis indicated that gains were either due to sampling error or totally antedated 1947. Gains of 12 IQ points were found for Americans. (Author/EGS)
Descriptors: Achievement Gains, Age Differences, Intelligence Differences, Intelligence Quotient
Peer reviewedHardy, Roy A. – Journal of Educational Measurement, 1984
To determine to what extent competencies to be measured by the Alabama High School Graduation Examination were being taught in the Alabama public schools, a survey was conducted of teachers of grades 7, 8, 9, and 10. Competencies that were not being taught are identified and possible explanations are outlined. (Author/EGS)
Descriptors: Academic Standards, Competency Based Education, Evaluation Criteria, Graduation Requirements
Peer reviewedFitzpatrick, Anne R. – Journal of Educational Measurement, 1984
This article reviews the Basic Achievement Skills Individual Screener (BASIS), an individually administered achievement battery that consists of skills tests in reading, mathematics, and spelling as well as an optional writing exercise. BASIS is found to be an effective and efficient means of assessing basic skills. (Author/EGS)
Descriptors: Achievement Tests, Basic Skills, Screening Tests, Test Construction
Peer reviewedWainer, Howard – Journal of Educational Measurement, 1984
Techniques of exploratory data analysis (EDA) were used to decompose data tables portraying performance of ethnic groups on the Scholastic Aptitude Test. These analyses indicate the size and structure of differences in performance among groups studied, nature of changes across time, and interactions between group membership and time. (Author/DWH)
Descriptors: College Entrance Examinations, Data Analysis, Educational Trends, Ethnic Groups
Peer reviewedYen, Wendy M. – Journal of Educational Measurement, 1984
A procedure for obtaining maximum likelihood trait estimates from number-correct (NC) scores for the three-parameter logistic model is presented. It produces an NC score to trait estimate conversion table. Analyses in the estimated true score metric confirm the conclusions made in the trait metric. (Author/DWH)
Descriptors: Achievement Tests, Error of Measurement, Estimation (Mathematics), Latent Trait Theory
Previous Page | Next Page ยป
Pages: 1 | 2
Direct link
