Publication Date
| In 2015 | 0 |
| Since 2014 | 2 |
| Since 2011 (last 5 years) | 11 |
| Since 2006 (last 10 years) | 28 |
| Since 1996 (last 20 years) | 48 |
Descriptor
| Scoring Formulas | 470 |
| Test Reliability | 124 |
| Multiple Choice Tests | 113 |
| Guessing (Tests) | 98 |
| Test Validity | 90 |
| Higher Education | 82 |
| Scoring | 76 |
| Test Interpretation | 72 |
| Test Construction | 61 |
| Test Items | 56 |
| More ▼ | |
Source
Author
| Weiss, David J. | 11 |
| Frary, Robert B. | 10 |
| Wilcox, Rand R. | 7 |
| Lord, Frederic M. | 6 |
| Angoff, William H. | 5 |
| Echternacht, Gary | 5 |
| Plake, Barbara S. | 5 |
| Albanese, Mark A. | 4 |
| Cross, Lawrence H. | 4 |
| Hambleton, Ronald K. | 4 |
| More ▼ | |
Publication Type
Education Level
| Higher Education | 13 |
| Elementary Secondary Education | 7 |
| Adult Education | 4 |
| Secondary Education | 3 |
| Postsecondary Education | 2 |
| Early Childhood Education | 1 |
| Elementary Education | 1 |
| Grade 4 | 1 |
| Grade 8 | 1 |
| High Schools | 1 |
| More ▼ | |
Audience
| Researchers | 12 |
| Practitioners | 8 |
| Policymakers | 2 |
| Teachers | 1 |
Showing 1 to 15 of 470 results
Yu, Eunjyu – Research & Teaching in Developmental Education, 2014
In a study designed to analyze faculty and student perceptions of the value of digital writing in the first year composition classroom, 21 first-year college students and a nationwide sample of 50 college composition teachers participated in conceptualizing digital multimodal composition and defining the benchmarks for first-year college digital…
Descriptors: Developmental Programs, Freshman Composition, Electronic Publishing, Benchmarking
Saupe, Joe L.; Eimers, Mardy T. – Association for Institutional Research (NJ1), 2011
Critics of testing for admission purposes cite the moderate correlations of admissions test scores with success in college. In response, this study applies formulas from classical measurement theory to observed correlations to correct for restricted variances in predictor and success variables. Estimates of the correlations in the population of…
Descriptors: High School Graduates, College Entrance Examinations, Scores, Correlation
Dorans, Neil J.; Liang, Longjuan; Puhan, Gautam – Educational Testing Service, 2010
Scores are the most visible and widely used products of a testing program. The choice of score scale has implications for test specifications, equating, and test reliability and validity, as well as for test interpretation. At the same time, the score scale should be viewed as infrastructure likely to require repair at some point. In this report…
Descriptors: Testing Programs, Standard Setting (Scoring), Test Interpretation, Certification
Baldi, Stephane, Ed.; Kutner, Mark; Greenberg, Elizabeth; Jin, Ying; Baer, Justin; Moore, Elizabeth; Dunleavy, Eric; Berlin, Martha; Mohadjer, Leyla; Binzer, Greg; Krenzke, Thomas; Hogan, Jacqueline; Amsbary, Michelle; Forsyth, Barbara; Clark, Lyn; Annis, Terri; Bernstein, Jared; White, Sheida – National Center for Education Statistics, 2009
The 2003 National Assessment of Adult Literacy (NAAL) assessed the English literacy skills of a nationally representative sample of more than 19,000 U.S. adults (age 16 and older) residing in households and correctional institutions. NAAL is the first national assessment of adult literacy since the 1992 National Adult Literacy Survey (NALS). The…
Descriptors: Correctional Institutions, Scaling, Numeracy, Field Tests
Murphy, Brooke; Dionigi, Rylee A.; Litchfield, Chelsea – Issues in Educational Research, 2014
We argue that gender issues in physical education (PE) remain in some schools, despite advances in PE research and curricula aimed at engaging females in PE. We interviewed five Australian PE teachers (1 male and 4 females) at a co-educational, regional high school about the factors affecting female participation in PE and the strategies they used…
Descriptors: Physical Education, Females, Case Studies, Teacher Attitudes
Laitsch, Dan – Association for Supervision and Curriculum Development, 2005
Standardized testing plays an increasingly important role in the lives of today's students and educators. The U.S. No Child Left Behind Act (NCLB) requires assessment in math and literacy in grades 3-8 and 10 and, as of 2007-08, in science once in grades 3-5, 6-9, and 10-12. Based on National Center for Education Statistics enrollment projections,…
Descriptors: Testing, Standardized Tests, Enrollment Projections, Accountability
Bennett, John; Tognolini, Jim; Pickering, Samantha – Assessment in Education: Principles, Policy & Practice, 2012
This paper describes how a state education system in Australia introduced standards-referenced assessments into its large-scale, high-stakes, curriculum-based examinations in a way that enables comparison of performance across time even though the examinations are different each year. It describes the multi-stage modified Angoff standard-setting…
Descriptors: Feedback (Response), Tests, Foreign Countries, Cutting Scores
Barkaoui, Khaled – Assessment in Education: Principles, Policy & Practice, 2011
This study examined the effects of marking method and rater experience on ESL (English as a Second Language) essay test scores and rater performance. Each of 31 novice and 29 experienced raters rated a sample of ESL essays both holistically and analytically. Essay scores were analysed using a multi-faceted Rasch model to compare test-takers'…
Descriptors: Writing Evaluation, Writing Tests, Essay Tests, Interrater Reliability
Ahmed, Ayesha; Pollitt, Alastair – Assessment in Education: Principles, Policy & Practice, 2011
At the heart of most assessments lies a set of questions, and those who write them must achieve "two" things. Not only must they ensure that each question elicits the kind of performance that shows how "good" pupils are at the subject, but they must also ensure that each mark scheme gives more marks to those who are "better" at it. We outline a…
Descriptors: Academic Achievement, Classification, Educational Quality, Quality Assurance
Attali, Yigal – Applied Psychological Measurement, 2011
Recently, Attali and Powers investigated the usefulness of providing immediate feedback on the correctness of answers to constructed response questions and the opportunity to revise incorrect answers. This article introduces an item response theory (IRT) model for scoring revised responses to questions when several attempts are allowed. The model…
Descriptors: Feedback (Response), Item Response Theory, Models, Error Correction
Kreiner, Svend – Applied Psychological Measurement, 2011
To rule out the need for a two-parameter item response theory (IRT) model during item analysis by Rasch models, it is important to check the Rasch model's assumption that all items have the same item discrimination. Biserial and polyserial correlation coefficients measuring the association between items and restscores are often used in an informal…
Descriptors: Item Analysis, Correlation, Item Response Theory, Models
Wang, Tsung Juang – Teaching in Higher Education, 2011
Virtual world technology is now being incorporated into various higher education programs, often with enthusiastic claims about the improvement of students' abilities to experience learning problems and tasks in computer-mediated virtual reality through the use of computer-generated personal agents or avatars. The interactivity of the avatars with…
Descriptors: Constructivism (Learning), Learning Problems, Computer Simulation, Scoring Formulas
Stewart, Jeffrey; White, David A. – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2011
Multiple-choice tests such as the Vocabulary Levels Test (VLT) are often viewed as a preferable estimator of vocabulary knowledge when compared to yes/no checklists, because self-reporting tests introduce the possibility of students overreporting or underreporting scores. However, multiple-choice tests have their own unique disadvantages. It has…
Descriptors: Guessing (Tests), Scoring Formulas, Multiple Choice Tests, Test Reliability
Marasini, Donata; Quatto, Piero – Journal of Applied Quantitative Methods, 2011
Let X be a statistical variable representing student ratings of University teaching. It is natural to assume for X an ordinal scale consisting of k categories (in ascending order of satisfaction). At first glance, student ratings can be summarized by a location index (such as the mode or the median of X) associated with a convenient measure of…
Descriptors: Scientific Concepts, College Instruction, Student Evaluation of Teacher Performance, Data Interpretation
Huerta-Wong, Juan Enrique; Schoech, Richard – Journal of Social Work Education, 2010
Social work education research frequently has suggested an interaction between teaching techniques and learning environments. However, this interaction has never been tested. This study compared virtual and face-to-face learning environments and included active listening concepts to test whether the effectiveness of learning environments depends…
Descriptors: Foreign Countries, Social Work, Higher Education, Relationship

Peer reviewed
Direct link
