Publication Date
| In 2015 | 0 |
| Since 2014 | 0 |
| Since 2011 (last 5 years) | 4 |
| Since 2006 (last 10 years) | 5 |
| Since 1996 (last 20 years) | 22 |
Descriptor
| Performance Based Assessment | 47 |
| Educational Assessment | 17 |
| Evaluation Methods | 17 |
| Test Construction | 14 |
| Elementary Secondary Education | 10 |
| Scoring | 10 |
| Standards | 10 |
| Decision Making | 9 |
| Standard Setting (Scoring) | 9 |
| Evaluators | 7 |
| More ▼ | |
Source
| Applied Measurement in… | 47 |
Author
| Plake, Barbara S. | 4 |
| Hambleton, Ronald K. | 3 |
| Klein, Stephen P. | 3 |
| Shavelson, Richard J. | 3 |
| Clauser, Brian E. | 2 |
| Crocker, Linda | 2 |
| Gao, Xiaohong | 2 |
| Goldberg, Gail Lynn | 2 |
| Lane, Suzanne | 2 |
| Stecher, Brian M. | 2 |
| More ▼ | |
Publication Type
| Journal Articles | 47 |
| Reports - Evaluative | 28 |
| Reports - Research | 14 |
| Information Analyses | 7 |
| Speeches/Meeting Papers | 6 |
| Reports - Descriptive | 3 |
| Book/Product Reviews | 1 |
| Guides - Non-Classroom | 1 |
| Opinion Papers | 1 |
Education Level
| Grade 5 | 1 |
| Grade 8 | 1 |
| Higher Education | 1 |
Audience
Showing 1 to 15 of 47 results
Kuhlemeier, Hans; Hemker, Bas; van den Bergh, Huub – Applied Measurement in Education, 2013
In recent years many countries have introduced authentic performance-based assessments in their national exam systems. Teachers' ratings of their own candidates' performances may suffer from errors of leniency and range restriction. The goal of this study was to examine the impact of manipulating the descriptiveness, balancedness, and polarity of…
Descriptors: Performance Based Assessment, Rating Scales, Scores, High Stakes Tests
Gattamorta, Karina A.; Penfield, Randall D. – Applied Measurement in Education, 2012
The study of measurement invariance in polytomous items that targets individual score levels is known as differential step functioning (DSF). The analysis of DSF requires the creation of a set of dichotomizations of the item response variable. There are two primary approaches for creating the set of dichotomizations to conduct a DSF analysis: the…
Descriptors: Measurement, Item Response Theory, Test Bias, Test Items
Kahraman, Nilufer; De Champlain, Andre; Raymond, Mark – Applied Measurement in Education, 2012
Item-level information, such as difficulty and discrimination are invaluable to the test assembly, equating, and scoring practices. Estimating these parameters within the context of large-scale performance assessments is often hindered by the use of unbalanced designs for assigning examinees to tasks and raters because such designs result in very…
Descriptors: Performance Based Assessment, Medicine, Factor Analysis, Test Items
Sinha, Ruchi; Oswald, Frederick; Imus, Anna; Schmitt, Neal – Applied Measurement in Education, 2011
The current study examines how using a multidimensional battery of predictors (high-school grade point average (GPA), SAT/ACT, and biodata), and weighting the predictors based on the different values institutions place on various student performance dimensions (college GPA, organizational citizenship behaviors (OCBs), and behaviorally anchored…
Descriptors: Grade Point Average, Interrater Reliability, Rating Scales, College Admission
Wang, Lihshing; Beckett, Gulbahar H.; Brown, Lionel – Applied Measurement in Education, 2006
Standardized assessment in school systems has been the center of debate for decades. Although the voices of opponents of standardized tests have dominated the public forum, only a handful of scholars and practitioners have argued in defense of standardized tests. This article provides a critical synthesis of the controversial issues on…
Descriptors: Accountability, Educational Change, Standardized Tests, Academic Achievement
Schafer, William D. – Applied Measurement in Education, 2005
Two concerns related to setting performance standards on educational assessments are discussed. First, criteria for a standard-setting process from the point of view of a standard-setting sponsor, called here "institutional criteria," are developed using a state department of education as an example. Four institutional criteria are proposed: (a)…
Descriptors: Evaluation Criteria, Standard Setting, Educational Policy, Educational Objectives
Peer reviewedClauser, Brian E.; Kane, Michael T.; Swanson, David B. – Applied Measurement in Education, 2002
Attempts to place the issues associated with computer-automated scoring within the context of current validity theory and presents a taxonomy of automated scoring procedures as a framework for discussing threats to validity that may take on increased importance for specific approaches to automated scoring. (SLD)
Descriptors: Classification, Computer Uses in Education, Performance Based Assessment, Test Construction
Peer reviewedWolfe, Edward W.; Gitomer, Drew H. – Applied Measurement in Education, 2001
Attempted to improve the measurement quality of a complex performance assessment through principled assessment design using the example of the National Board for Professional Teaching Standards Early Childhood/Generalist examination. All indexes examined improved after revisions were made. Results show the importance of attention to assessment…
Descriptors: Change, Performance Based Assessment, Psychometrics, Scores
Peer reviewedGoldberg, Gail Lynn; Roswell, Barbara Sherr – Applied Measurement in Education, 2001
To determine the factors that contribute to or compromise the effectiveness of multiscored items, this study combined analysis of statewide score data from the 1996 Maryland School Performance Assessment Program tests with systematic analyses of 60 activities providing measures of writing, language usage, or both, and one or more content areas.…
Descriptors: Performance Based Assessment, Scores, State Programs, Testing Programs
Peer reviewedGao, Xiaohong; Brennan, Robert L. – Applied Measurement in Education, 2001
Studied the sampling variability of estimated variance components using data collected over several years for a listening and writing performance assessment and evaluated the stability of estimated measurement precision. Results indicate that the estimated variance components varied from one year to another and suggest that the measurement…
Descriptors: Estimation (Mathematics), Generalizability Theory, Listening Comprehension Tests, Performance Based Assessment
Peer reviewedFuchs, Lynn S.; Fuchs, Douglas; Karns, Kathy; Hamlett, Carol L.; Dutka, Sue; Katzaroff, Michelle – Applied Measurement in Education, 2000
Examined the effects of providing students with background information about the structure and scoring of mathematics performance assessments (PA). Results for 187 elementary school students who had PA orientation and 182 who did not show the effects of test wiseness training for average and above-average students, but not for below-average…
Descriptors: Background, Elementary Education, Elementary School Students, Mathematics
Peer reviewedStecher, Brian M.; Klein, Stephen P.; Solano-Flores, Guillermo; McCaffrey, Dan; Robyn, Abby; Shavelson, Richard J.; Haertel, Edward – Applied Measurement in Education, 2000
Studied content domain, format, and level of inquiry as factors contributing to the large variation in student performance across open-ended measures. Results for more than 1,200 eighth graders do not support the hypothesis that tasks similar in content, format, and level of inquiry would correlate higher with each other than with measures…
Descriptors: Correlation, Inquiry, Junior High School Students, Junior High Schools
Peer reviewedKlein, Stephen P.; Stecher, Brian M.; Shavelson, Richard J.; McCaffrey, Daniel; Ormseth, Tor; Bell, Robert M.; Comfort, Kathy; Othman, Abdul R. – Applied Measurement in Education, 1998
Two studies involving 368 elementary and high school students and 29 readers were conducted to investigate reader consistency, score reliability, and reader time requirements of three hands-on science performance tasks. Holistic scores were as reliable as analytic scores, and there was a high correlation between them after they were disattenuated…
Descriptors: Elementary School Students, Elementary Secondary Education, Hands on Science, High School Students
Peer reviewedMcBee, Maridyth M.; Barnes, Laura L. B. – Applied Measurement in Education, 1998
The temporal stability and intertask consistency of an eighth-grade mathematics performance assessment and how task similarity affects the ability to generalize results of the assessments were studied with results from 101 eighth graders. Results support the suggestion that large-scale performance assessments be used with considerable caution…
Descriptors: Academic Achievement, Grade 8, Junior High School Students, Junior High Schools
Peer reviewedPlake, Barbara S. – Applied Measurement in Education, 1998
Credentialing programs were surveyed to determine the procedures they use to set performance standards on multiple-choice and open-ended assessments. Implications of the various standard-setting approaches for the National Assessment of Educational Progress are discussed, and it is asserted that generalizing from standard-setting in professional…
Descriptors: Certification, Credentials, Elementary Secondary Education, Licensing Examinations (Professions)

Direct link
