Showing 1 to 15 of 19 results
Peer reviewed
Wang, Yan; Kim, Eun Sook; Dedrick, Robert F.; Ferron, John M.; Tan, Tony – Educational and Psychological Measurement, 2018
Wording effects associated with positively and negatively worded items have been found in many scales. Such effects may threaten construct validity and introduce systematic bias in the interpretation of results. A variety of models have been applied to address wording effects, such as the correlated uniqueness model and the correlated traits and…
Descriptors: Test Items, Test Format, Correlation, Construct Validity
Lockwood, J. R.; Castellano, Katherine E. – Educational and Psychological Measurement, 2017
Student Growth Percentiles (SGPs) increasingly are being used in the United States for inferences about student achievement growth and educator effectiveness. Emerging research has indicated that SGPs estimated from observed test scores have large measurement errors. As such, little is known about "true" SGPs, which are defined in terms…
Descriptors: Item Response Theory, Correlation, Student Characteristics, Academic Achievement
Peer reviewed
Huang, Francis L.; Cornell, Dewey G. – Educational and Psychological Measurement, 2016
Bullying among youth is recognized as a serious student problem, especially in middle school. The most common approach to measuring bullying is through student self-report surveys that ask questions about different types of bullying victimization. Although prior studies have shown that question-order effects may influence participant responses, no…
Descriptors: Victims of Crime, Bullying, Middle School Students, Measures (Individuals)
Peer reviewed
Cheng, Ying; Shao, Can; Lathrop, Quinn N. – Educational and Psychological Measurement, 2016
Due to its flexibility, the multiple-indicator, multiple-causes (MIMIC) model has become an increasingly popular method for the detection of differential item functioning (DIF). In this article, we propose the mediated MIMIC model method to uncover the underlying mechanism of DIF. This method extends the usual MIMIC model by including one variable…
Descriptors: Test Bias, Models, Simulation, Sample Size
Peer reviewed
Konstantopoulos, Spyros; Li, Wei; Miller, Shazia R.; van der Ploeg, Arie – Educational and Psychological Measurement, 2016
We use data from a large-scale experiment conducted in Indiana in 2009-2010 to examine the impact of two interim assessment programs (mCLASS and Acuity) across the mathematics and reading achievement distributions. Specifically, we focus on whether the use of interim assessments has a particularly strong effect on improving outcomes for low…
Descriptors: Educational Assessment, Mathematics Achievement, Reading Achievement, Regression (Statistics)
Peer reviewed
Attali, Yigal; Laitusis, Cara; Stone, Elizabeth – Educational and Psychological Measurement, 2016
There are many reasons to believe that open-ended (OE) and multiple-choice (MC) items elicit different cognitive demands of students. However, empirical evidence that supports this view is lacking. In this study, we investigated the reactions of test takers to an interactive assessment with immediate feedback and answer-revision opportunities for…
Descriptors: Test Items, Questioning Techniques, Differences, Student Reaction
Peer reviewed
Wang, Wen-Chung; Chen, Hui-Fang; Jin, Kuan-Yu – Educational and Psychological Measurement, 2015
Many scales contain both positively and negatively worded items. Reverse recoding of negatively worded items might not be enough for them to function as positively worded items do. In this study, we commented on the drawbacks of existing approaches to wording effect in mixed-format scales and used bi-factor item response theory (IRT) models to…
Descriptors: Item Response Theory, Test Format, Language Usage, Test Items
Peer reviewed
Wetzel, Eunike; Xu, Xueli; von Davier, Matthias – Educational and Psychological Measurement, 2015
In large-scale educational surveys, a latent regression model is used to compensate for the shortage of cognitive information. Conventionally, the covariates in the latent regression model are principal components extracted from background data. This operational method has several important disadvantages, such as the handling of missing data and…
Descriptors: Surveys, Regression (Statistics), Models, Research Methodology
Peer reviewed
Nezhnov, Peter; Kardanova, Elena; Vasilyeva, Marina; Ludlow, Larry – Educational and Psychological Measurement, 2015
The present study tested the possibility of operationalizing levels of knowledge acquisition based on Vygotsky's theory of cognitive growth. An assessment tool (SAM-Math) was developed to capture a hypothesized hierarchical structure of mathematical knowledge consisting of procedural, conceptual, and functional levels. In Study 1, SAM-Math was…
Descriptors: Knowledge Level, Mathematics, Cognitive Development, Vertical Organization
Peer reviewed
Mashburn, Andrew J.; Meyer, J. Patrick; Allen, Joseph P.; Pianta, Robert C. – Educational and Psychological Measurement, 2014
Observational methods are increasingly being used in classrooms to evaluate the quality of teaching. Operational procedures for observing teachers are somewhat arbitrary in existing measures and vary across different instruments. To study the effect of different observation procedures on score reliability and validity, we conducted an experimental…
Descriptors: Observation, Teacher Evaluation, Reliability, Validity
Peer reviewed
Dowdy, Erin; Nylund-Gibson, Karen; Felix, Erika D.; Morovati, Diane; Carnazzo, Katherine W.; Dever, Bridget V. – Educational and Psychological Measurement, 2014
The practice of screening students to identify behavioral and emotional risk is gaining momentum, with limited guidance regarding the frequency with which screenings should occur. Screening frequency decisions are influenced by the stability of the constructs assessed and changes in risk status over time. This study investigated the 4-year…
Descriptors: Screening Tests, Risk, Behavior Disorders, Emotional Disturbances
Peer reviewed
Gable, Robert K.; Ludlow, Larry H.; McCoach, D. Betsy; Kite, Stacey L. – Educational and Psychological Measurement, 2011
The development of the Survey of Knowledge of Internet Risk and Internet Behavior is described. A total of 1,366 Grades 7 and 8 male and female students from an urban, suburban, and rural school offered agree-disagree responses to 26 statements defining one Knowledge Scale and five behavior dimensions. Literature-based support is presented for…
Descriptors: Content Validity, Construct Validity, Risk, Measures (Individuals)
Peer reviewed
Wang, Shudong; Jiao, Hong – Educational and Psychological Measurement, 2009
In practice, vertical scales have been continually used to measure students' achievement progress across several grade levels and have been considered very challenging psychometric procedures. Recently, such practices have been drawing many criticisms. The major criticisms focus on dimensionality and construct equivalence of the latent trait or…
Descriptors: Reading Comprehension, Elementary Secondary Education, Measures (Individuals), Psychometrics
Peer reviewed
Pomplun, Mark R. – Educational and Psychological Measurement, 2009
This study investigated convergent validity evidence for student growth scores with high school course grades. The Measures of Academic Progress and Educational Planning and Assessment System growth scores for approximately 1,800 ninth-grade students over 2 years were related to language arts and mathematics course grades for developmental,…
Descriptors: Honors Curriculum, Grades (Scholastic), Academic Achievement, Validity
Peer reviewed
Attali, Yigal; Powers, Donald – Educational and Psychological Measurement, 2009
A developmental writing scale for timed essay-writing performance was created on the basis of automatically computed indicators of writing fluency, word choice, and conventions of standard written English. In a large-scale data collection effort that involved a national sample of more than 12,000 students from 4th, 6th, 8th, 10th, and 12th grade,…
Descriptors: Validity, Measures (Individuals), Scoring, Essays