NotesFAQContact Us
Collection
Advanced
Search Tips
Laws, Policies, & Programs
Elementary and Secondary…2
Assessments and Surveys
What Works Clearinghouse Rating
Showing 1 to 15 of 42 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Han, Yuting; Zhang, Jihong; Jiang, Zhehan; Shi, Dexin – Educational and Psychological Measurement, 2023
In the literature of modern psychometric modeling, mostly related to item response theory (IRT), the fit of model is evaluated through known indices, such as X[superscript 2], M2, and root mean square error of approximation (RMSEA) for absolute assessments as well as Akaike information criterion (AIC), consistent AIC (CAIC), and Bayesian…
Descriptors: Goodness of Fit, Psychometrics, Error of Measurement, Item Response Theory
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Goldin, Ilya; Galyardt, April – Journal of Educational Data Mining, 2018
Data from student learning provide learning curves that, ideally, demonstrate improvement in student performance over time. Existing data mining methods can leverage these data to characterize and improve the domain models that support a learning environment, and these methods have been validated both with already-collected data, and in…
Descriptors: Predictor Variables, Models, Learning Processes, Matrices
Peer reviewed Peer reviewed
Direct linkDirect link
Pfeiffer, Steven I.; Jarosewich, Tania – Gifted Child Quarterly, 2007
This study analyzes the standardization sample of a new teacher rating scale designed to assist in the identification of gifted students. The Gifted Rating Scales-School Form (GRS-S) is based on a multidimensional model of giftedness. Results indicate no age or race/ethnicity differences on any of the scales and small but significant differences…
Descriptors: Academically Gifted, Teacher Evaluation, Intelligence Quotient, Rating Scales
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Tan, Teck Kiang – Practical Assessment, Research & Evaluation, 2022
Power analysis based on the analytical t-test is an important aspect of a research study to determine the sample size required to detect the effect for the comparison of two means. The current paper presents a reader-friendly procedure for carrying out the t-test power analysis using the various R add-on packages. While there is a growing of R…
Descriptors: Programming Languages, Sample Size, Bayesian Statistics, Intervention
Peer reviewed Peer reviewed
Direct linkDirect link
Liu, Jin – Journal of Educational and Behavioral Statistics, 2022
Longitudinal data analysis has been widely employed to examine between-individual differences in within-individual changes. One challenge of such analyses is that the rate-of-change is only available indirectly when change patterns are nonlinear with respect to time. Latent change score models (LCSMs), which can be employed to investigate the…
Descriptors: Longitudinal Studies, Individual Differences, Scores, Models
Echternacht, Gary – 1980
The Normal Curve Equivalent (NCE) gain statistic is examined, and considerations for its interpretation are highlighted. The NCE gain is made up of an observed and an expected part. The observed score is the posttest result. The expected score can never be observed nor verified for any of the Title I Evaluation and Reporting System (TIERS) models;…
Descriptors: Educational Assessment, Elementary Secondary Education, Evaluation Methods, Program Evaluation
Peer reviewed Peer reviewed
Direct linkDirect link
Finch, Holmes; Edwards, Julianne M. – Educational and Psychological Measurement, 2016
Standard approaches for estimating item response theory (IRT) model parameters generally work under the assumption that the latent trait being measured by a set of items follows the normal distribution. Estimation of IRT parameters in the presence of nonnormal latent traits has been shown to generate biased person and item parameter estimates. A…
Descriptors: Item Response Theory, Computation, Nonparametric Statistics, Bayesian Statistics
Peer reviewed Peer reviewed
Direct linkDirect link
Woods, Carol M. – Applied Psychological Measurement, 2011
Differential item functioning (DIF) occurs when an item on a test, questionnaire, or interview has different measurement properties for one group of people versus another, irrespective of true group-mean differences on the constructs being measured. This article is focused on item response theory based likelihood ratio testing for DIF (IRT-LR or…
Descriptors: Simulation, Item Response Theory, Testing, Questionnaires
Peer reviewed Peer reviewed
Direct linkDirect link
Wyse, Adam E. – Educational Measurement: Issues and Practice, 2017
This article illustrates five different methods for estimating Angoff cut scores using item response theory (IRT) models. These include maximum likelihood (ML), expected a priori (EAP), modal a priori (MAP), and weighted maximum likelihood (WML) estimators, as well as the most commonly used approach based on translating ratings through the test…
Descriptors: Cutting Scores, Item Response Theory, Bayesian Statistics, Maximum Likelihood Statistics
Peer reviewed Peer reviewed
Direct linkDirect link
Baker, Ryan S.; Hershkovitz, Arnon; Rossi, Lisa M.; Goldstein, Adam B.; Gowda, Sujith M. – Journal of the Learning Sciences, 2013
We present a new method for analyzing a student's learning over time for a specific skill: analysis of the graph of the student's moment-by-moment learning over time. Moment-by-moment learning is calculated using a data-mined model that assesses the probability that a student learned a skill or concept at a specific time during learning (Baker,…
Descriptors: Learning Processes, Intelligent Tutoring Systems, Probability, Skill Development
Peer reviewed Peer reviewed
Direct linkDirect link
Åström, Therese; Gumpert, Clara Hellner; Andershed, Anna-Karin; Forster, Martin – Research on Social Work Practice, 2017
Purpose: This study investigated the utility of the risk assessment "Structured Assessment of Violence Risk in Youth" (SAVRY) within the social services in Stockholm County, Sweden. Method: SAVRY assessments of 56 adolescents were compared to assessments guided by another instrument (Adolescent Drug Abuse Diagnosis [ADAD]; n = 38) and…
Descriptors: Violence, Risk, Recidivism, Measures (Individuals)
Peer reviewed Peer reviewed
Direct linkDirect link
Martin, Andrew J.; Darlow, Brian A.; Salt, Alison; Hague, Wendy; Sebastian, Lucille; Mann, Kristy; Tarnow-Mordi, William – Developmental Medicine & Child Neurology, 2012
Aim: The collection of data on longer-term neurodevelopmental outcomes within large neonatal randomized controlled trials by trained assessors can greatly increase costs and present many operational difficulties. The aim of this study was to develop a more practical alternative for identifying major cognitive delay in infants at the age of 24…
Descriptors: Infants, Parents, Cognitive Development, Cognitive Ability
Peer reviewed Peer reviewed
Direct linkDirect link
Coffman, Donna L.; Millsap, Roger E. – Structural Equation Modeling: A Multidisciplinary Journal, 2006
The usefulness of assessing individual fit in latent growth curve models was examined. The study used simulated data based on an unconditional and a conditional latent growth curve model with a linear component and a small quadratic component and a linear model was fit to the data. Then the overall fit of linear and quadratic models to these data…
Descriptors: Structural Equation Models, Evaluation Methods, Goodness of Fit, Individual Development
Peer reviewed Peer reviewed
Direct linkDirect link
Roberts, James S. – Applied Psychological Measurement, 2008
Orlando and Thissen (2000) developed an item fit statistic for binary item response theory (IRT) models known as S-X[superscript 2]. This article generalizes their statistic to polytomous unfolding models. Four alternative formulations of S-X[superscript 2] are developed for the generalized graded unfolding model (GGUM). The GGUM is a…
Descriptors: Item Response Theory, Goodness of Fit, Test Items, Models
Peer reviewed Peer reviewed
Direct linkDirect link
Dube, Chad; Rotello, Caren M.; Heit, Evan – Psychological Review, 2011
In "Assessing the Belief Bias Effect With ROCs: It's a Response Bias Effect," Dube, Rotello, and Heit (2010) examined the form of receiver operating characteristic (ROC) curves for reasoning and the effects of belief bias on measurement indices that differ in whether they imply a curved or linear ROC function. We concluded that the ROC…
Descriptors: Response Style (Tests), Evaluation Methods, Statistics, Validity
Previous Page | Next Page »
Pages: 1  |  2  |  3