Han, Yuting; Zhang, Jihong; Jiang, Zhehan; Shi, Dexin – Educational and Psychological Measurement, 2023
In the literature on modern psychometric modeling, mostly related to item response theory (IRT), model fit is evaluated through known indices, such as X², M2, and root mean square error of approximation (RMSEA) for absolute assessments as well as Akaike information criterion (AIC), consistent AIC (CAIC), and Bayesian…
Descriptors: Goodness of Fit, Psychometrics, Error of Measurement, Item Response Theory
Goldin, Ilya; Galyardt, April – Journal of Educational Data Mining, 2018
Data from student learning provide learning curves that, ideally, demonstrate improvement in student performance over time. Existing data mining methods can leverage these data to characterize and improve the domain models that support a learning environment, and these methods have been validated both with already-collected data, and in…
Descriptors: Predictor Variables, Models, Learning Processes, Matrices
Pfeiffer, Steven I.; Jarosewich, Tania – Gifted Child Quarterly, 2007
This study analyzes the standardization sample of a new teacher rating scale designed to assist in the identification of gifted students. The Gifted Rating Scales-School Form (GRS-S) is based on a multidimensional model of giftedness. Results indicate no age or race/ethnicity differences on any of the scales and small but significant differences…
Descriptors: Academically Gifted, Teacher Evaluation, Intelligence Quotient, Rating Scales
Tan, Teck Kiang – Practical Assessment, Research & Evaluation, 2022
Power analysis based on the analytical t-test is an important aspect of a research study to determine the sample size required to detect the effect for the comparison of two means. The current paper presents a reader-friendly procedure for carrying out the t-test power analysis using the various R add-on packages. While there is a growing of R…
Descriptors: Programming Languages, Sample Size, Bayesian Statistics, Intervention
Liu, Jin – Journal of Educational and Behavioral Statistics, 2022
Longitudinal data analysis has been widely employed to examine between-individual differences in within-individual changes. One challenge of such analyses is that the rate-of-change is only available indirectly when change patterns are nonlinear with respect to time. Latent change score models (LCSMs), which can be employed to investigate the…
Descriptors: Longitudinal Studies, Individual Differences, Scores, Models
Echternacht, Gary – 1980
The Normal Curve Equivalent (NCE) gain statistic is examined, and considerations for its interpretation are highlighted. The NCE gain is made up of an observed and an expected part. The observed score is the posttest result. The expected score can never be observed or verified for any of the Title I Evaluation and Reporting System (TIERS) models;…
Descriptors: Educational Assessment, Elementary Secondary Education, Evaluation Methods, Program Evaluation
Finch, Holmes; Edwards, Julianne M. – Educational and Psychological Measurement, 2016
Standard approaches for estimating item response theory (IRT) model parameters generally work under the assumption that the latent trait being measured by a set of items follows the normal distribution. Estimation of IRT parameters in the presence of nonnormal latent traits has been shown to generate biased person and item parameter estimates. A…
Descriptors: Item Response Theory, Computation, Nonparametric Statistics, Bayesian Statistics
Woods, Carol M. – Applied Psychological Measurement, 2011
Differential item functioning (DIF) occurs when an item on a test, questionnaire, or interview has different measurement properties for one group of people versus another, irrespective of true group-mean differences on the constructs being measured. This article is focused on item response theory based likelihood ratio testing for DIF (IRT-LR or…
Descriptors: Simulation, Item Response Theory, Testing, Questionnaires
Wyse, Adam E. – Educational Measurement: Issues and Practice, 2017
This article illustrates five different methods for estimating Angoff cut scores using item response theory (IRT) models. These include maximum likelihood (ML), expected a posteriori (EAP), modal a posteriori (MAP), and weighted maximum likelihood (WML) estimators, as well as the most commonly used approach based on translating ratings through the test…
Descriptors: Cutting Scores, Item Response Theory, Bayesian Statistics, Maximum Likelihood Statistics
Baker, Ryan S.; Hershkovitz, Arnon; Rossi, Lisa M.; Goldstein, Adam B.; Gowda, Sujith M. – Journal of the Learning Sciences, 2013
We present a new method for analyzing a student's learning over time for a specific skill: analysis of the graph of the student's moment-by-moment learning over time. Moment-by-moment learning is calculated using a data-mined model that assesses the probability that a student learned a skill or concept at a specific time during learning (Baker,…
Descriptors: Learning Processes, Intelligent Tutoring Systems, Probability, Skill Development
Åström, Therese; Gumpert, Clara Hellner; Andershed, Anna-Karin; Forster, Martin – Research on Social Work Practice, 2017
Purpose: This study investigated the utility of the risk assessment "Structured Assessment of Violence Risk in Youth" (SAVRY) within the social services in Stockholm County, Sweden. Method: SAVRY assessments of 56 adolescents were compared to assessments guided by another instrument (Adolescent Drug Abuse Diagnosis [ADAD]; n = 38) and…
Descriptors: Violence, Risk, Recidivism, Measures (Individuals)
Martin, Andrew J.; Darlow, Brian A.; Salt, Alison; Hague, Wendy; Sebastian, Lucille; Mann, Kristy; Tarnow-Mordi, William – Developmental Medicine & Child Neurology, 2012
Aim: The collection of data on longer-term neurodevelopmental outcomes within large neonatal randomized controlled trials by trained assessors can greatly increase costs and present many operational difficulties. The aim of this study was to develop a more practical alternative for identifying major cognitive delay in infants at the age of 24…
Descriptors: Infants, Parents, Cognitive Development, Cognitive Ability
Coffman, Donna L.; Millsap, Roger E. – Structural Equation Modeling: A Multidisciplinary Journal, 2006
The usefulness of assessing individual fit in latent growth curve models was examined. The study used simulated data based on an unconditional and a conditional latent growth curve model with a linear component and a small quadratic component, and a linear model was fit to the data. Then the overall fit of linear and quadratic models to these data…
Descriptors: Structural Equation Models, Evaluation Methods, Goodness of Fit, Individual Development
Roberts, James S. – Applied Psychological Measurement, 2008
Orlando and Thissen (2000) developed an item fit statistic for binary item response theory (IRT) models known as S-X². This article generalizes their statistic to polytomous unfolding models. Four alternative formulations of S-X² are developed for the generalized graded unfolding model (GGUM). The GGUM is a…
Descriptors: Item Response Theory, Goodness of Fit, Test Items, Models
Dube, Chad; Rotello, Caren M.; Heit, Evan – Psychological Review, 2011
In "Assessing the Belief Bias Effect With ROCs: It's a Response Bias Effect," Dube, Rotello, and Heit (2010) examined the form of receiver operating characteristic (ROC) curves for reasoning and the effects of belief bias on measurement indices that differ in whether they imply a curved or linear ROC function. We concluded that the ROC…
Descriptors: Response Style (Tests), Evaluation Methods, Statistics, Validity