Publication Date
| In 2024 | 338 |
| Since 2023 | 796 |
| Since 2020 (last 5 years) | 2344 |
| Since 2015 (last 10 years) | 4041 |
| Since 2005 (last 20 years) | 6436 |
Descriptor
| Test Construction | 16088 |
| Test Validity | 5493 |
| Test Reliability | 4088 |
| Foreign Countries | 3395 |
| Test Items | 2610 |
| Higher Education | 1946 |
| Evaluation Methods | 1804 |
| Factor Analysis | 1792 |
| Elementary Secondary Education | 1689 |
| Psychometrics | 1660 |
| Student Evaluation | 1543 |
Audience
| Practitioners | 641 |
| Teachers | 446 |
| Researchers | 431 |
| Administrators | 122 |
| Students | 66 |
| Policymakers | 64 |
| Counselors | 23 |
| Parents | 23 |
| Community | 10 |
| Support Staff | 5 |
| Media Staff | 3 |
Location
| Turkey | 545 |
| Australia | 326 |
| Canada | 243 |
| China | 142 |
| United States | 134 |
| Indonesia | 127 |
| United Kingdom | 125 |
| Germany | 106 |
| California | 104 |
| Taiwan | 102 |
| United Kingdom (England) | 102 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 2 |
Yanyan Fu – Educational Measurement: Issues and Practice, 2024
The template-based automated item-generation (TAIG) approach, which involves template creation, item generation, item selection, field-testing, and evaluation, has more steps than the traditional item development method. Consequently, there is more margin for error in this process, and any template errors can cascade to the generated items.…
Descriptors: Error Correction, Automation, Test Items, Test Construction
Becker, Benjamin; Weirich, Sebastian; Goldhammer, Frank; Debeer, Dries – Journal of Educational Measurement, 2023
When designing or modifying a test, an important challenge is controlling its speededness. To achieve this, van der Linden (2011a, 2011b) proposed using a lognormal response time model, more specifically the two-parameter lognormal model, and automated test assembly (ATA) via mixed integer linear programming. However, this approach has a severe…
Descriptors: Test Construction, Automation, Models, Test Items
Miguel A. García-Pérez – Educational and Psychological Measurement, 2024
A recurring question regarding Likert items is whether the discrete steps that this response format allows represent constant increments along the underlying continuum. This question appears unsolvable because Likert responses carry no direct information to this effect. Yet, any item administered in Likert format can identically be administered…
Descriptors: Likert Scales, Test Construction, Test Items, Item Analysis
Mahmood Ul Hassan; Frank Miller – Journal of Educational Measurement, 2024
Multidimensional achievement tests have recently gained importance in educational and psychological measurement. For example, multidimensional diagnostic tests can help students determine which particular domain of knowledge they need to improve for better performance. To estimate the characteristics of candidate items (calibration) for…
Descriptors: Multidimensional Scaling, Achievement Tests, Test Items, Test Construction
Leifeng Xiao; Kit-Tai Hau; Melissa Dan Wang – Educational Measurement: Issues and Practice, 2024
Short scales are time-efficient for participants and cost-effective in research. However, researchers often mistakenly expect short scales to have the same reliability as long ones without considering the effect of scale length. We argue that applying a universal benchmark for alpha is problematic as the impact of low-quality items is greater on…
Descriptors: Measurement, Benchmarking, Item Sampling, Sample Size
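The scale-length effect mentioned in the abstract above can be illustrated with the Spearman-Brown prophecy formula from classical test theory. The truncated abstract does not name this formula, so its use here is an assumption made for illustration, and the function name `spearman_brown` is hypothetical. It is a minimal sketch that assumes the added or removed items are parallel to the existing ones.

```python
def spearman_brown(reliability: float, length_factor: float) -> float:
    """Predict reliability after changing test length by `length_factor`.

    `reliability` is the reliability of the current scale (0 to 1).
    `length_factor` is new length / old length (e.g. 0.25 when a
    20-item scale is cut to 5 items). Assumes parallel items.
    """
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must lie in [0, 1]")
    if length_factor <= 0:
        raise ValueError("length_factor must be positive")
    return (length_factor * reliability) / (
        1.0 + (length_factor - 1.0) * reliability
    )


# A 20-item scale with reliability 0.80, shortened to 5 items.
short_form = spearman_brown(0.80, 5 / 20)
print(f"Predicted reliability of the 5-item form: {short_form:.2f}")
```

Under these assumptions, a scale that meets a conventional alpha benchmark at 20 items falls well below it at 5 items even though item quality is unchanged, which is why a single benchmark applied regardless of length can mislead.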
Anne Traynor; Sara C. Christopherson – Applied Measurement in Education, 2024
Combining methods from earlier content validity and more contemporary content alignment studies may allow a more complete evaluation of the meaning of test scores than if either set of methods is used on its own. This article distinguishes item relevance indices in the content validity literature from test representativeness indices in the…
Descriptors: Test Validity, Test Items, Achievement Tests, Test Construction
Barry B. Gelston – ProQuest LLC, 2024
The purpose of this study was originally to create an operational definition of the "appearance of competence" to design valid questions for educational professionals supporting twice-exceptional (2e) learners to create a testing instrument. Through the methodological process of grounded theory, a replacement research question emerged as…
Descriptors: Definitions, Competence, Academically Gifted, Models
Po-Chun Huang; Ying-Hong Chan; Ching-Yu Yang; Hung-Yuan Chen; Yao-Chung Fan – IEEE Transactions on Learning Technologies, 2024
The question generation (QG) task plays a crucial role in adaptive learning. Although significant advances in QG performance have been reported, existing QG studies are still far from practical use. One point that needs strengthening is the generation of question groups, which remains unexplored. For forming a question group, intrafactors…
Descriptors: Automation, Test Items, Computer Assisted Testing, Test Construction
Hernandez, Nestor; Olson, Kristen; Smyth, Jolene D. – Field Methods, 2023
Questionnaire designers are encouraged to write questions as complete sentences. In self-administered surveys, incomplete question stems may reduce visual clutter but may also increase burden when respondents need to scan the response options to fully complete the question. We experimentally examine the effects of three categories of incomplete…
Descriptors: Surveys, Questionnaires, Test Construction, Reaction Time
Welsandt, Nina Charlotte Johanna; Abs, Hermann Josef – Journal of Social Science Education, 2023
Purpose: This paper analyses and classifies currently available English- and German-language measurement instruments for assessing economic literacy. It shows the content-related focuses and gaps of the extracted test instruments, the cognitive level of demand that characterises the instruments, the technical forms of implementation, and the…
Descriptors: Economics, Knowledge Level, Measures (Individuals), German
Anna Planas-Lladó; Xavier Úcar – American Journal of Evaluation, 2024
Empowerment is a concept that has become increasingly used over recent years. However, little research has been undertaken into how empowerment can be evaluated, particularly in the case of young people. The aim of this article is to present an inventory of dimensions and indicators of youth empowerment. The article describes the various phases in…
Descriptors: Youth, Empowerment, Test Construction, Test Validity
Laurie Lachance; Barbara L. Brush; Graciela Mentz; Shoou-Yih D. Lee; P. Paul Chandanabhumma; Chris M. Coombe; Ricardo DeMajo; Adena Gabrysiak; Megan Jensen; Angela G. Reyes; Zachary Rowe; Amy J. Schulz; Eliza Wilson-Powers; Barbara A. Israel – Health Education & Behavior, 2024
Conceptualizing and testing factors that contribute to the success of community-academic partnerships are critical to understanding their contributions to the health and well-being of communities. Most measures to date focus on factors that contribute to the development of new partnerships, and only a few have been adequately tested and validated.…
Descriptors: Community Involvement, Participatory Research, Evaluation Methods, Questionnaires
Julia Mang; Helmut Küchenhoff; Sabine Meinck – Large-scale Assessments in Education, 2024
Stratification is an important design feature of many studies using complex sampling designs, and it is often used in large-scale assessment (LSA) studies, such as the "Programme for International Student Assessment" (PISA), for two main reasons. First, stratification variables that achieve a high between-strata and low within-strata variance…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students
Alisha M. Hardman; Donna J. Peterson; Mariah S. Morgan; H. Elizabeth Solace – Journal of Extension, 2024
Evaluation data is needed to demonstrate the impact of 4-H science, technology, engineering, and mathematics (STEM) programming on children and youth. However, collecting evaluation data from Cloverbuds (ages 5-7) is particularly challenging given their developmental age. We adapted an observational Cloverbud evaluation tool to measure basic life…
Descriptors: Youth Programs, STEM Education, Young Children, Test Construction
Candra Skrzypek – Psychology in the Schools, 2024
Teachers play a critical role in school mental health. They aid in the identification and referral of students in need of mental health services and are key players in implementing interventions. Nevertheless, teachers often lack the education and training needed to support youths' mental health. Increasing teachers' mental health literacy (MHL)…
Descriptors: Teachers, Mental Health, Student Welfare, Multiple Literacies

