Publication Date
| In 2024 | 112 |
| Since 2023 | 301 |
| Since 2020 (last 5 years) | 977 |
| Since 2015 (last 10 years) | 2300 |
| Since 2005 (last 20 years) | 3264 |
Descriptor
| Science Tests | 4298 |
| Foreign Countries | 1778 |
| Science Instruction | 1382 |
| Science Achievement | 1302 |
| Science Education | 987 |
| Scores | 932 |
| Achievement Tests | 906 |
| Mathematics Tests | 872 |
| Teaching Methods | 811 |
| Elementary Secondary Education | 744 |
| Comparative Analysis | 724 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 206 |
| Teachers | 197 |
| Researchers | 88 |
| Students | 51 |
| Administrators | 48 |
| Policymakers | 38 |
| Community | 12 |
| Parents | 11 |
| Counselors | 2 |
Location
| Turkey | 247 |
| Australia | 121 |
| Canada | 117 |
| United States | 106 |
| Indonesia | 82 |
| United Kingdom (England) | 78 |
| Singapore | 73 |
| Germany | 72 |
| Taiwan | 69 |
| Japan | 59 |
| United Kingdom | 59 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 5 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 10 |
Kang, Yewon; Ha, Hyorim; Lee, Hee Seung – Educational Psychology Review, 2023
Natural category learning is important in science education. One strategy that has been empirically supported for enhancing category learning is testing, which facilitates not only the learning of previously studied information (backward testing effect) but also the learning of newly studied information (forward testing effect). However, in…
Descriptors: Science Education, Science Tests, Testing, Classification
Ted M. Clark – Journal of Chemical Education, 2023
The artificial intelligence chatbot ChatGPT was used to answer questions from final exams administered in two general chemistry courses, including questions with closed-response format and with open-response format. For closed-response questions, ChatGPT was very capable at identifying the concept even when the question included a great deal of…
Descriptors: Artificial Intelligence, Science Tests, Chemistry, Science Instruction
Purwoko Haryadi Santoso; Bayu Setiaji; Wahyudi; Johan Syahbrudin; Syamsul Bahri; Fathurrahman; A. Suci Rizky Ananda; Yusuf Sodhiqin – Physical Review Physics Education Research, 2024
The Force Concept Inventory (FCI) is one of the research-based assessments established by the physics education research community to measure students' understanding of Newtonian mechanics. Former works have often recorded the notion of gendered mean FCI scores favoring male students notably in the North American (NA) based studies. Nevertheless,…
Descriptors: Gender Differences, Physics, Science Instruction, Science Tests
Su, Kun; Henson, Robert A. – Journal of Educational and Behavioral Statistics, 2023
This article provides a process to carefully evaluate the suitability of a content domain for which diagnostic classification models (DCMs) could be applicable and then optimized steps for constructing a test blueprint for applying DCMs and a real-life example illustrating this process. The content domains were carefully evaluated using a set of…
Descriptors: Classification, Models, Science Tests, Physics
Ntumi, Simon; Agbenyo, Sheilla; Bulala, Tapela – Shanlax International Journal of Education, 2023
There is no need or point to testing of knowledge, attributes, traits, behaviours or abilities of an individual if information obtained from the test is inaccurate. However, by and large, it seems the estimation of psychometric properties of test items in classroomshas been completely ignored otherwise dying slowly in most testing environments. In…
Descriptors: Psychometrics, Accuracy, Test Validity, Factor Analysis
Putica, Katarina B. – Research in Science Education, 2023
Previous studies noted the scantiness of diagnostic instruments for the assessment of students' understanding of fundamental biochemistry concepts. Consequently, within this study, a four-tier test for the examination of secondary school students' conceptual understanding of amino acids, proteins, and enzymes has been developed. Items in the test…
Descriptors: Test Construction, Test Validity, Secondary School Students, Science Tests
Cuilan Qiao; Yuqing Chen; Qing Guo; Yunwei Yu – International Journal of STEM Education, 2024
In the era defined by the fourth paradigm of science research, the burgeoning volume of science data poses a formidable challenge. The established data-related requisites within science literacy now fall short of addressing the evolving needs of researchers and STEM students. Consequently, the emergence of science data literacy becomes imperative.…
Descriptors: Scientific Literacy, Data, STEM Education, Majors (Students)
Amy D. Robertson; Lisa M. Goodhew; Lauren C. Bauman; Brynna Hansen; Anne T. Alesandrini – Physical Review Physics Education Research, 2023
[This paper is part of the Focused Collection on Qualitative Methods in PER: A Critical Examination.] Identifying student ideas about particular physics topics is one of the earliest and longest-standing foci of physics education research. This paper presents a method for identifying common conceptual resources for understanding physics, using…
Descriptors: Physics, Scientific Concepts, Science Process Skills, Science Tests
Nedungadi, Sachin; Rinco Michels, Olga; Kreke, Patricia J.; Raker, Jeffrey R.; Murphy, Kristen L. – Journal of Chemical Education, 2022
Practice examinations developed at the ACS Examinations Institute ask students to self-report mental effort when answering items. This self-reported mental effort together with performance can be represented in the form of a cognitive efficiency graph for each student giving information on the utilization of cognitive resources and content…
Descriptors: Cognitive Processes, Science Tests, Test Items, Difficulty Level
David G. Schreurs; Jaclyn M. Trate; Shalini Srinivasan; Melonie A. Teichert; Cynthia J. Luxford; Jamie L. Schneider; Kristen L. Murphy – Chemistry Education Research and Practice, 2024
With the already widespread nature of multiple-choice assessments and the increasing popularity of answer-until-correct, it is important to have methods available for exploring the validity of these types of assessments as they are developed. This work analyzes a 20-question multiple choice assessment covering introductory undergraduate chemistry…
Descriptors: Multiple Choice Tests, Test Validity, Introductory Courses, Science Tests
Jia-qi Zheng; Li-hua Tan – International Journal of Science Education, 2024
Cultivating students' enjoyment of science is one of the significant aims of science education. However, it seems challenging to achieve this goal in most countries or economies. Using the Trends in International Mathematics and Science Study (TIMSS) 2019 Hong Kong data, this study aims to identify multiple configurations of school conditions to…
Descriptors: Achievement Tests, Elementary Secondary Education, Foreign Countries, Science Achievement
Serhan Sarioglu; Bulut Demir; Ümmühan Ormanci; Salih Çepni – Journal of Teacher Education and Educators, 2023
This study aims to obtain and compare the opinions of exam question writers and teachers on skill-based exam questions. 24 science teachers and 11 context-based exam question writers participated in the study, which was carried out according to the convergent parallel design. Opinions of both parties on the skill-based exam questions were…
Descriptors: Tests, Authors, Science Tests, Foreign Countries
Pentecost, Thomas C.; Raker, Jeffery R.; Murphy, Kristen L. – Practical Assessment, Research & Evaluation, 2023
Using multiple versions of an assessment has the potential to introduce item environment effects. These types of effects result in version dependent item characteristics (i.e., difficulty and discrimination). Methods to detect such effects and resulting implications are important for all levels of assessment where multiple forms of an assessment…
Descriptors: Item Response Theory, Test Items, Test Format, Science Tests
Gonzales, Fredrick – ProQuest LLC, 2023
This study examines the relationship between two secondary End of Course (EOC) exams, the Biology EOC and the English I EOC exams, and their impact on Emergent Bilingual (EB) students in a small district in South Texas. This study is a mixed methods study which uses both quantitative and qualitative data to answer three research questions: (a)…
Descriptors: Secondary School Students, Bilingual Students, Biology, Exit Examinations
Karoline A. Sachse; Sebastian Weirich; Nicole Mahler; Camilla Rjosk – International Journal of Testing, 2024
In order to ensure content validity by covering a broad range of content domains, the testing times of some educational large-scale assessments last up to a total of two hours or more. Performance decline over the course of taking the test has been extensively documented in the literature. It can occur due to increases in the numbers of: (a)…
Descriptors: Test Wiseness, Test Score Decline, Testing Problems, Foreign Countries

Peer reviewed
Direct link
