Showing 916 to 930 of 3,486 results
Peer reviewed. Birenbaum, Menucha; And Others – Educational and Psychological Measurement, 1997
The agreement of diagnostic classifications from two parallel subtests assessing a mathematics skill with three levels of scoring was studied with 431 Arab Israeli 10th graders. Results indicate that, even when parallel form reliability is high, less agreement is apparent when performance is evaluated at the micro level. (SLD)
Descriptors: Arabs, Classification, Diagnostic Tests, Evaluation Methods
Peer reviewed. Maranon, Pedro Prieto; And Others – Educational and Psychological Measurement, 1997
A modification of the Mantel-Haenszel (MH) procedure, in which the sample is split in half and the subsamples are studied separately, was compared with a residual procedure based on item response theory as a way to detect differential item functioning (DIF). Results suggest that the residual analysis achieves higher DIF detection. (SLD)
Descriptors: Comparative Analysis, Identification, Item Bias, Item Response Theory
Peer reviewed. Fields, Dail L.; Herold, David M. – Educational and Psychological Measurement, 1997
Investigated whether dimensions of transformational and transactional leadership can be inferred from subordinate reports of leadership behaviors collected through the Leadership Practices Inventory (LPI) (B. Posner and J. Kouzes, 1988) completed by 1,892 subordinates about 344 managers. Results support use of the LPI to measure transformation and…
Descriptors: Administrators, Evaluation Methods, Leadership, Measurement Techniques
Peer reviewed. Ndalichako, Joyce L.; Rogers, W. Todd – Educational and Psychological Measurement, 1997
Ability estimates obtained from applying finite state score theory, item response models, and classical test theory to score multiple-choice items were compared using responses of 1,230 examinees. Scoring models provided essentially the same ranking of examinees, but ease of use and interpretation support the use of the classical test model. (SLD)
Descriptors: Ability, Comparative Analysis, Estimation (Mathematics), High School Students
Peer reviewed. Salter, Daniel W.; And Others – Educational and Psychological Measurement, 1997
A test-retest study of the Myers-Briggs Type Indicator with 99 graduate students over 20 months yields findings consistent with previous studies, but an examination of type dynamics using log-linear analyses indicates that dominant thinking and dominant sensing do not retest as well as dominant intuition and feeling. (SLD)
Descriptors: Affective Behavior, Graduate Students, Graduate Study, Intuition
Peer reviewed. Hancock, Gregory R. – Educational and Psychological Measurement, 1997
Methods are offered for conducting hypothesis testing associated with disattenuated validity coefficients to overcome limitations of some other suggested approaches. Through using classical test theory's notion of reliability in the form of structured path models, such hypothesis testing may be done with hierarchically related structural equation…
Descriptors: Correlation, Hypothesis Testing, Reliability, Scores
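Note on the record above: the abstract refers to disattenuated validity coefficients without giving a formula. As a rough illustration only, the sketch below applies the classical correction for attenuation, which divides the observed correlation by the square root of the product of the two reliabilities. It is not the structural equation approach the article proposes; the function name `disattenuate` and the sample values are hypothetical.

```python
import math


def disattenuate(r_xy: float, r_xx: float, r_yy: float) -> float:
    """Classical correction for attenuation (illustrative helper, not from the article).

    r_xy: observed correlation between measures X and Y
    r_xx, r_yy: reliability estimates for X and Y, each in (0, 1]
    """
    if not (0.0 < r_xx <= 1.0 and 0.0 < r_yy <= 1.0):
        raise ValueError("reliabilities must lie in the interval (0, 1]")
    # Divide the observed correlation by the geometric mean of the reliabilities.
    return r_xy / math.sqrt(r_xx * r_yy)


# Hypothetical values: observed r = .42, reliabilities .81 and .64.
corrected = disattenuate(0.42, 0.81, 0.64)
print(round(corrected, 3))
```

The corrected value is a point estimate only; testing hypotheses about it is the problem the article addresses.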
In-Basket Assessment by Fully Objective Methods: Development and Evaluation of a Self-Report System.
Peer reviewed. Hakstian, A. Ralph; Scratchley, Linda S. – Educational and Psychological Measurement, 1997
The feasibility and efficacy of using self-report response methods with an In-Basket exercise were evaluated in two studies involving 258 managers and 55 college students, respectively. Results suggest that high face validity of the In-Basket exercise can be combined with the scoring ease and objectivity of self-reports. (SLD)
Descriptors: Administrators, College Students, Evaluation Methods, Higher Education
Peer reviewed. Rae, Gordon – Educational and Psychological Measurement, 1997
Although Camp's tetrachoric correlation approximation (B. Camp, 1934) has been shown to yield excellent results over a fairly wide range, its derivation remains something of a mystery. This article shows how Camp might have arrived at his formula using a theoretical, rather than an empirical, approach. (Author/SLD)
Descriptors: Correlation, Estimation (Mathematics), Theories
Peer reviewed. Caruso, John C.; Cliff, Norman – Educational and Psychological Measurement, 1997
Several methods of constructing confidence intervals for Spearman's rho (rank correlation coefficient) (C. Spearman, 1904) were tested in a Monte Carlo study using 2,000 samples of 3 different sizes. Results support the continued use of Spearman's rho in behavioral research. (SLD)
Descriptors: Behavioral Science Research, Correlation, Monte Carlo Methods, Power (Statistics)
Peer reviewed. Wilcox, Rand R. – Educational and Psychological Measurement, 1997
Some results on how the Alexander-Govern heteroscedastic analysis of variance (ANOVA) procedure (R. Alexander and D. Govern, 1994) performs under nonnormality are presented. This method can provide poor control of Type I errors in some cases, and in some situations power decreases as differences among the means get large. (SLD)
Descriptors: Analysis of Variance, Error of Measurement, Power (Statistics), Statistical Distributions
Peer reviewed. Tinsley, Barbara J.; And Others – Educational and Psychological Measurement, 1997
The convergent validity of peer, self, and teacher methods of assessing youths' risk propensity and the relation of these measures to health risk behavior were studied with 436 elementary and junior high school students. Findings demonstrate low congruence between rater sources. Prediction depended on behavior assessed and grade level. (SLD)
Descriptors: Age Differences, Behavior Patterns, Children, Elementary Education
Peer reviewed. Ponsoda, Vincente; And Others – Educational and Psychological Measurement, 1997
A study involving 209 Spanish high school students compared computer-based English vocabulary tests: (1) a self-adapted test (SAT); (2) a computerized adaptive test (CAT); (3) a conventional test; and (4) a test combining SAT and CAT. No statistically significant differences were found among test types for estimated ability or posttest anxiety.…
Descriptors: Ability, Adaptive Testing, Anxiety, Comparative Analysis
Peer reviewed. Mudrack, Peter E. – Educational and Psychological Measurement, 1997
The Time Structure Questionnaire (M. J. Bond and N. T. Feather, 1988) and the Time Management Behavior Scale (T. H. Macan and colleagues, 1990) were evaluated using results from 701 and 453 adults, respectively. Results confirm the importance of examining the subscales of these measures rather than only the aggregate scores. (SLD)
Descriptors: Adults, Attitude Measures, Attitudes, Scores
Peer reviewed. Thornton, George C., III; And Others – Educational and Psychological Measurement, 1997
Ethnic and gender differences in motivation to manage were studied through sentence completion tests and in-basket exercises completed by 138 White, 9 American Indian, 28 Black, 64 Hispanic, 44 Asian, and 5 "other" college students. Both types of differences were found. Implications for identification of management talent are discussed. (SLD)
Descriptors: Administration, American Indians, Asian Americans, Black Students
Peer reviewed. Duan, Bin; Dunlap, William P. – Educational and Psychological Measurement, 1997
A Monte Carlo study compared the accuracy of different estimates of the standard error of correlations corrected for restriction in range. The procedure suggested by P. Bobko and A. Rieck (1980) generated the most accurate estimates of the standard error. Aspects of accuracy are discussed. (SLD)
Descriptors: Correlation, Error of Measurement, Estimation (Mathematics), Monte Carlo Methods
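Note on the record above: the abstract concerns standard errors of correlations corrected for restriction in range but does not state the correction itself. As a minimal sketch, assuming the common case of direct restriction on the predictor (often called Thorndike's Case II), the corrected correlation can be computed as below. The function name and sample values are hypothetical, and the standard-error estimators compared in the study are not reproduced here.

```python
import math


def correct_range_restriction(r: float, sd_unrestricted: float, sd_restricted: float) -> float:
    """Correct a correlation for direct range restriction on the predictor.

    Illustrative helper (assumed Case II correction), not taken from the article.
    r: correlation observed in the restricted sample
    sd_unrestricted: predictor standard deviation in the full population
    sd_restricted: predictor standard deviation in the restricted sample
    """
    if sd_unrestricted <= 0 or sd_restricted <= 0:
        raise ValueError("standard deviations must be positive")
    u = sd_unrestricted / sd_restricted  # ratio of unrestricted to restricted SD
    return (r * u) / math.sqrt(1.0 - r * r + r * r * u * u)


# Hypothetical values: r = .30 in a sample whose predictor SD is half the population SD.
print(round(correct_range_restriction(0.30, 10.0, 5.0), 3))
```

When the two standard deviations are equal, the ratio is 1 and the function returns the observed correlation unchanged.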


