Yang, Jing Chang, Hua-Hua Tao, Jian Shi, Ningzhong
Published in
Applied psychological measurement
Cognitive diagnostic computerized adaptive testing (CD-CAT) aims to obtain more useful diagnostic information by taking advantages of computerized adaptive testing (CAT). Cognitive diagnosis models (CDMs) have been developed to classify examinees into the correct proficiency classes so as to get more efficient remediation, whereas CAT tailors optim...
Sinharay, Sandip
Published in
Applied psychological measurement
Benefiting from item preknowledge is a major type of fraudulent behavior during educational assessments. This article suggests a new statistic that can be used for detecting the examinees who may have benefited from item preknowledge using their response times. The statistic quantifies the difference in speed between the compromised items and the n...
Li, Zhushan
Published in
Applied psychological measurement
Crossing SIBTEST or CSIB is designed to detect crossing differential item functioning (DIF) as well as unidirectional DIF. A theoretical formula for the power of CSIB is derived based on the asymptotic distribution of the test statistic under the null and alternative hypotheses. The derived power formula provides insights on the factors that influe...
Cui, Zhongmin
Published in
Applied psychological measurement
The R package catR is a useful developmental and testing platform for adaptive tests. Developers and researchers have been using catR for their assessments or research projects. However, there is a flaw in catR that can potentially cause misleading results. This article shows the flaw and provides a solution. Suggestions in using seeds for random n...
Wang, Wenhao Kingston, Neal
Published in
Applied psychological measurement
Previous studies indicated that the assumption of logistic form of parametric item response functions (IRFs) is violated often enough to be worth checking. Using nonparametric item response theory (IRT) estimation methods with the posterior predictive model checking method can obtain significance probabilities of fit statistics in a Bayesian framew...
Strachan, Tyler Ip, Edward Fu, Yanyan Ackerman, Terry Chen, Shyh-Huei Willse, John
Published in
Applied psychological measurement
As a method to derive a "purified" measure along a dimension of interest from response data that are potentially multidimensional in nature, the projective item response theory (PIRT) approach requires first fitting a multidimensional item response theory (MIRT) model to the data before projecting onto a dimension of interest. This study aims to ex...
Hong, Maxwell R Cheng, Ying
Published in
Applied psychological measurement
In the context of high-stakes tests, test takers who do not have enough time to complete a test rush toward the end and may engage in speeded behavior when tests do not penalize guessing. Using mathematical derivations and simulations, previous research showed that random guessing responses should attenuate interitem correlations, and therefore, de...
Albano, Anthony D Wiberg, Marie
Published in
Applied psychological measurement
Research has recently demonstrated the use of multiple anchor tests and external covariates to supplement or substitute for common anchor items when linking and equating with nonequivalent groups. This study examines the conditions under which external covariates improve linking and equating accuracy, with internal and external anchor tests of vary...
Jewsbury, Paul A
Published in
Applied psychological measurement
Criterion-related validation of diagnostic test scores for a construct of interest is complicated by the unavailability of the construct directly. The standard method, Known Group Validation, assumes an infallible reference test in place of the construct, but infallible reference tests are rare. In contrast, Mixed Group Validation allows for a fall...
Leroux, Audrey J Waid-Ebbs, J Kay Wen, Pey-Shan Helmer, Drew A Graham, David P O'Connor, Maureen K Ray, Kathleen
Published in
Applied psychological measurement
The purpose of this simulation study was to investigate the effect of several different item exposure control procedures in computerized adaptive testing (CAT) with variable-length stopping rules using the partial credit model. Previous simulation studies on CAT exposure control methods with polytomous items rarely considered variable-length tests....