Wilcox, Rand R.2011-02-122011-02-121980Wilcox, Rand R. (1980). Determining the length of a criterion-referenced test. Applied Psychological Measurement, 4, 425-446. doi:10.1177/014662168000400402doi:10.1177/014662168000400402https://hdl.handle.net/11299/100199When determining how many items to include on a criterion-referenced test, practitioners must resolve various nonstatistical issues before a particular solution can be applied. A fundamental problem is deciding which of three true scores should be used. The first is based on the probability that an examinee is correct on a "typical" test item. The second is the probability of having acquired a typical skill among a domain of skills, and the third is based on latent trait models. Once a particular true score is settled upon, there are several perspectives that might be used to determine test length. The paper reviews and critiques these solutions. Some new results are described that apply when latent structure models are used to estimate an examinee’s true score.enDetermining the length of a criterion-referenced testArticle