The Prevalence and Implications of Slipping on Low-Stakes, Large-Scale Assessments

Research output: Contribution to journal › Article › peer-review

Abstract

In the absence of clear incentives, achievement tests may be subject to slipping, in which item response functions have upper asymptotes below one. Slipping reduces score precision for higher latent scores and distorts test developers’ understanding of item and test information. A multidimensional four-parameter normal ogive model was developed for large-scale assessments and applied to dichotomous items of the 2011 National Assessment of Educational Progress eighth-grade mathematics and reading tests. The results suggest that the probability of slipping exceeded 5% for 47.2% and 51.1% of the dichotomous mathematics and reading items, respectively. Furthermore, allowing for slipping resulted in larger item discrimination parameters, increased information in the lower-to-middle range of the latent trait, and decreased precision for scores one standard deviation above the mean. The results provide evidence that slipping is a factor that should be considered during test development and construction to ensure adequate measurement across the latent continuum.
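To make the notion of an upper asymptote below one concrete, the following is a minimal sketch of a unidimensional four-parameter normal ogive item response function and its item information. It is an illustration only, not the multidimensional model developed in the article. The parameter names (`a` for discrimination, `b` for difficulty, `c` for the lower asymptote, `d` for the upper asymptote) and the item values are assumptions chosen for the example; under this parameterization the slipping probability is `1 - d`.

```python
import math


def normal_cdf(z: float) -> float:
    """Standard normal CDF, written with the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def normal_pdf(z: float) -> float:
    """Standard normal density."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def irf_4pno(theta: float, a: float, b: float, c: float, d: float) -> float:
    """Probability of a correct response under a unidimensional 4PNO model.

    c is the lower asymptote (guessing) and d is the upper asymptote,
    so 1 - d is the slipping probability (names are illustrative).
    """
    return c + (d - c) * normal_cdf(a * (theta - b))


def info_4pno(theta: float, a: float, b: float, c: float, d: float) -> float:
    """Fisher information of one dichotomous item: P'(theta)^2 / (P * (1 - P))."""
    p = irf_4pno(theta, a, b, c, d)
    slope = (d - c) * a * normal_pdf(a * (theta - b))
    denom = p * (1.0 - p)
    # Guard against floating-point underflow far out in the tails.
    return slope * slope / denom if denom > 0.0 else 0.0


if __name__ == "__main__":
    # Hypothetical item: same a, b, c, with and without 5% slipping.
    for theta in (-1.0, 0.0, 1.0, 2.0):
        with_slip = info_4pno(theta, a=1.2, b=0.0, c=0.0, d=0.95)
        no_slip = info_4pno(theta, a=1.2, b=0.0, c=0.0, d=1.0)
        print(f"theta={theta:+.1f}  info(d=0.95)={with_slip:.4f}  info(d=1.00)={no_slip:.4f}")
```

With the other parameters held fixed, an upper asymptote below one lowers item information at high values of the latent trait, which is the mechanism behind the reduced precision for higher scores. The sketch holds discrimination constant, whereas the article reports that allowing for slipping also led to larger estimated discrimination parameters, so it does not reproduce the reported results.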

Original language: English (US)
Pages (from-to): 706-725
Number of pages: 20
Journal: Journal of Educational and Behavioral Statistics
Volume: 42
Issue number: 6
State: Published - Dec 1 2017

Keywords

  • NAEP
  • achievement
  • item response theory
  • psychometrics
  • testing

ASJC Scopus subject areas

  • Education
  • Social Sciences (miscellaneous)
