Data for Appendix 7 PMID Duplication in the Union List of "Analyzing the consistency of retraction indexing"

  • Corinne McCumber (Creator)
  • Malik Oyewale Salami (Creator)

Dataset

Description

This project investigates retraction indexing agreement among data sources: BCI, BIOABS, CCC, Compendex, Crossref, GEOBASE, MEDLINE, PubMed, Retraction Watch, Scopus, and Web of Science Core. Post-retraction citation may be partly due to authors’ and publishers' challenges in systematically identifying retracted publications. To investigate retraction indexing quality, we investigate the agreement in indexing retracted publications between 11 database sources, restricting to their coverage, resulting in a union list of 85,392 unique items. This dataset highlights items that went through a DOI augmentation process to have PubMed added as a source and that have duplicated PMIDs, indicating data quality issues.
Date made availableNov 18 2025
PublisherUniversity of Illinois Urbana-Champaign

Keywords

  • PMID duplication
  • retraction indexing
  • RISRS
  • retraction status
  • data quality
  • metadata
  • indexing
  • identifier granularity
  • meta-science

Cite this