Analyzing and Normalizing Type Metadata for a Large Aggregated Digital Library

Joshua D. Lynch, Jessica Gibson, Myung-Ja Han

Research output: Contribution to journal › Article › peer-review


The Illinois Digital Heritage Hub (IDHH) gathers and enhances metadata from contributing institutions around the state of Illinois and provides this metadata to the Digital Public Library of America (DPLA) for greater access. The IDHH helps contributors shape their metadata to the standards recommended and required by the DPLA in part by analyzing and enhancing aggregated metadata. In late 2018, the IDHH undertook a project to address a particularly problematic field, Type metadata. This paper walks through the project, detailing the process of gathering and analyzing metadata using the DPLA API and OpenRefine, data remediation through XSL transformations in conjunction with local improvements by contributing institutions, and the DPLA ingestion system's quality controls.
Original language: English (US)
Journal: Code4Lib Journal
Issue number: 47
State: Published - Feb 17 2020


  • Digital libraries
  • Metadata
  • Public libraries
  • Libraries
  • Quality control
  • Public institutions
  • Illinois

