Enhancing search and Browse using automated clustering of subject metadata

Kat Hagedorn, Suzanne Chapman, David Newman

Research output: Contribution to journalArticlepeer-review


The Web puzzle of online information resources often hinders end-users from effective and efficient access to these resources. Clustering resources into appropriate subject-based groupings may help alleviate these difficulties, but will it work with heterogeneous material? The University of Michigan and the University of California Irvine joined forces to test automatically enhancing metadata records using the Topic Modeling algorithm on the varied OAIster corpus. We created labels for the resulting clusters of metadata records, matched the clusters to an inhouse classification system, and developed a prototype that would showcase methods for search and retrieval using the enhanced records. Results indicated that while the algorithm was somewhat time-intensive to run and using a local classification scheme had its drawbacks, precise clustering of records was achieved and the prototype interface proved that faceted classification could be powerful in helping end-users find resources.

Original languageEnglish (US)
JournalD-Lib Magazine
Issue number7-8
StatePublished - Jul 2007

ASJC Scopus subject areas

  • Library and Information Sciences

Fingerprint Dive into the research topics of 'Enhancing search and Browse using automated clustering of subject metadata'. Together they form a unique fingerprint.

Cite this