Big data at scale for digital humanities: An architecture for the HathiTrust research center

Stacy T. Kowalczyk, Yiming Sun, Zong Peng, Beth Plale, Aaron Todd, Loretta Auvil, Craig Willis, Jiaan Zeng, Milinda Pathirage, Samitha Liyanage, Guangchen Ruan, J. Stephen Downie

Research output: Chapter in Book/Report/Conference proceedingChapter


Big Data in the humanities is a new phenomenon that is expected to revolutionize the process of humanities research. The HathiTrust Research Center (HTRC) is a cyberinfrastructure to support humanities research on big humanities data. The HathiTrust Research Center has been designed to make the technology serve the researcher to make the content easy to find, to make the research tools efficient and effective, to allow researchers to customize their environment, to allow researchers to combine their own data with that of the HTRC, and to allow researchers to contribute tools. The architecture has multiple layers of abstraction providing a secure, scalable, extendable, and generalizable interface for both human and computational users.

Original languageEnglish (US)
Title of host publicationBig Data Management, Technologies, and Applications
EditorsWen-Chen Hu, Naima Kaabouch
PublisherIGI Global
Number of pages25
ISBN (Electronic)9781466647008
ISBN (Print)1466646993, 9781466646995
StatePublished - Oct 31 2013

ASJC Scopus subject areas

  • General Computer Science
  • General Economics, Econometrics and Finance
  • General Business, Management and Accounting


Dive into the research topics of 'Big data at scale for digital humanities: An architecture for the HathiTrust research center'. Together they form a unique fingerprint.

Cite this