TY - GEN
T1 - Advanced information systems for archival appraisals of contemporary documents
AU - McFadden, William
AU - McHenry, Kenton
AU - Kooper, Rob
AU - Ondrejcek, Michal
AU - Yahja, Alex
AU - Bajcsy, Peter
N1 - Copyright:
Copyright 2012 Elsevier B.V., All rights reserved.
PY - 2008
Y1 - 2008
N2 - This work addresses the problem of designing a scalable framework for archival appraisals of contemporary PDF documents. The motivation for our work is to provide an e-Science solution that (a) fuses the independent research methodologies focusing on specific information types to one comprehensive analytical framework, (b) optimizes tradeoffs between computational requirements and preservation costs, and (b) bridges the small scale and large scale computational studies. The e-Science solution presented here consists of (1) a methodology for comprehensive comparisons of contemporary documents containing text, images and vector graphics, (2) a framework for including 3D and 3D+time data sets into the appraisal analyses, (3) interfaces supporting exploratory archival appraisal analyses with small scale data sets, and (4) infrastructure supporting the transition from small scale to large scale computations using commodity and high performance computing resources. The novelty of our work is in designing methodologies, mathematical frameworks and prototypes for comprehensive and scalable document appraisals that include text, images, vector graphics, and high dimensional data.
AB - This work addresses the problem of designing a scalable framework for archival appraisals of contemporary PDF documents. The motivation for our work is to provide an e-Science solution that (a) fuses the independent research methodologies focusing on specific information types to one comprehensive analytical framework, (b) optimizes tradeoffs between computational requirements and preservation costs, and (b) bridges the small scale and large scale computational studies. The e-Science solution presented here consists of (1) a methodology for comprehensive comparisons of contemporary documents containing text, images and vector graphics, (2) a framework for including 3D and 3D+time data sets into the appraisal analyses, (3) interfaces supporting exploratory archival appraisal analyses with small scale data sets, and (4) infrastructure supporting the transition from small scale to large scale computations using commodity and high performance computing resources. The novelty of our work is in designing methodologies, mathematical frameworks and prototypes for comprehensive and scalable document appraisals that include text, images, vector graphics, and high dimensional data.
UR - http://www.scopus.com/inward/record.url?scp=62749128531&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=62749128531&partnerID=8YFLogxK
U2 - 10.1109/eScience.2008.140
DO - 10.1109/eScience.2008.140
M3 - Conference contribution
AN - SCOPUS:62749128531
SN - 9780769535357
T3 - Proceedings - 4th IEEE International Conference on eScience, eScience 2008
SP - 440
EP - 441
BT - Proceedings - 4th IEEE International Conference on eScience, eScience 2008
T2 - 4th IEEE International Conference on eScience, eScience 2008
Y2 - 7 December 2008 through 12 December 2008
ER -