TY - GEN
T1 - Biological databases for linking large microbial and environmental datasets
AU - Jacob, C.
AU - Kent, A. D.
AU - Benson, B. J.
AU - Newton, R. J.
AU - Triplett, E. W.
AU - McMahon, K. D.
PY - 2005
Y1 - 2005
N2 - New analytical techniques in microbiology have created the potential to investigate microbial communities, their interactions, and their role in ecosystem functions in novel ways. Combinations of such techniques allow the rapid generation of large datasets describing microbial community composition and variation across time and space. In order to address ecologically relevant questions, microbial community datasets must be linked with related environmental datasets. This challenging task is made feasible in a rich data mining environment through a complex data model and interactive querying tools against a relational database. This paper discusses the motivation and design for one such microbial ecology database at the North Temperate Lakes Microbial Observatory and the research questions made tractable by this informatics project. The design principles and data-model used during database development are presented. Architecture that supports the progressive evolution of the informatics system is also discussed. Interactions with the user community in data model development were essential. This application is customdesigned to the needs and objectives of linking microbial and environmental questions, highlighting contributions from informatics to ecology. The bio-data model reflects the data mining regimes of the microbial disciplines and supports research questions that could not have been asked without such informatics tools. The project also serves as an illustrative case study in the design of data models and information systems, not only for microbial-environmental datasets, but in a broader perspective, for other biological databases that could adapt the techniques used here for data integration and mining.
AB - New analytical techniques in microbiology have created the potential to investigate microbial communities, their interactions, and their role in ecosystem functions in novel ways. Combinations of such techniques allow the rapid generation of large datasets describing microbial community composition and variation across time and space. In order to address ecologically relevant questions, microbial community datasets must be linked with related environmental datasets. This challenging task is made feasible in a rich data mining environment through a complex data model and interactive querying tools against a relational database. This paper discusses the motivation and design for one such microbial ecology database at the North Temperate Lakes Microbial Observatory and the research questions made tractable by this informatics project. The design principles and data-model used during database development are presented. Architecture that supports the progressive evolution of the informatics system is also discussed. Interactions with the user community in data model development were essential. This application is customdesigned to the needs and objectives of linking microbial and environmental questions, highlighting contributions from informatics to ecology. The bio-data model reflects the data mining regimes of the microbial disciplines and supports research questions that could not have been asked without such informatics tools. The project also serves as an illustrative case study in the design of data models and information systems, not only for microbial-environmental datasets, but in a broader perspective, for other biological databases that could adapt the techniques used here for data integration and mining.
KW - Biological databases
KW - Microbial and environmental datasets
KW - Scientific database design
UR - http://www.scopus.com/inward/record.url?scp=84867348309&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84867348309&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84867348309
SN - 9806560582
SN - 9789806560581
T3 - WMSCI 2005 - The 9th World Multi-Conference on Systemics, Cybernetics and Informatics, Proceedings
SP - 279
EP - 282
BT - WMSCI 2005 - The 9th World Multi-Conference on Systemics, Cybernetics and Informatics, Proceedings
T2 - 9th World Multi-Conference on Systemics, Cybernetics and Informatics, WMSCI 2005
Y2 - 10 July 2005 through 13 July 2005
ER -