Biological databases for linking large microbial and environmental datasets

C. Jacob, A. D. Kent, B. J. Benson, R. J. Newton, E. W. Triplett, K. D. McMahon

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

New analytical techniques in microbiology have created the potential to investigate microbial communities, their interactions, and their role in ecosystem functions in novel ways. Combinations of such techniques allow the rapid generation of large datasets describing microbial community composition and variation across time and space. In order to address ecologically relevant questions, microbial community datasets must be linked with related environmental datasets. This challenging task is made feasible in a rich data mining environment through a complex data model and interactive querying tools against a relational database. This paper discusses the motivation and design for one such microbial ecology database at the North Temperate Lakes Microbial Observatory and the research questions made tractable by this informatics project. The design principles and data-model used during database development are presented. Architecture that supports the progressive evolution of the informatics system is also discussed. Interactions with the user community in data model development were essential. This application is customdesigned to the needs and objectives of linking microbial and environmental questions, highlighting contributions from informatics to ecology. The bio-data model reflects the data mining regimes of the microbial disciplines and supports research questions that could not have been asked without such informatics tools. The project also serves as an illustrative case study in the design of data models and information systems, not only for microbial-environmental datasets, but in a broader perspective, for other biological databases that could adapt the techniques used here for data integration and mining.

Original languageEnglish (US)
Title of host publicationWMSCI 2005 - The 9th World Multi-Conference on Systemics, Cybernetics and Informatics, Proceedings
Pages279-282
Number of pages4
StatePublished - 2005
Externally publishedYes
Event9th World Multi-Conference on Systemics, Cybernetics and Informatics, WMSCI 2005 - Orlando, FL, United States
Duration: Jul 10 2005Jul 13 2005

Publication series

NameWMSCI 2005 - The 9th World Multi-Conference on Systemics, Cybernetics and Informatics, Proceedings
Volume6

Other

Other9th World Multi-Conference on Systemics, Cybernetics and Informatics, WMSCI 2005
Country/TerritoryUnited States
CityOrlando, FL
Period7/10/057/13/05

Keywords

  • Biological databases
  • Microbial and environmental datasets
  • Scientific database design

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Information Systems

Fingerprint

Dive into the research topics of 'Biological databases for linking large microbial and environmental datasets'. Together they form a unique fingerprint.

Cite this