Merging sets of taxonomically organized data using concept mappings under uncertainty

David Thau, Shawn Bowers, Bertram Ludäscher

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

We present a method for using aligned ontologies to merge taxonomically organized data sets that have apparently compatible schemas, but potentially different semantics for corresponding domains. We restrict the relationships involved in the alignment to basic set relations and disjunctions of these relations. A merged data set combines the domains of the source data set attributes, conforms to the observations reported in both data sets, and minimizes uncertainty introduced by ontology alignments. We find that even in very simple cases, merging data sets under this scenario is non-trivial. Reducing uncertainty introduced by the ontology alignments in combination with the data set observations often results in many possible merged data sets, which are managed using a possible worlds semantics. The primary contributions of this paper are a framework for representing aligned data sets and algorithms for merging data sets that report the presence and absence of taxonomically organized entities, including an efficient algorithm for a common data set merging scenario.

Original language: English (US)
Title of host publication: On the Move to Meaningful Internet Systems
Subtitle of host publication: OTM 2009 - Confederated International Conferences, CoopIS, DOA, IS, and ODBASE 2009, Proceedings
Pages: 1103-1120
Number of pages: 18
Edition: PART 2
State: Published - 2009
Externally published: Yes
Event: Confederated International Conferences on On the Move to Meaningful Internet Systems, OTM 2009: CoopIS 2009, DOA 2009, IS 2009 and ODBASE 2009 - Vilamoura, Portugal
Duration: Nov 1 2009 to Nov 6 2009

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Number: PART 2
Volume: 5871 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Other

Other: Confederated International Conferences on On the Move to Meaningful Internet Systems, OTM 2009: CoopIS 2009, DOA 2009, IS 2009 and ODBASE 2009
Country/Territory: Portugal
City: Vilamoura
Period: 11/1/09 to 11/6/09

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science (all)
