Approximate query translation across heterogeneous information sources

Chen Chuan K. Chang, Hector Garcia-Molina

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In this paper we present a mechanism for approximately translating Boolean query constraints across heterogeneous information sources. Achieving the best translation is challenging because sources support different constraints for formulating queries, and often these constraints cannot be precisely translated. For instance, a query Iscore > 81 might be 'perfectly' translated as (rating > 0.81 at some site, but can only be approximated as lgrade = A( at another. Unlike other work, our general framework adopts a cus(omiiable 'closeness' metric for the translation that combines both precision and recall. Our results show that for query translation we need to handle interdependencies among both query conJuncts as well as disjuncts. As the basis, we identify the essential requirements of a rule system for Users to encode the mappings for atomic semantic Units. Our algorithm then translates complex queries by rewriting them in terms of the semantic units. We show that, under practical assumptions, our algorithm generates the best approximate translations with respect to the closeness metric of choice. We also present a case study to show how our technique may be applied in practice.

Original languageEnglish (US)
Title of host publicationProceedings of the 26th International Conference on Very Large Data Bases, VLDB'00
Pages566-577
Number of pages12
StatePublished - Dec 1 2000
Externally publishedYes
Event26th International Conference on Very Large Data Bases, VLDB 2000 - Cairo, Egypt
Duration: Sep 10 2000Sep 14 2000

Publication series

NameProceedings of the 26th International Conference on Very Large Data Bases, VLDB'00

Other

Other26th International Conference on Very Large Data Bases, VLDB 2000
CountryEgypt
CityCairo
Period9/10/009/14/00

ASJC Scopus subject areas

  • Hardware and Architecture
  • Information Systems
  • Software
  • Information Systems and Management

Fingerprint Dive into the research topics of 'Approximate query translation across heterogeneous information sources'. Together they form a unique fingerprint.

Cite this