Retrieval of patent documents from heterogeneous sources using ontologies and similarity analysis

Siddharth Taduri, Gloria T. Lau, Kincho H. Law, Jay P. Kesan

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

In the past few years, there has been explosive growth in scientific and legal information related to the patent system. Patents and related documents are siloed in multiple heterogeneous sources. Retrieving relevant information from these diverse sources is a non-trivial task that poses many technical challenges. Among these challenges is the inconsistent terminology used across the documents. We tackle this terminological inconsistency by exploring domain knowledge through the use of ontology standards. Furthermore, we take advantage of cross-references and structural dependencies between the information sources to enhance terminological comparison. In this paper, we present a similarity analysis methodology that combines knowledge from two distinct sources, (1) domain ontologies and (2) ontologies that describe the information sources, to assist a user in identifying relevant documents across several information sources simultaneously. Specifically, we explore the use of a rule-based system to infer relationships between documents based on pre-defined heuristics. We present our results through a use case in the bio-patent domain with a collection of 1150 patents and 30 court cases.
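
The two ideas named in the abstract, ontology-assisted term comparison and rule-based inference over cross-references, can be illustrated with a small sketch. This is not the authors' implementation: the synonym table, document identifiers, Jaccard measure, and the single "cites a document that cites another" rule are all hypothetical stand-ins chosen only to make the idea concrete.

```python
# Toy sketch only. The synonym table, IDs, and rule below are hypothetical
# and are not taken from the paper.

# Stand-in for a domain ontology: canonical term -> set of synonyms.
SYNONYMS = {
    "erythropoietin": {"epo", "epoetin"},
}


def expand(terms, synonyms=SYNONYMS):
    """Return the lowercased terms plus every ontology synonym of each term."""
    expanded = set()
    for term in terms:
        term = term.lower()
        expanded.add(term)
        # Term is a canonical name: add its synonyms.
        expanded |= synonyms.get(term, set())
        # Term is a synonym: add the canonical name and its sibling synonyms.
        for canonical, syns in synonyms.items():
            if term in syns:
                expanded.add(canonical)
                expanded |= syns
    return expanded


def jaccard(a, b):
    """Jaccard overlap of two term collections (0.0 if both are empty)."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def similarity(query_terms, doc_terms):
    """Compare two term lists after ontology-based expansion."""
    return jaccard(expand(query_terms), expand(doc_terms))


def infer_related(cites):
    """Hypothetical heuristic: if A cites B and B cites C, relate A to C.

    `cites` maps a document ID to the set of document IDs it cites.
    Pairs that are already direct citations are not repeated.
    """
    inferred = set()
    for src, direct in cites.items():
        for mid in direct:
            for dst in cites.get(mid, set()):
                if dst != src and dst not in direct:
                    inferred.add((src, dst))
    return inferred
```

In this sketch a query for "EPO" shares no literal term with a document mentioning "erythropoietin", so plain term overlap is zero, while the expanded comparison finds a match. Likewise, `infer_related({"case_1": {"patent_A"}, "patent_A": {"patent_B"}})` links the hypothetical court case to a patent it never cites directly.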

Original language: English (US)
Title of host publication: Proceedings - 5th IEEE International Conference on Semantic Computing, ICSC 2011
Pages: 538-545
Number of pages: 8
State: Published - 2011
Event: 5th Annual IEEE International Conference on Semantic Computing, ICSC 2011 - Palo Alto, CA, United States
Duration: Sep 18 2011 → Sep 21 2011

Keywords

  • Court cases
  • Information retrieval
  • Knowledgebase
  • Ontology
  • Patent

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Computer Science Applications
  • Theoretical Computer Science
