Full of beans: A study on the alignment of two flowering plants classification systems

Yi Yun Cheng, Bertram Ludaescher

Research output: Contribution to journal (Conference article)

Abstract

Advancements in technologies such as DNA analysis have given rise to new ways of organizing organisms in biodiversity classification systems. In this paper, we examine the feasibility of aligning two classification systems for flowering plants using a logic-based Region Connection Calculus (RCC-5) approach. The older "Cronquist system" (1981) classifies plants using their morphological features, while the more recent Angiosperm Phylogeny Group IV (APG IV) system (2016) classifies them using many new methods, including genome-level analysis. In our approach, we align pairwise concepts X and Y from two taxonomies using five basic set relations: congruence (X=Y), inclusion (X>Y), inverse inclusion (X<Y), overlap (X><Y), and disjointness (X!Y). With some of the RCC-5 relationships among the Fabaceae family (bean family) and the Sapindaceae family (maple family) uncertain, we anticipate that merging the two classification systems will lead to numerous merged solutions, so-called possible worlds. Our research demonstrates how logic-based alignment with ambiguities can lead to multiple merged solutions, which would not have been feasible when aligning taxonomies, classifications, or other knowledge organization systems (KOS) manually. We believe that this work can introduce a novel approach for aligning KOS, where merged possible worlds can serve as a minimum viable product for engaging domain experts in the loop.
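The five RCC-5 relations named in the abstract can be illustrated with ordinary set operations. The following is a minimal sketch, not the authors' implementation: the function names `rcc5` and `candidate_worlds`, the toy member labels, and the placeholder concept names are all assumptions made for illustration. In particular, `candidate_worlds` only enumerates combinations of uncertain relations and does no logical consistency checking, so it gives an upper bound on the number of possible worlds rather than the merged solutions a reasoner would produce.

```python
from itertools import product


def rcc5(x, y):
    """Return the RCC-5 relation between two non-empty sets,
    using the symbols from the abstract."""
    x, y = frozenset(x), frozenset(y)
    if x == y:
        return "="    # congruence
    if x > y:
        return ">"    # inclusion: X properly contains Y
    if x < y:
        return "<"    # inverse inclusion: X is properly contained in Y
    if x & y:
        return "><"   # overlap: shared members, neither contains the other
    return "!"        # disjointness: no shared members


def candidate_worlds(articulations):
    """Enumerate every combination of candidate relations.

    `articulations` maps a (concept_a, concept_b) pair to the set of
    RCC-5 relations considered possible for that pair. No consistency
    check is applied (an assumption of this sketch).
    """
    pairs = sorted(articulations)
    options = [sorted(articulations[p]) for p in pairs]
    return [dict(zip(pairs, combo)) for combo in product(*options)]


# Toy members (hypothetical labels, not real taxonomic data).
old_concept = {"m1", "m2", "m3"}
new_concept = {"m2", "m3"}
print(rcc5(old_concept, new_concept))

# Two uncertain articulations between placeholder concepts.
uncertain = {
    ("old.A", "new.A"): {"=", "<"},
    ("old.B", "new.B"): {"><", "!", ">"},
}
print(len(candidate_worlds(uncertain)))
```

Each uncertain pair multiplies the number of candidate combinations, which is one way to see why ambiguous articulations can yield many merged solutions.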

Original language: English (US)
Pages (from-to): 53-63
Number of pages: 11
Journal: CEUR Workshop Proceedings
Volume: 2200
State: Published - Jan 1 2018
Event: 18th European Networked Knowledge Organization Systems Workshop, NKOS 2018 - Porto, Portugal
Duration: Sep 13 2018 → …

Fingerprint

Taxonomies
Biodiversity
Merging
DNA
Genes
Phylogeny

Keywords

  • Interoperability
  • KOS alignment
  • Taxonomy alignment

ASJC Scopus subject areas

  • Computer Science (all)

Cite this

Full of beans: A study on the alignment of two flowering plants classification systems. / Cheng, Yi Yun; Ludaescher, Bertram.

In: CEUR Workshop Proceedings, Vol. 2200, 01.01.2018, p. 53-63.

@article{68f2a92b57b44c0cb41dc89c3d081fde,
title = "Full of beans: A study on the alignment of two flowering plants classification systems",
abstract = "Advancements in technologies such as DNA analysis have given rise to new ways of organizing organisms in biodiversity classification systems. In this paper, we examine the feasibility of aligning two classification systems for flowering plants using a logic-based Region Connection Calculus (RCC-5) approach. The older {"}Cronquist system{"} (1981) classifies plants using their morphological features, while the more recent Angiosperm Phylogeny Group IV (APG IV) system (2016) classifies them using many new methods, including genome-level analysis. In our approach, we align pairwise concepts X and Y from two taxonomies using five basic set relations: congruence (X=Y), inclusion (X>Y), inverse inclusion (X<Y), overlap (X><Y), and disjointness (X!Y). With some of the RCC-5 relationships among the Fabaceae family (bean family) and the Sapindaceae family (maple family) uncertain, we anticipate that merging the two classification systems will lead to numerous merged solutions, so-called possible worlds. Our research demonstrates how logic-based alignment with ambiguities can lead to multiple merged solutions, which would not have been feasible when aligning taxonomies, classifications, or other knowledge organization systems (KOS) manually. We believe that this work can introduce a novel approach for aligning KOS, where merged possible worlds can serve as a minimum viable product for engaging domain experts in the loop.",
keywords = "Interoperability, KOS alignment, Taxonomy alignment",
author = "Cheng, {Yi Yun} and Ludaescher, Bertram",
year = "2018",
month = "1",
day = "1",
language = "English (US)",
volume = "2200",
pages = "53--63",
journal = "CEUR Workshop Proceedings",
issn = "1613-0073",
publisher = "CEUR-WS",

}

TY - JOUR

T1 - Full of beans

T2 - A study on the alignment of two flowering plants classification systems

AU - Cheng, Yi Yun

AU - Ludaescher, Bertram

PY - 2018/1/1

Y1 - 2018/1/1

AB - Advancements in technologies such as DNA analysis have given rise to new ways of organizing organisms in biodiversity classification systems. In this paper, we examine the feasibility of aligning two classification systems for flowering plants using a logic-based Region Connection Calculus (RCC-5) approach. The older "Cronquist system" (1981) classifies plants using their morphological features, while the more recent Angiosperm Phylogeny Group IV (APG IV) system (2016) classifies them using many new methods, including genome-level analysis. In our approach, we align pairwise concepts X and Y from two taxonomies using five basic set relations: congruence (X=Y), inclusion (X>Y), inverse inclusion (X<Y), overlap (X><Y), and disjointness (X!Y). With some of the RCC-5 relationships among the Fabaceae family (bean family) and the Sapindaceae family (maple family) uncertain, we anticipate that merging the two classification systems will lead to numerous merged solutions, so-called possible worlds. Our research demonstrates how logic-based alignment with ambiguities can lead to multiple merged solutions, which would not have been feasible when aligning taxonomies, classifications, or other knowledge organization systems (KOS) manually. We believe that this work can introduce a novel approach for aligning KOS, where merged possible worlds can serve as a minimum viable product for engaging domain experts in the loop.

KW - Interoperability

KW - KOS alignment

KW - Taxonomy alignment

UR - http://www.scopus.com/inward/record.url?scp=85053867660&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85053867660&partnerID=8YFLogxK

M3 - Conference article

AN - SCOPUS:85053867660

VL - 2200

SP - 53

EP - 63

JO - CEUR Workshop Proceedings

JF - CEUR Workshop Proceedings

SN - 1613-0073

ER -