A multi-layer graphical model for approximate identity matching

G. Alan Wang

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Many organizations maintain identity information for their customers, vendors, and employees. However, compromised identities cannot be retrieved effectively. In this paper, we first present a case study of identity problems in a local police department. The study shows that more than half of the sampled suspects have altered identities in the police information system because of deception and errors. We build a taxonomy of identity problems based on our findings. Because of these problems, deciding whether two identities match involves uncertainty. We propose a probability-based multi-layer graphical model to capture this uncertainty. Experiments show that the proposed model performs significantly better than an exact-match search technique. With 20% of the training data labeled, the model with semi-supervised learning achieved performance comparable to that of fully supervised learning.

Original language: English (US)
Title of host publication: Association for Information Systems - 11th Americas Conference on Information Systems, AMCIS 2005
Subtitle of host publication: A Conference on a Human Scale
Pages: 2495-2505
Number of pages: 11
State: Published - Dec 1 2005
Externally published: Yes
Event: 11th Americas Conference on Information Systems, AMCIS 2005 - Omaha, NE, United States
Duration: Aug 11 2005 → Aug 15 2005

Publication series

Name: Association for Information Systems - 11th Americas Conference on Information Systems, AMCIS 2005: A Conference on a Human Scale
Volume: 5

Other

Other: 11th Americas Conference on Information Systems, AMCIS 2005
Country/Territory: United States
City: Omaha, NE
Period: 8/11/05 → 8/15/05

Keywords

  • Bayesian network
  • Graphical model
  • Identity matching
  • Record linkage
  • Semi-supervised learning
  • Similarity

ASJC Scopus subject areas

  • Computer Science Applications
  • Computer Networks and Communications
  • Information Systems
  • Library and Information Sciences
