PRIVACY PROTECTION IN DATA MINING

Jinquan Li, Michael J. Shaw, Fu Ren Lin

Research output: Contribution to conferencePaperpeer-review

Abstract

Data mining as one of the important means to discover interesting and potential useful patterns or knowledge from large data sources has been widely used for improving business intelligence. Since some data items may be specific to individuals, companies increasingly pay attention to privacy issues while implementing business intelligence solutions. In this paper, we present a framework for privacy-enhancing data mining and develop such privacy-enhancing technologies as attribute selection, discretization, fixed-data perturbation, probability distribution, and randomization. Specifically, we address the issue of privacy protection through using the attribute selection, discretization, and randomization techniques and give an example of inducing the decision-trees from training data in which the values of sensitive attributes have been modified by using these techniques. The results show that we can achieve comparative predictive accuracies without accessing the actual values of the sensitive attributes.

Original languageEnglish (US)
Pages2417-2425
Number of pages9
StatePublished - 2003
Event9th Americas Conference on Information Systems, AMCIS 2003 - Tampa, United States
Duration: Aug 4 2003Aug 6 2003

Conference

Conference9th Americas Conference on Information Systems, AMCIS 2003
Country/TerritoryUnited States
CityTampa
Period8/4/038/6/03

Keywords

  • Data mining
  • business intelligence
  • classification
  • information loss
  • privacy-enhancing system

ASJC Scopus subject areas

  • Library and Information Sciences
  • Computer Networks and Communications
  • Computer Science Applications
  • Information Systems

Fingerprint

Dive into the research topics of 'PRIVACY PROTECTION IN DATA MINING'. Together they form a unique fingerprint.

Cite this