Abstract
Data mining as one of the important means to discover interesting and potential useful patterns or knowledge from large data sources has been widely used for improving business intelligence. Since some data items may be specific to individuals, companies increasingly pay attention to privacy issues while implementing business intelligence solutions. In this paper, we present a framework for privacy-enhancing data mining and develop such privacy-enhancing technologies as attribute selection, discretization, fixed-data perturbation, probability distribution, and randomization. Specifically, we address the issue of privacy protection through using the attribute selection, discretization, and randomization techniques and give an example of inducing the decision-trees from training data in which the values of sensitive attributes have been modified by using these techniques. The results show that we can achieve comparative predictive accuracies without accessing the actual values of the sensitive attributes.
Original language | English (US) |
---|---|
Pages | 2417-2425 |
Number of pages | 9 |
State | Published - 2003 |
Event | 9th Americas Conference on Information Systems, AMCIS 2003 - Tampa, United States Duration: Aug 4 2003 → Aug 6 2003 |
Conference
Conference | 9th Americas Conference on Information Systems, AMCIS 2003 |
---|---|
Country/Territory | United States |
City | Tampa |
Period | 8/4/03 → 8/6/03 |
Keywords
- Data mining
- business intelligence
- classification
- information loss
- privacy-enhancing system
ASJC Scopus subject areas
- Library and Information Sciences
- Computer Networks and Communications
- Computer Science Applications
- Information Systems