Abstract
This paper studies the protection of health information privacy in data mining. We introduce the implications of the Health Insurance Portability and Accountability Act (HIPAA) for the privacy of protected health information (PHI). We present an attribute analysis framework for health information and a technological approach to protecting PHI in data mining. Specifically, we develop privacy-enhancing techniques such as data filtering, discretisation and randomisation, and give an example of inducing decision trees from training data in which the values of sensitive attributes have been either removed or modified using these techniques. The results show that we can achieve comparable predictive accuracy without accessing the original values of the sensitive attributes.
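The three techniques named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the attribute names (`name`, `ssn`, `age`, `cholesterol`), the bin edges, and the uniform noise range are all hypothetical choices made only for illustration.

```python
import random

def filter_attributes(record, identifiers):
    """Data filter: drop direct identifiers from a record."""
    return {k: v for k, v in record.items() if k not in identifiers}

def discretise(value, edges, labels):
    """Discretisation: return the label of the first bin whose upper edge exceeds value."""
    for edge, label in zip(edges, labels):
        if value < edge:
            return label
    return labels[-1]  # value is at or above the last edge

def randomise(value, spread, rng):
    """Randomisation: perturb a numeric value with uniform noise in [-spread, spread]."""
    return value + rng.uniform(-spread, spread)

def protect(record, rng):
    """Apply all three steps to one record (attribute names are assumptions)."""
    safe = filter_attributes(record, {"name", "ssn"})
    safe["age"] = discretise(safe["age"], [30, 50, 70],
                             ["<30", "30-49", "50-69", "70+"])
    safe["cholesterol"] = randomise(safe["cholesterol"], 10.0, rng)
    return safe

if __name__ == "__main__":
    patient = {"name": "A. Patient", "ssn": "000-00-0000",
               "age": 42, "cholesterol": 200.0}
    print(protect(patient, random.Random(0)))
```

A classifier such as a decision tree would then be trained on the protected records rather than on the originals; how much accuracy is retained depends on the data and on the chosen bins and noise level.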
Original language | English (US) |
---|---|
Pages (from-to) | 210-222 |
Number of pages | 13 |
Journal | International Journal of Healthcare Technology and Management |
Volume | 6 |
Issue number | 2 |
State | Published - 2004 |
Keywords
- Classification
- Data mining
- HIPAA
- Privacy
- Privacy-enhancing techniques
ASJC Scopus subject areas
- Leadership and Management
- Health Informatics