The paper studies issues related to privacy protection of health information in data mining. We introduce the implications of the Health Insurance Portability and Accountability Act (HIPAA) and the privacy of protected health information (PHI). We present an attribute analysis framework of health information and a technological approach for protecting PHI in data mining. Specifically, we develop such effective privacy-enhancing techniques as data filter, discretisation and randomisation and give an example of inducing the decision-trees from training data in which the values of sensitive attributes have been either removed or modified by using these techniques. The results show that we can achieve comparative predictive accuracies without accessing be original values a the sensitive attributes.

Original languageEnglish (US)
Pages (from-to)210-222
Number of pages13
JournalInternational Journal of Healthcare Technology and Management
Issue number2
StatePublished - 2004


  • Classification
  • Data mining
  • Privacy
  • Privacy-enhancing techniques

ASJC Scopus subject areas

  • Leadership and Management
  • Health Informatics


Dive into the research topics of 'Protection of health information in data mining'. Together they form a unique fingerprint.

Cite this