TY - GEN
T1 - Association mining in large databases
T2 - 11th European Conference on Principles and Practice of Knowledge Discovery in Databases, PKDD 2007
AU - Wu, Tianyi
AU - Chen, Yuguo
AU - Han, Jiawei
PY - 2007
Y1 - 2007
N2 - In the literature of data mining and statistics, numerous interestingness measures have been proposed to disclose succinct object relationships of association patterns. However, it is still not clear when a measure is truly effective in large data sets. Recent studies have identified a critical property, null-(transaction) invariance, for measuring event associations in large data sets, but many existing measures do not have this property. We thus re-examine the null-invariant measures and find, interestingly, that they can be expressed as a generalized mathematical mean and that there exists a total ordering of them. This ordering provides insights into the underlying philosophy of the measures and helps us understand and select the proper measure for different applications.
UR - http://www.scopus.com/inward/record.url?scp=38149100327&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=38149100327&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-74976-9_66
DO - 10.1007/978-3-540-74976-9_66
M3 - Conference contribution
AN - SCOPUS:38149100327
SN - 9783540749752
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 621
EP - 628
BT - Knowledge Discovery in Databases
PB - Springer
Y2 - 17 September 2007 through 21 September 2007
ER -