TY - GEN
T1 - Text mining in supporting software systems risk assurance
AU - Huang, Li Guo
AU - Port, Daniel
AU - Wang, Liang
AU - Xie, Tao
AU - Menzies, Tim
PY - 2010
Y1 - 2010
N2 - Insufficient risk analysis often leads to software system design defects and system failures. Assurance of software risk documents aims to increase the confidence that identified risks are complete, specific, and correct. Yet assurance methods rely heavily on manual analysis that requires significant knowledge of historical projects and subjective, perhaps biased judgment from domain experts. To address the issue, we have developed RARGen, a text mining-based approach based on well-established methods aiming to automatically create and maintain risk repositories to identify usable risk association rules (RARs) from a corpus of risk analysis documents. RARs are risks that have frequently occurred in historical projects. We evaluate RARGen on 20 publicly available e-service projects. Our evaluation results show that RARGen can effectively reason about RARs, increase confidence and cost-effectiveness of risk assurance, and support difficult-to-perform activities such as assuring complete-risk identification.
AB - Insufficient risk analysis often leads to software system design defects and system failures. Assurance of software risk documents aims to increase the confidence that identified risks are complete, specific, and correct. Yet assurance methods rely heavily on manual analysis that requires significant knowledge of historical projects and subjective, perhaps biased judgment from domain experts. To address the issue, we have developed RARGen, a text mining-based approach based on well-established methods aiming to automatically create and maintain risk repositories to identify usable risk association rules (RARs) from a corpus of risk analysis documents. RARs are risks that have frequently occurred in historical projects. We evaluate RARGen on 20 publicly available e-service projects. Our evaluation results show that RARGen can effectively reason about RARs, increase confidence and cost-effectiveness of risk assurance, and support difficult-to-perform activities such as assuring complete-risk identification.
KW - Association rule
KW - Latent semantic analysis
KW - Mining software repositories
KW - Risk assurance
KW - Risk reduction
KW - Text mining
UR - http://www.scopus.com/inward/record.url?scp=78649762455&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=78649762455&partnerID=8YFLogxK
U2 - 10.1145/1858996.1859027
DO - 10.1145/1858996.1859027
M3 - Conference contribution
AN - SCOPUS:78649762455
SN - 9781450301169
T3 - ASE'10 - Proceedings of the IEEE/ACM International Conference on Automated Software Engineering
SP - 163
EP - 166
BT - ASE'10 - Proceedings of the IEEE/ACM International Conference on Automated Software Engineering
T2 - 25th IEEE/ACM International Conference on Automated Software Engineering, ASE'10
Y2 - 20 September 2010 through 24 September 2010
ER -