Abstract
Monitoring and diagnosing performance issues of an online service system are critical to assure satisfactory performance of the system. Given a detected performance issue and collected system metrics for an online service system, engineers usually need to make great efforts to conduct diagnosis by first identifying performance issue beacons, which are metrics that pinpoint to the root causes. In order to reduce the manual efforts, in this paper, we propose a new approach to effectively detecting performance issue beacons to help with performance issue diagnosis. Our approach includes techniques for mining system metric data to address limitations when applying previous classification-based approaches. Our evaluations on both a controlled environment and a real production environment show that our approach can more effectively identify performance issue beacons from system metric data than previous approaches.
Original language | English (US) |
---|---|
Article number | 6424866 |
Pages (from-to) | 273-278 |
Number of pages | 6 |
Journal | Proceedings of the IEEE Symposium on Reliable Distributed Systems |
DOIs | |
State | Published - 2012 |
Externally published | Yes |
Event | 31st IEEE International Symposium on Reliable Distributed Systems, SRDS 2012 - Irvine, CA, United States Duration: Oct 8 2012 → Oct 11 2012 |
Keywords
- class association rule
- monitoring data analysis
- performance issue diagnosis
ASJC Scopus subject areas
- Software
- Theoretical Computer Science
- Hardware and Architecture
- Computer Networks and Communications