TY - GEN
T1 - Dynamic system-wide reconfiguration of grid deployments in response to intrusion detections
AU - Rowanhill, Jonathan
AU - Wasson, Glenn
AU - Hill, Zach
AU - Basney, Jim
AU - Kiryakov, Yuliyan
AU - Knight, John
AU - Nguyen-Tuong, Anh
AU - Grimshaw, Andrew
AU - Humphrey, Marty
PY - 2007
Y1 - 2007
N2 - As Grids become increasingly relied upon as critical infrastructure, it is imperative to ensure the highly-available and secure day-to-day operation of the Grid infrastructure. The current approach for Grid management is generally to have geographically-distributed system administrators contact each other by phone or email to debug Grid behavior and subsequently modify or reconfigure the deployed Grid software. For security-related events such as the required patching of vulnerable Grid software, this ad hoc process can take too much time, is error-prone and tedious, and thus is unlikely to completely solve the problems. In this paper, we present the application of the ANDREA management system to control Grid service functionality in near-real-time at scales of thousands of services with minimal human involvement. We show how ANDREA can be used to better ensure the security of the Grid: In experiments using 11,394 Globus Toolkit v4 deployments we show the performance of ANDREA for three increasingly-sophisticated reactions to an intruder detection: shutting down the entire Grid; incrementally eliminating Grid service for different classes of users; and issuing and applying a patch to the vulnerability exploited by the attacker. We believe that this work is an important first step toward automating the general day-to-day monitoring and reconfiguration of all aspects of Grid deployments.
AB - As Grids become increasingly relied upon as critical infrastructure, it is imperative to ensure the highly-available and secure day-to-day operation of the Grid infrastructure. The current approach for Grid management is generally to have geographically-distributed system administrators contact each other by phone or email to debug Grid behavior and subsequently modify or reconfigure the deployed Grid software. For security-related events such as the required patching of vulnerable Grid software, this ad hoc process can take too much time, is error-prone and tedious, and thus is unlikely to completely solve the problems. In this paper, we present the application of the ANDREA management system to control Grid service functionality in near-real-time at scales of thousands of services with minimal human involvement. We show how ANDREA can be used to better ensure the security of the Grid: In experiments using 11,394 Globus Toolkit v4 deployments we show the performance of ANDREA for three increasingly-sophisticated reactions to an intruder detection: shutting down the entire Grid; incrementally eliminating Grid service for different classes of users; and issuing and applying a patch to the vulnerability exploited by the attacker. We believe that this work is an important first step toward automating the general day-to-day monitoring and reconfiguration of all aspects of Grid deployments.
UR - http://www.scopus.com/inward/record.url?scp=38149092547&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=38149092547&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-75444-2_29
DO - 10.1007/978-3-540-75444-2_29
M3 - Conference contribution
AN - SCOPUS:38149092547
SN - 9783540754435
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 260
EP - 272
BT - High Performance Computing and Communications - Third International Conference, HPCC 2007, Proceedings
PB - Springer
T2 - 3rd International Conference on High Performance Computing and Communications, HPCC 2007
Y2 - 26 September 2007 through 28 September 2007
ER -