DocumentCode
652855
Title
Detecting Transient Bottlenecks in n-Tier Applications through Fine-Grained Analysis
Author
Qingyang Wang ; Kanemasa, Yasuhiko ; Li, Jie ; Jayasinghe, Danushka ; Shimizu, Tsuyoshi ; Matsubara, Masaki ; Kawaba, Motoyuki ; Pu, Calton
Author_Institution
Georgia Inst. of Technol., Atlanta, GA, USA
fYear
2013
fDate
8-11 July 2013
Firstpage
31
Lastpage
40
Abstract
Identifying the location of performance bottlenecks is a non-trivial challenge when scaling n-tier applications in computing clouds. Specifically, we observed that an n-tier application may experience significant performance loss when there are transient bottlenecks in component servers. Such transient bottlenecks arise frequently at high resource utilization and often result from transient events (e.g., JVM garbage collection) in an n-tier system and bursty workloads. Because of their short lifespan (e.g., milliseconds), these transient bottlenecks are difficult to detect using current system monitoring tools with sampling at intervals of seconds or minutes. We describe a novel transient bottleneck detection method that correlates throughput (i.e., request service rate) and load (i.e., number of concurrent requests) of each server in an n-tier system at fine time granularity. Both throughput and load can be measured through passive network tracing at millisecond-level time granularity. Using correlation analysis, we can identify the transient bottlenecks at time granularities as short as 50ms. We validate our method experimentally through two case studies on transient bottlenecks caused by factors at the system software layer (e.g., JVM garbage collection) and architecture layer (e.g., Intel SpeedStep).
Keywords
cloud computing; file servers; resource allocation; architecture layer; component servers; computing clouds; correlation analysis; fine time granularity; fine-grained analysis; millisecond-level time granularity; n-tier applications; passive network tracing; resource utilization; system monitoring tools; system software layer; transient bottleneck detection; transient events; Monitoring; Passive networks; Servers; Throughput; Time factors; Time measurement; Transient analysis; Performance evaluations; Web-facing applications; bottleneck; n-tier system; scalability;
fLanguage
English
Publisher
ieee
Conference_Titel
Distributed Computing Systems (ICDCS), 2013 IEEE 33rd International Conference on
Conference_Location
Philadelphia, PA
ISSN
1063-6927
Type
conf
DOI
10.1109/ICDCS.2013.17
Filename
6681573
Link To Document