TraceWeaver: Distributed Request Tracing for Microservices Without Application Modification

Sachin Ashok, Vipul Harsh, Brighten Godfrey, Radhika Mittal, Srinivasan Parthasarathy, Larisa Shwartz

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Monitoring and debugging modern cloud-based applications is challenging since even a single API call can involve many interdependent distributed microservices. To provide observability for such complex systems, distributed tracing frameworks track request flow across the microservice call tree. However, such solutions require instrumenting every component of the distributed application to add and propagate tracing headers, which has slowed adoption. This paper explores whether we can trace requests without any application instrumentation, which we refer to as request trace reconstruction. To that end, we develop TraceWeaver, a system that incorporates readily available information from production settings (e.g., timestamps) and test environments (e.g., call graphs) to reconstruct request traces with usefully high accuracy. At the heart of TraceWeaver is a reconstruction algorithm that uses request-response timestamps to effectively prune the search space for mapping requests and applies statistical timing analysis techniques to reconstruct traces. Evaluation with (1) benchmark microservice applications and (2) a production microservice dataset demonstrates that TraceWeaver can achieve a high accuracy of ∼90% and can be meaningfully applied towards multiple use cases (e.g., finding slow services and A/B testing).

Original languageEnglish (US)
Title of host publicationACM SIGCOMM 2024 - Proceedings of the 2024 ACM SIGCOMM 2024 Conference
PublisherAssociation for Computing Machinery
Pages828-842
Number of pages15
ISBN (Electronic)9798400706141
DOIs
StatePublished - Aug 4 2024
Event2024 ACM SIGCOMM Conference, ACM SIGCOMM 2024 - Sydney, Australia
Duration: Aug 4 2024Aug 8 2024

Publication series

NameACM SIGCOMM 2024 - Proceedings of the 2024 ACM SIGCOMM 2024 Conference

Conference

Conference2024 ACM SIGCOMM Conference, ACM SIGCOMM 2024
Country/TerritoryAustralia
CitySydney
Period8/4/248/8/24

Keywords

  • distributed tracing
  • graph analysis
  • microservices
  • non-intrusive

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Hardware and Architecture
  • Software

Fingerprint

Dive into the research topics of 'TraceWeaver: Distributed Request Tracing for Microservices Without Application Modification'. Together they form a unique fingerprint.

Cite this