24×7 Infra Observability For USA Based Travels Company

Summary

LogusIMS provides Texas based company with an integrated observability mechanism, allowing them to quickly pinpoint and resolve technical issues.

Client Profile

The client is a travel technology company headquartered in Texas, USA, that operates a travel marketplace and provides SaaS-driven solutions to travel professionals. Having a global presence with offices in multiple countries, their digital platform offers a wide array of travel inventory and tools, including AI-powered booking assistance and serves a large network travel advisors and other users.

Business Challenge

The client had their infrastructure on OCI with more than 40 microservices helping travelers in booking flights and hotels. The log files generated were huge and it was very difficult for client’s team to parse them manually and identify error details before initiating a fix in the application.

Client wanted to set up an observability mechanism to handle huge log files with the ability to identify specific failure points, API names and error code as it was daunting task to parse them manually.

LogusIMS Solution

LogusIMS suggested to setup an observability mechanism to handle huge log files using combination of two different tools viz.,

  • Datadog to;
    • Get waterfall model details of error/exception
    • Latency details for every API end point integrated with
  • GroundCover to;
    • Produce the request/response body traceability
    • Get resource utilization details
    • Get complete pod logs in Kubernetes cluster

Above mechanism provides the ability to quickly identify details of specific failure point, API endpoint and the error code.

This mechanism also helped in controlling developers’ access to OCI console log since it was enough to provide read-only access to Datadog and GroundCover to debug issues.

Client Benefits

  • Easy & quick identification of error/exception details
  • Reduced Turn Around Time for resolution
  • Prevention of developers’ access to OCI console’s log files
  • Early identification of endpoint failure based on daily reports
  • Resource fine tuning based on utilization, resulting in cost savings

Technology/Tools

  • OCI
  • Kubernetes
  • Datadog
  • GroundCover

Share:

Fill out the form below with your details and any specific questions or comments you may have.