Client Profile
The client is a travel technology company headquartered in Texas, USA, that operates a travel marketplace and provides SaaS-driven solutions to travel professionals. Having a global presence with offices in multiple countries, their digital platform offers a wide array of travel inventory and tools, including AI-powered booking assistance and serves a large network travel advisors and other users.
Business Challenge
The client had their infrastructure on OCI with more than 40 microservices helping travelers in booking flights and hotels. The log files generated were huge and it was very difficult for client’s team to parse them manually and identify error details before initiating a fix in the application.
Client wanted to set up an observability mechanism to handle huge log files with the ability to identify specific failure points, API names and error code as it was daunting task to parse them manually.
LogusIMS Solution
LogusIMS suggested to setup an observability mechanism to handle huge log files using combination of two different tools viz.,
- Datadog to;
- Get waterfall model details of error/exception
- Latency details for every API end point integrated with
- GroundCover to;
- Produce the request/response body traceability
- Get resource utilization details
- Get complete pod logs in Kubernetes cluster
Above mechanism provides the ability to quickly identify details of specific failure point, API endpoint and the error code.
This mechanism also helped in controlling developers’ access to OCI console log since it was enough to provide read-only access to Datadog and GroundCover to debug issues.