ExaVault uses Instana to monitor API performance and for error tracking, debugging and alerting. The most important metric ExaVault looks at on a day-to-day basis is latency. “We need to make sure every customer is having a good experience,” Fite says. “If a customer is waiting more than a couple seconds, they might leave.”
With Instana, though, Fite doesn’t have to look at the dashboard all day. Instead, Instana sends an alert to a dedicated Slack channel if anything is out of the ordinary.
When it comes to account-level monitoring, ExaVault uses the Instana software development kit (SDK) to assign metadata to each API call as it comes in. As a result, Fite can filter on a huge number of variables. The most common use case, though, is filtering by account or even by individual users in an account. “If a user is having a problem that we don’t see at the high level, we can drill down and really troubleshoot just looking at their information,” Fite says.
Since ExaVault started using Instana, the mean time to repair (MTTR) for customer-impacting bugs has dropped by 56.6%. In addition, the platform’s slowdowns and downtime have decreased substantially. It was at 99.51% uptime before, and it’s now at 99.99% . “We’re accomplishing the goal that we set out to do,” Fite explains. “The reason we were able to do that is we had better visibility into our problems.”
In some cases, there were bugs ExaVault didn’t even know existed before using Instana. Within days of getting set up with Instana, ExaVault realized there was a bug in the software that was querying the memory cache too frequently and wasn’t saving correctly. Fixing the previously invisible bug immediately reduced the load on application servers.