October 30, 2018 By IBM Instana Team 4 min read

If you’re involved in troubleshooting or optimizing application performance, your life just got much easier with the IBM Instana™ platform’s newly released analyze capabilities. And the best part—you don’t have to pay any extra fees to use our analytics capabilities, so you can benefit from them immediately if you’re already an IBM Instana client.

Troubleshooting and optimizing all the microservices

In our last significant release, we introduced Application Perspectives—the ability to create dynamic application definitions that continuously update as you release new software and scale your environment. Along with this functionality, we delivered a set of curated dashboards that are used to quickly identify any requests that are slow or throwing errors.

In this release, you have two primary ways of getting to the Analyze screens:

  1. You can drill down from an Application Perspectives dashboard to analyze traces or calls. Benefit: The Analyze screen is already prefiltered using the context of the Application Perspectives, service, service endpoint, trace, call, error message, database call and so on.
  2. You can select the top-level Analyze menu item to begin your analysis with the complete data set contained within the IBM Instana platform. Benefit: It’s the perfect place to search for optimization opportunities without being limited to any given portion of the data set.

What are traces and calls?

You need to understand the concept of traces and calls to take full advantage of the data within the IBM Instana platform and solve difficult performance problems.

Quoting the IBM Instana Observability documentation:

  • “A trace is the sequence of synchronous and asynchronous calls between service endpoints. Services talk to each other and deliver a result for a user request. Transforming data in a data flow can involve many services.”
  • “A call describes an activity within a monitored process, typically a request between two services.”

A trace is composed of one or more calls. A call is composed of either one or two spans:

  • Entry span: For example, an HTTP request from an uninstrumented process to an instrumented process.
  • Exit span: For example, a database call from an instrumented process to a database. Databases aren’t instrumented.
  • Exit plus entry span pair: For example, an HTTP request from an instrumented process to another instrumented process. There will be an exit span for the client process and an entry span for the process serving the response.
  • Intermediate span: For example, custom spans are added through the software development kit (SDK), instrumented in-process caching or instrumented view technologies.

Troubleshooting performance problems—from Application Perspectives to Analyze

Application Perspectives offers a set of curated dashboards that make it easy for anyone to identify problematic services, endpoint traces, or calls. When you’ve identified a problem, you can immediately jump to Analyze to explore the traces and calls in the exact context you drilled down from.

Application and service performance optimization through Analyze

Site reliability engineers (SREs), developers and DevOps are all interested in finding opportunities to optimize the services they write or support. Historically, the hard part has been identifying opportunities to make improvements in an easy and focused manner. Most monitoring tools have so much data contained within them that it’s hard to filter out the noise to find the interesting signal contained inside them.

If you’re responsible for identifying opportunities for performance optimization, meet your new best friend.

Figure 1: Extensive filtering capabilities enable deep analysis of the complete data set.

As you can see in Figure 1, exploring the performance of every application, service, endpoint, request type and so on is as simple as finding a place to stay on Airbnb. You’ll find an extensive selection of filters located above the Results section of the Analyze screen. From there you can filter by application, service, endpoint, type, technology, latency, erroneous and more—which gives you access to every tag available within the IBM Instana platform.

Figure 2: Charting capability within Analyze for both traces and calls.

This powerful set of filters allows you to identify traces or calls in whatever manner you need to accomplish your goal. For example, I can set my filters to:

        call.type = Database
        call.latency > 1000 ms
        Grouped By = call.database.statement

The result is that I see a grouped list of every unique database query that takes longer than 1,000 ms to execute across all my services. This list is linked to the underlying data so you can click each query to drill down and explore each statement in context of the associated trace.

Figure 3: Filters applied within Analyze to find database queries that require optimization.

If I’m interested in finding slow HTTP requests, I can set my filters for the following:

        call.type = Http
        call.latency > 5000
        Grouped By = trace.endpoint.name

To learn more, check out the IBM Instana site and give the free trial a shot.

Sign up for a free, two-week trial
Was this article helpful?
YesNo

More from IBM Instana

The business-IT connection: Real-time insights with Instana Observability

4 min read - Applications play a crucial role in both business and IT operations. However, IT organizations often lack visibility into the specific business processes their applications support and They also lack visibility into how these processes connect with the underlying systems. Business processes are a collection of linked activities that produce a business outcome. These processes run across different IT systems, including mobile and web platforms, applications and the infrastructure they rely on. Business process management (BPM) tools play a pivotal role…

Probable Root Cause: Accelerating incident remediation with causal AI 

5 min read - It has been proven time and time again that a business application’s outages are very costly. The estimated cost of an average downtime can run USD 50,000 to 500,000 per hour, and more as businesses are actively moving to digitization. The complexity of applications is growing as well, so Site Reliability Engineers (SREs) require hours—and sometimes days—to identify and resolve problems.   To alleviate this problem, we have introduced the new feature Probable Root Cause as part of Intelligent Incident…

Instana 2023: Recapping our latest innovation

7 min read - You asked, and we delivered! At Instana, addressing our customers' needs and creating a simple tool that is easy to use is fundamental to helping our DevOps and SRE teams reduce burnout rates, allowing them to excel in what they do best. Taking all your feedback and market insights into perspective and careful consideration, we are thrilled to announce that in 2023. Our team announced different product capabilities designed to simplify your teams' ability to observe, debug, remediate and enhance…

IBM Newsletters

Get our newsletters and topic updates that deliver the latest thought leadership and insights on emerging trends.
Subscribe now More newsletters