September 19, 2023 By Cross Ganaway 3 min read

Organizations today struggle to detect, identify and act on business operations incidents. The gap between business and IT continues to grow, leaving orgs unable to link IT outages to business impact.  Site reliability engineers (SREs) want to understand business impact to better prioritize their work but don’t have a way of monitoring business KPIs. They struggle to link IT outages to business impacts because data is often siloed and knowledge is tribal. It forces teams into a highly reactive mode because they only become aware of the business impact when it’s reported by the business team.

Combining a process orchestration solution like Camunda with an observability solution like IBM Instana can break down these silos, help teams better understand their systems and react faster.

What is Camunda?

Camunda enables process orchestration so that you can fully automate your business processes. Camunda customers are at the forefront of understanding and then digitizing and automating their business workflows.

How do the IT operators (ITOps) or site reliability engineers (SREs) understand the business context of what they see when they monitor Camunda with typical application performance monitoring (APM) observability tools?

The problem with traditional APM observability tools

APM observability tools specialize in leveraging traces, metrics and logs to help IT keep applications healthy and running. These tools typically understand a variety of technologies and can even automatically start monitoring them when an agent is deployed. But most APM observability tools treat automation platforms like Camunda as whatever Java/JEE container they execute within. They fail to provide ITOps or SREs any business context in their normal day-to-day operations.

Combining IBM Instana and Camunda for a fuller picture

Now Instana Observability Business Monitoring from IBM can understand when it’s being used to monitor a Camunda environment and automatically add business-process context to IT traces that ITOps and SREs use for troubleshooting. There is also a business activity view where business processes can be viewed in the context of any IT issues that might be impacting them:

This improvement means that an SRE’s troubleshooting experience goes from seeing traces that only have IT calls like RESTful or database calls to an experience where Instana understands and adds the business process or activity that those calls are implementing:

Optionally, data from the executing activities can be captured along with the trace. Since Instana does not sample trace information, capturing these data can unlock use cases like understanding the true impact of an IT outage.

Camunda and IBM Instana in action

Imagine that Carlos is an SRE whose company is using Camunda. One day, Carlos gets an alert from Camunda that one of the automated activities is stuck. Camunda shows an exception in its log indicating that an external task is failing. Carlos is used to troubleshooting the end-to-end automated processes orchestrated with Camunda in detail but has limited visibility into the issue with the external task.

With Instana monitoring both Camunda and the endpoints implementing those external tasks, Carlos can find the business process or activity within Instana, analyze all of the individual calls within the affected time period and view the end-to-end trace for one of the failing ones. Instana can track calls made to the external endpoint and tell Carlos that there is a database call the external task depends on that is timing out. Now Carlos can focus on getting that database healthy and the activities back on track. 

Learn more

Learn more about how you can effectively use Camunda and IBM Instana together in this on-demand webinar, “Real-Time Business Context for IT. Measure Yourself the Way the Business Measures You.”

Watch the webinar
Was this article helpful?
YesNo

More from Automation

Deployable architecture on IBM Cloud: Simplifying system deployment

3 min read - Deployable architecture (DA) refers to a specific design pattern or approach that allows an application or system to be easily deployed and managed across various environments. A deployable architecture involves components, modules and dependencies in a way that allows for seamless deployment and makes it easy for developers and operations teams to quickly deploy new features and updates to the system, without requiring extensive manual intervention. There are several key characteristics of a deployable architecture, which include: Automation: Deployable architecture…

Understanding glue records and Dedicated DNS

3 min read - Domain name system (DNS) resolution is an iterative process where a recursive resolver attempts to look up a domain name using a hierarchical resolution chain. First, the recursive resolver queries the root (.), which provides the nameservers for the top-level domain(TLD), e.g.com. Next, it queries the TLD nameservers, which provide the domain’s authoritative nameservers. Finally, the recursive resolver  queries those authoritative nameservers.   In many cases, we see domains delegated to nameservers inside their own domain, for instance, “example.com.” is delegated…

Using dig +trace to understand DNS resolution from start to finish

2 min read - The dig command is a powerful tool for troubleshooting queries and responses received from the Domain Name Service (DNS). It is installed by default on many operating systems, including Linux® and Mac OS X. It can be installed on Microsoft Windows as part of Cygwin.  One of the many things dig can do is to perform recursive DNS resolution and display all of the steps that it took in your terminal. This is extremely useful for understanding not only how the DNS…

IBM Newsletters

Get our newsletters and topic updates that deliver the latest thought leadership and insights on emerging trends.
Subscribe now More newsletters