February 28, 2024 By Trent Shupe 3 min read

In today’s dynamic software landscape, maintaining high performance and reliability is crucial for businesses. Achieving this requires effective observability and two powerful tools to accomplish this are Steadybit and Instana®. The seamless integration of Steadybit with Instana unlocks proactive reliability engineering techniques and a comprehensive solution for optimizing and managing your applications.

Steadybit chaos engineering platform

Steadybit is a resilience testing platform that is designed to proactively identify weaknesses and potential failures in distributed systems. It empowers organizations to build more robust and resilient applications by simulating various failure scenarios in a controlled environment.

Instana automated observability platform

Instana is a leading observability solution. It provides real-time insights into application health, performance and dependencies, which help teams quickly detect and resolve issues to ensure optimal user experiences.

The power of integration

When these two tools join forces, the result is a comprehensive reliability solution that covers both proactive resilience testing and real-time performance monitoring. With the Instana extension, Steadybit users can gain insights from Instana on their chaos engineering experiments. When executing those experiments, users can check to validate whether Instana observed any events or incidents. When running experiments, you can create an Instana maintenance window directly from the experiment in Steadybit to avoid escalations while running an experiment designed to inject faults.

Key benefits:

By using this new integration, you can validate whether your system works reliably. If the system does not work reliably, Instana will let you know. Using these two powerful tools together provides some key benefits:

  • Reduced time to value: Increase efficiency when implementing your observability strategy. With Steadybit’s ability to precisely model and inject faulty infrastructure conditions into any environment, Instana custom events and alerts can be fine-tuned before the system under observation is deployed to production, effectively shifting observability from a day-2 activity to a day-1 activity.
  • Enhanced reliability:  Steadybit’s resilience testing allows you to identify weaknesses in your system before they impact users. Integrating this with Instana’s monitoring capabilities ensures a holistic approach to system reliability.
  • Faster issue resolution:  Instana’s real-time insights combined with Steadybit’s failure injection capabilities enable teams to quickly identify, isolate and resolve issues, minimizing downtime and improving user satisfaction.
  • Continuous optimization: The integration supports a continuous feedback loop for optimizing system performance. Through Steadybit’s insights, teams can fine-tune applications based on real-world scenarios identified during resilience testing.
  • Cost-efficiency: Proactively addressing potential issues through resilience testing can result in cost savings by preventing large-scale outages and minimizing the need for reactive firefighting.

How to integrate Steadybit with Instana

1. Set up Steadybit’s Instana extension:

Begin by configuring Steadybit to communicate with your Instana instance. For that, you simply install Steadybit’s Instana extension next to your Steadybit agent and provide Instana’s base address and authentication details ( see https://github.com/steadybit/extension-instana ).

2. Identify critical scenario:

Use Steadybit to simulate various failure scenarios such as network outages, service failures and latency spikes. Monitor the impact of these scenarios in real-time using Instana. To start as easy as possible, Steadybit provides a ready-to-be-executed Experiment in their Reliability Hub. This experiment is also explained in this video here.

3. Automated Testing and Monitoring:

Integrate Steadybit into your CI/CD pipeline to automate resilience testing. This ensures that every code change is subjected to a battery of resilience tests before reaching production. To achieve this, you can use Steadybit’s API to run an experiment, a GitHub action or CLI, depending on what fits your context the best. Check out Steadybit’s blog post “Boost your GitOps practices by integrating Chaos Engineering with Steadybit” to learn more.

4. Incident Response Planning:

Utilize the insights gained from both Steadybit and Instana to refine incident response plans. Having a well-defined strategy based on real-world scenarios improves the team’s ability to respond swiftly and effectively.

The integration of Steadybit with Instana presents a powerful synergy for organizations seeking to elevate their observability and resilience practices. By combining proactive resilience testing with real-time performance monitoring, teams can create more robust, reliable, and optimized applications. This integration ultimately contributes to enhanced user experiences, reduced downtime, and increased overall operational efficiency.

Learn more about Steadybit integration with Instana.

Make Steadybit your own Avoid costly downtime by using IBM Instana

More from Automation

Understanding glue records and Dedicated DNS

3 min read - Domain name system (DNS) resolution is an iterative process where a recursive resolver attempts to look up a domain name using a hierarchical resolution chain. First, the recursive resolver queries the root (.), which provides the nameservers for the top-level domain(TLD), e.g.com. Next, it queries the TLD nameservers, which provide the domain’s authoritative nameservers. Finally, the recursive resolver  queries those authoritative nameservers.   In many cases, we see domains delegated to nameservers inside their own domain, for instance, “example.com.” is delegated…

Using dig +trace to understand DNS resolution from start to finish

2 min read - The dig command is a powerful tool for troubleshooting queries and responses received from the Domain Name Service (DNS). It is installed by default on many operating systems, including Linux® and Mac OS X. It can be installed on Microsoft Windows as part of Cygwin.  One of the many things dig can do is to perform recursive DNS resolution and display all of the steps that it took in your terminal. This is extremely useful for understanding not only how the DNS…

The future of application delivery starts with modernization

5 min read - IDC estimates that 750 million cloud native will be built by 2025. Where and how these applications are deployed will impact time to market and value realization. The reality is that application landscapes are complex, and they challenge enterprises to maintain and modernize existing infrastructure, while delivering new cloud-native features. Three in four executives reported disparate systems in their organizations and that a lack of skills, resources and common operational practices challenge business objectives. Executives know they must modernize. In…

IBM Newsletters

Get our newsletters and topic updates that deliver the latest thought leadership and insights on emerging trends.
Subscribe now More newsletters