Watson AIOps component overview

Note: IBM Watson AIOps is available as an IBM Cloud Pak. For details and documentation, see IBM Cloud Pak for Watson AIOps.

IBM Watson® AIOps 2.0 is composed of four components:

You can also purchase extensions that add capabilities to your Watson AIOps implementation.

Watson AIOps AI Manager

AI Manager collects information from all of your IT assets, such as applications, the infrastructure that they run on, and the networking systems that support them. It then uses that data to uncover hidden insights and identify root causes of events. Training your models on this data leads to better event discovery and a more accurate understanding of your topology. With this understanding of your topology, you can pinpoint where events occur (fault localization) and how far-reaching their impact is (blast radius). When AI Manager detects a potential incident, it creates a story about the incident. It then uses a ChatOps environment (Slack) to notify your team, keep the team updated, and suggest potential remedies in near real time.
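The idea of a blast radius can be illustrated with a small graph traversal. The following is a minimal sketch only, not AI Manager's implementation: the resource names, the `topology` mapping, and the `blast_radius` function are all hypothetical, and the sketch assumes that the topology is available as a simple "is depended on by" mapping.

```python
from collections import deque


def blast_radius(dependents, faulty, max_hops=None):
    """Return {resource: hops} for every resource affected by `faulty`.

    `dependents` maps a resource to the resources that depend on it
    (a hypothetical, simplified view of a topology).
    """
    hops = {faulty: 0}
    queue = deque([faulty])
    while queue:
        node = queue.popleft()
        # Stop expanding once the optional hop limit is reached.
        if max_hops is not None and hops[node] >= max_hops:
            continue
        for nxt in dependents.get(node, ()):
            if nxt not in hops:
                hops[nxt] = hops[node] + 1
                queue.append(nxt)
    del hops[faulty]  # report only the impacted resources
    return hops


# Hypothetical topology: key -> resources that depend on the key.
topology = {
    "db-01": ["orders-api", "billing-api"],
    "orders-api": ["web-frontend"],
    "billing-api": ["web-frontend"],
    "cache-01": ["orders-api"],
    "web-frontend": [],
}

impact = blast_radius(topology, "db-01")
print(impact)
```

In this sketch, a fault that is localized to `db-01` reaches both APIs in one hop and the front end in two, which is the kind of impact estimate that the blast radius describes.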

For more information about AI Manager, see Deriving insight into IT operations with Watson AIOps AI Manager.

For more information about installing AI Manager, see Installing the Watson AIOps AI Manager service.

Watson AIOps Event Manager and Topology

Event Manager monitors the health and performance of IT and network infrastructure across local, cloud, and hybrid environments. It incorporates event management capabilities, and combines real-time alarm and alert analytics with broader historical data analytics to deliver actionable insight into the performance of services and their associated dynamic network and IT infrastructures. Event Manager helps organizations collect, consolidate, and correlate events and topology data from virtually any source.

Integrated service and topology management provides complete up-to-date visibility and control over dynamic infrastructure and services, which is configurable for real-time or historical viewing.

Watson AIOps brings together the capability to group events that are generated from structured, semi-structured, and unstructured data types. AI Manager and Event Manager work together to correlate events to entities. You can use the results of that correlation to pinpoint where in your topology incidents are occurring.

For more information about monitoring the health and performance of your IT and network infrastructure, see IBM Netcool Operations Insight 1.6.2.

For more information about observing and controlling networked resources in the context of a configurable topology, see IBM Netcool Agile Service Manager 1.1.9.

For more information about installing Event Manager with AI Manager, see Installing Watson AIOps Event Manager.

Watson AIOps Metric Manager

Metric Manager analyzes performance and monitoring data across silos, domains, vendors, and systems to understand the normal operational behavior of the metrics in an organization's environment. Based on data that is ingested from multiple sources and integral time-series algorithms, Metric Manager creates performance models on which it can detect or forecast behavior outside of the modeled range.

Watson AIOps uses this metric-based anomaly detection as an extra source of potential events. Metric Manager helps users proactively avoid issues before their organizations are impacted.
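As a rough illustration of metric-based anomaly detection, the following sketch learns a "normal" range from historical samples and flags new samples that fall outside it. It is an assumption-laden example, not Metric Manager's algorithm: it substitutes a simple mean and standard deviation band for the product's time-series models, and the function names and sample values are hypothetical.

```python
from statistics import mean, stdev


def fit_baseline(history, k=3.0):
    """Model normal behavior as mean +/- k sample standard deviations."""
    mu = mean(history)
    sigma = stdev(history)
    return (mu - k * sigma, mu + k * sigma)


def detect_anomalies(samples, band):
    """Return (index, value) pairs that fall outside the modeled range."""
    low, high = band
    return [(i, v) for i, v in enumerate(samples) if v < low or v > high]


# Hypothetical response-time history (ms) used to build the baseline.
history = [100, 102, 98, 101, 99, 100, 103, 97]
band = fit_baseline(history)

# New samples to check against the baseline.
new_samples = [101, 99, 140, 100, 60]
events = detect_anomalies(new_samples, band)
print(band, events)
```

Each flagged sample could then be raised as a potential event, in the same way that the text describes anomalies feeding into event processing.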

For more information about analyzing data and understanding system behavior, see IBM Operations Analytics Predictive Insights 1.3.6.

For more information about installing Metric Manager with AI Manager, see Installing Watson AIOps Metric Manager.

Extensions

Extensions are separately purchased and separately licensed components that extend the application and network infrastructure capabilities of Watson AIOps.

For more information about purchasing and installing extensions, see Installing Watson AIOps extensions.