Two overlapping screenshots of IBM Spectrum Discover


Easily identify, prepare and optimize file and object data to simplify AI organization and achieve faster business results. Leverage IBM Watson® Knowledge Catalog and IBM Cloud Pak® for Data with a simple one-click integration for additional value and insights. Create a successful AI infrastructure with powerful ingest, data mapping, visualization and data activation capabilities.


Save time

Automate cataloging of unstructured data by capturing metadata as it is created. IBM Spectrum Discover supports multiple file and object storage systems from IBM and other vendors.

Increase productivity

Enable comprehensive insight by combining system metadata with custom tags to increase productivity for storage administrators and anyone looking for insights into large data repositories.

Integrate easily

Leverage extensibility using the Action Agent API, custom tags and policy-based workflows to orchestrate deeper content inspection and activate data in AI, machine learning and analytics workflows.

Key features of IBM Spectrum Discover

NEW! Integration to IBM Cloud Pak for Data

Visit product page

With a simple one click export, metadata organized in IBM Spectrum Discover can be easily integrated with IBM Watson Knowledge Catalog. Data users can then leverage enterprise file and object data using IBM Cloud Pak for Data and IBM AI solutions.

NEW! Optimize data with more granularity

IBM Spectrum Discover now contains a new policy engine that can help optimize data capacity and data location based on defined data policies. The first release of this capability optimizes IBM Spectrum Scale capacity by using data and custom tags defined in Spectrum Discover to bring more granular capacity optimization.

Supports heterogeneous file and object storage

Supports both IBM and non-IBM storage systems on-premises and in the cloud, including IBM Spectrum Scale, IBM Cloud Object Storage, IBM Spectrum Protect, Red Hat Ceph Storage, Dell-EMC Isilon, NetApp, Amazon S3 and Windows SMB.

Policy-based metadata tagging for data classification

IBM Spectrum Discover automatically captures system metadata from source storage systems, creates custom metadata from search results and enables extraction of keyword metadata from file headers and content using the Action Agent API. The result is a rich layer of file and object metadata that is managed using one centralized solution.

Dashboard and customizable reporting

The dashboard represents the user environment at a glance. What a user can see or not see is determined using role-based access controls. The dashboard can show usage versus capacity of their registered storage systems and information about potential duplicate files. For users who want additional record detail, IBM Spectrum Discover provides customizable reports. Both summary and detailed reports can be generated.

Continuous metadata ingestion without rescan

When used with IBM Cloud Object Storage, IBM Spectrum Scale or Red Hat Ceph storage, the software provides continuous metadata ingestion. Built-in connectors provide integration with IBM and Red Hat storage systems. Live event notifications automate continuous metadata ingestion. Metadata indexing enables rapid data queries. Customers can scan up to 30,000 records per second — up to 1 billion files in an 8-hour day.

Fast searching enables rapid discovery of data assets

The metadata management software provides both a search bar and a more advanced search pane to help users quickly find subsets of records that have been indexed. Search results are displayed in a columnar table that contains information correlated to search criteria. What a user can see or not see is determined using role-based access controls.

Content-based tagging and search

Apply custom metadata tags based on the occurrence of user-definable keywords found in the content of supported file types, then quickly find that data with low-latency searches using those tags.

Secure and extensible architecture

Role-based access control ensures that only authorized users have access to data. The Action Agent API supports integration with customer-developed and/or third-party software, and policy engine hooks enable automated workflows.

Automatically identify and classify sensitive data

IBM Spectrum Discover automatically identifies and classifies data containing certain kinds of sensitive or personally identifiable information.

Community-supported catalog of third-party extensions

IBM Spectrum Discover Action Agent Catalog enables clients to discover, install and manage third-party Action Agents from a community-supported ecosystem to extend the capabilities of IBM Spectrum Discover without having to write their own code.

Which option is right for you?

IBM Spectrum Discover Free Trial

Unleash metadata-fueled insights for your unstructured data -- free for 90 days.

Monthly License

Enjoy the flexibility of a monthly license for IBM Spectrum Discover

Perpetual License

Ideal for enterprises that prefer the convenience and lasting benefits of a perpetual license