With a simple one click export, metadata organized in IBM Spectrum Discover can be easily integrated with IBM Watson Knowledge Catalog. Data users can then leverage enterprise file and object data using IBM Cloud Pak for Data and IBM AI solutions.
NEW! Optimize data with more granularity
IBM Spectrum Discover now contains a new policy engine that can help optimize data capacity and data location based on defined data policies. The first release of this capability optimizes IBM Spectrum Scale capacity by using data and custom tags defined in Spectrum Discover to bring more granular capacity optimization.
Supports heterogeneous file and object storage
Supports both IBM and non-IBM storage systems on-premises and in the cloud, including IBM Spectrum Scale, IBM Cloud Object Storage, IBM Spectrum Protect, Red Hat Ceph Storage, Dell-EMC Isilon, NetApp, Amazon S3 and Windows SMB.
Policy-based metadata tagging for data classification
IBM Spectrum Discover automatically captures system metadata from source storage systems, creates custom metadata from search results and enables extraction of keyword metadata from file headers and content using the Action Agent API. The result is a rich layer of file and object metadata that is managed using one centralized solution.
Dashboard and customizable reporting
The dashboard represents the user environment at a glance. What a user can see or not see is determined using role-based access controls. The dashboard can show usage versus capacity of their registered storage systems and information about potential duplicate files. For users who want additional record detail, IBM Spectrum Discover provides customizable reports. Both summary and detailed reports can be generated.
Continuous metadata ingestion without rescan
When used with IBM Cloud Object Storage, IBM Spectrum Scale or Red Hat Ceph storage, the software provides continuous metadata ingestion. Built-in connectors provide integration with IBM and Red Hat storage systems. Live event notifications automate continuous metadata ingestion. Metadata indexing enables rapid data queries. Customers can scan up to 30,000 records per second — up to 1 billion files in an 8-hour day.
Fast searching enables rapid discovery of data assets
The metadata management software provides both a search bar and a more advanced search pane to help users quickly find subsets of records that have been indexed. Search results are displayed in a columnar table that contains information correlated to search criteria. What a user can see or not see is determined using role-based access controls.
Content-based tagging and search
Apply custom metadata tags based on the occurrence of user-definable keywords found in the content of supported file types, then quickly find that data with low-latency searches using those tags.
Secure and extensible architecture
Role-based access control ensures that only authorized users have access to data. The Action Agent API supports integration with customer-developed and/or third-party software, and policy engine hooks enable automated workflows.
Automatically identify and classify sensitive data
IBM Spectrum Discover automatically identifies and classifies data containing certain kinds of sensitive or personally identifiable information.
Community-supported catalog of third-party extensions
IBM Spectrum Discover Action Agent Catalog enables clients to discover, install and manage third-party Action Agents from a community-supported ecosystem to extend the capabilities of IBM Spectrum Discover without having to write their own code.