Data Cataloging
This section answers questions that are related to Data Cataloging in IBM Fusion.
-
- What is Data Cataloging?
- Data Cataloging is an IBM Fusion service that provides the capability of analyze the metadata of customer’s data sources. It provides rapid automated data discovery and robust metadata capture, curation an enrichment.
-
- How can you perform that data analysis and metadata capture?
- In first place, you need to create connections to the wanted data source. After that, you need to select the connection and run a scan over it. You see different statuses when running your scan (as scanning or indexing, for example). Once terminated, you see your data that is indexed in the Data Cataloging database.
-
- What other actions can you do with your data?
- You can do more complex management with your data by adding tag management or running specific policies.
-
- What kind of data sources are supported?
- Data Cataloging supports connection to several kinds of data sources, such as IBM Storage Scale, NFSv4, S3, Cloud Object Storage, IBM Storage Protect, and Server Message Block.
-
- Can we use IBM Storage Scale as storage provider for Data Cataloging?
- Yes, IBM Storage Scale can be used as storage provider to install Data Cataloging. You need to have a Scale cluster to be set as remote storage provider, and use the Global Data Platform service available in IBM Fusion to connect your Fusion cluster with the remote Scale.
-
- Can we use Red Hat® OpenShift® Data Foundation as storage provider for Data Cataloging?
- Yes, Red Hat
OpenShift Data Foundation can be used as storage
provider for Data Cataloging. You just need to install
the Data Cataloging service available in IBM Fusion. Once installed, you can install Data Cataloging. When installing Data Cataloging, be sure that you select the
ocs-storagecluster-cephfs
as storage class.
-
- Can we use any storage class to install Data Cataloging?
- Data Cataloging was designed to be storage-agnostic, which means that you can use any storage provider. Only ensure that the storage class selected meets with the requirements that are specified in the prerequisites section.