Automated scanning of an IBM Storage Scale fileset
As an administrator, you can initiate an IBM Storage Scale scan from IBM Spectrum® Discover to collect system metadata from one or more IBM Storage Scale file sets.
Before you begin
This feature adds a requirement for non-root user IDs that are used for scanning IBM Storage Scale data source systems. To retrieve the list of available file sets from the target system, the feature runs the mmlsfileset command, which needs root-level permissions. If you scan with a non-root user ID, that user ID must therefore have sudo access to mmlsfileset for this function to work.
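One way to confirm the sudo requirement before you start a scan is to run mmlsfileset through sudo as the scanning user and check the exit code. The following Python sketch does this under stated assumptions: the command path /usr/lpp/mmfs/bin/mmlsfileset is the usual installation location but might differ on your system, and the function names and the device name in the usage note are hypothetical. It is an illustration, not part of IBM Spectrum Discover.

```python
import subprocess

# Assumption: usual install location of the Storage Scale commands.
MMLSFILESET = "/usr/lpp/mmfs/bin/mmlsfileset"


def build_check_command(device):
    """Return the argument list that lists the file sets of `device` through sudo."""
    # -n makes sudo fail immediately instead of prompting for a password.
    return ["sudo", "-n", MMLSFILESET, device]


def can_list_filesets(device, runner=subprocess.run):
    """Return True if the current user can run mmlsfileset through sudo.

    `runner` defaults to subprocess.run and can be replaced for testing.
    """
    try:
        result = runner(
            build_check_command(device),
            capture_output=True,
            text=True,
            timeout=60,
        )
    except (OSError, subprocess.TimeoutExpired):
        # sudo is missing, cannot be started, or did not return in time.
        return False
    return result.returncode == 0
```

Run as the scanning user on the IBM Storage Scale node, a call such as can_list_filesets("gpfs0"), where gpfs0 is a placeholder file system device name, returns False if sudo access to mmlsfileset is not in place.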
About this task
Scan one or more IBM Storage Scale file sets to insert or update the IBM Spectrum Discover records for the files that are found in those file sets. The scan is scoped to the specified file set, so it typically completes faster than a scan of the entire file system. You can specify multiple file sets in a single scan operation, but the file sets are scanned one after another.
- The status message indicates which file set is being scanned.
- The status message indicates when data operations (such as transferring files or indexing data) occur.
This feature works irrespective of whether the data is returned to IBM Spectrum Discover by using a direct Kafka connection or by using the file copy method. After a file set level scan completes, a scan generation is recorded or committed.
Additionally, an internal reclamation policy is generated to remove the records of deleted files, that is, files that did not appear in the updated scan. The scope of this reclamation policy is limited to the file set that is scanned and does not affect other file sets or the actual file system. This scoping helps you achieve consistency with the source IBM Storage Scale system at file set level granularity.
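The interplay of scan generations and file set scoped reclamation can be pictured with a small conceptual model. The Python sketch below is not the product's implementation: the in-memory catalog, the integer generation counter, and the function name apply_fileset_scan are all assumptions made for illustration. It shows only the behavior described above, namely that records are inserted or updated, stale records are removed from the scanned file set alone, and a generation is committed when the file set completes.

```python
def apply_fileset_scan(catalog, generations, fileset, scanned):
    """Upsert scanned records for one file set, then reclaim stale ones.

    catalog:     dict mapping (fileset, path) -> record dict (hypothetical model)
    generations: dict mapping fileset -> last committed scan generation
    scanned:     dict mapping path -> metadata found by the latest scan
    Returns the sorted list of paths whose records were reclaimed.
    """
    new_generation = generations.get(fileset, 0) + 1

    # Insert or update a record for every file that the scan found.
    for path, metadata in scanned.items():
        catalog[(fileset, path)] = {**metadata, "generation": new_generation}

    # Reclaim only records of this file set that the scan did not refresh.
    stale = [
        key
        for key, record in catalog.items()
        if key[0] == fileset and record["generation"] < new_generation
    ]
    for key in stale:
        del catalog[key]

    # Commit the generation after the file set is fully processed.
    generations[fileset] = new_generation
    return sorted(path for _, path in stale)
```

In this model, rescanning one file set after a file was deleted from it removes only that file's record. Records that belong to other file sets keep their older generation and are left untouched.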