Automated scanning of an IBM Storage Scale data source

As an administrator, you can initiate an IBM Storage Scale scan from IBM Spectrum® Discover to collect system metadata from IBM Storage Scale file system.

Before you begin

You can include or exclude the files during initial IBM Storage Scale scan process by configuring the following environment variable:
INCLUDE_SCALE_SNAPSHOTS
When the INCLUDE_SCALE_SNAPSHOTS variable value is set to 'false' (default value), the IBM Storage Scale scan excludes all the files that are inside the .snapshots directories, otherwise, if the variable value is set to 'true', the scan includes all the files, including the .snapshots directories.

About this task

To set the INCLUDE_SCALE_SNAPSHOTS variable by using configmap, see Enabling skip snapshot directories feature on Red Hat® OpenShift®Enabling skip snapshot directories feature on Red Hat® OpenShift in the IBM Storage Scale: Administration Guide.

When a scan is initiated from the IBM Spectrum Discover graphical user interface, the data moves asynchronously back to the IBM Spectrum Discover.
Remember: Before you initiate a scan, see IBM Storage Scale scanning considerations.

Automated scanning and data ingestion relies on an established and active network connection between the IBM Spectrum Discover instance and the source IBM Storage Scale management node. If the connection cannot be established, the state of the data source connection displays 'unavailable' and the option for automated scanning does not appear in the IBM Spectrum Discover GUI for that connection.

Note: You cannot run scans unless you add override warnings in the configuration file.

Procedure

  1. Log in to IBM Spectrum Discover web interface.
  2. Click menu and go to Data connections > Connections.
  3. Select the data source connection name that you want to scan. Make sure that the connection is online for your system ready to scan. (There is an indicator in the State column.)
  4. Select Scan now to start the scan, and a small message appears to confirm that the connection name you specify is being scanned.
    You can view the status of the scan on the table in the Scan Status column for the target connection. After the Scan Status has a check mark next to it, the scan is complete.
    Remember: You can also specify a time to begin the scan. Any time zones specified default to Coordinated Universal Time (UTC) time. So, if you specify your scan for 12 noon, it is 12 noon in UTC.