Moving data by using InfoSphere Data Click

You can copy selected database tables, data files, data file folders, and Amazon S3 buckets from the catalog to a target distributed file system, such as a Hadoop Distributed File System (HDFS) in IBM® InfoSphere® BigInsights®, by running InfoSphere Data Click activities.

Before you begin

You must have the following roles to run InfoSphere Data Click activities:
  • Data Click Author and Data Click User
  • Suite User
  • DataStage Developer
  • Common Metadata User

Your user account must be assigned to the default InfoSphere DataStage® project, DataClick.

You must also have at least one InfoSphere Information Governance Catalog security role.
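
The prerequisites above can be expressed as a simple checklist. The following Python sketch is illustrative only: it does not call any InfoSphere API, and the function name missing_prerequisites and its parameters are hypothetical. It assumes that you have already looked up the user's suite roles, Information Governance Catalog roles, and DataStage project assignments by some other means, for example in the administration console.

```python
# Role and project names come from the prerequisites in this topic.
# The function and its inputs are hypothetical; no InfoSphere API is called.
REQUIRED_ROLES = frozenset({
    "Data Click Author",
    "Data Click User",
    "Suite User",
    "DataStage Developer",
    "Common Metadata User",
})
REQUIRED_PROJECT = "DataClick"


def missing_prerequisites(user_roles, igc_roles, projects):
    """Return a list of unmet prerequisites; an empty list means none are missing."""
    # Report each required role that the user does not hold, in a stable order.
    problems = [
        f"missing role: {role}"
        for role in sorted(REQUIRED_ROLES - set(user_roles))
    ]
    # Any one Information Governance Catalog security role is sufficient.
    if not igc_roles:
        problems.append("no Information Governance Catalog security role")
    # The account must be assigned to the default DataStage project.
    if REQUIRED_PROJECT not in projects:
        problems.append(f"not assigned to project: {REQUIRED_PROJECT}")
    return problems
```

An empty result means that every listed prerequisite is satisfied; otherwise, each entry names one role or assignment that an administrator still needs to grant.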

Procedure

  1. Click Catalog.
  2. Click Information Assets.
  3. In the Select an asset type field, type Database Tables, Data Files, Data File Folders, or Amazon S3 Buckets.
  4. From the list of assets, select the assets to move, and then select Copy from the menu.
    InfoSphere Data Click opens in a new web browser window.
  5. Complete the wizard pages by specifying a name for the activity, providing the data connection information for the source and target, and reviewing the Hive table information. Click Finish.
  6. Click Run.
    The activity is submitted for processing. When the job runs, the data is copied from the source that you selected to the target that you selected.

What to do next

After you run an activity, you can close the InfoSphere Data Click window and return to InfoSphere Information Governance Catalog. To open InfoSphere Data Click again to view, delete, or check the status of activities, click Catalog > Data Integration > Manage and Monitor My Provisioning Activities.