Moving data by using InfoSphere Data Click
You can copy selected database tables, data files, data file folders, and Amazon S3 buckets from the catalog to a target distributed file system, such as a Hadoop Distributed File System (HDFS) in IBM® InfoSphere® BigInsights®, by running InfoSphere Data Click activities.
Before you begin
You must have the following roles to run InfoSphere Data Click activities:
- Data Click Author and Data Click User
- Suite User
- DataStage Developer
- Common Metadata User
Your user account must be assigned to the default InfoSphere DataStage® project DataClick.
You must have any Information Governance Catalog security role.
Procedure
- Click Catalog.
- Click Information Assets.
- In the Select an asset type field, type in Database Tables, Data Files, Data File Folders, or Amazon S3 Buckets.
-
From the list of assets, select the assets to move, and then select Copy
from the menu.
InfoSphere Data Click opens in a new web browser window.
- Complete the wizard pages by specifying a name for the activity, providing the data connection information for the source and target, and reviewing the Hive table information. Click Finish.
- Click Run. The activity is submitted for processing. When the job is processed, data is copied from the source that you selected and moved to the target that you selected.