Connecting to data sources
Create data connections or browse existing data connections and use them to discover assets.
About this task
You can create connections to the following data sources:
- HDFS by using the HDFS connector
- Db2 by using the JDBC connector
- Oracle by using the JDBC connector
- Microsoft SQL Server by using the JDBC connector
- Teradata by using the JDBC connector
Hive and PostgreSQL data sources are also supported. However, connections to these data sources must be configured in InfoSphere® Metadata Asset Manager. You cannot create connections to such data sources here.
Procedure
- Go to Connections.
- To use an existing data connection, in the Browse tab, select the data connection that you want to use.
- Optional: To update the connection password, from the menu select Details.
- Optional: To check what assets are included in the connection, from the menu select Browse. This action is supported for file-based connections only. When you browse a connection, a list of folders and files is displayed. You can preview the content of the files or add them to a project. To change the format in which files are previewed and added to projects, configure the file format settings. You can display the first row in a file as a header, and adjust the delimiter and character encoding.
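To illustrate what the file format settings control, the following minimal Python sketch parses the same bytes with a configurable delimiter, character encoding, and header option. It is not the product's implementation. The function name `preview`, its parameters, and the generated `column_N` names are hypothetical and exist only for this illustration.

```python
import csv
import io


def preview(raw, delimiter=",", encoding="utf-8",
            first_row_is_header=True, max_rows=5):
    """Return (header, rows) for a delimited file held in memory.

    Hypothetical helper: it mirrors the three settings named above
    (header row, delimiter, character encoding), nothing more.
    """
    text = raw.decode(encoding)
    rows = list(csv.reader(io.StringIO(text, newline=""), delimiter=delimiter))
    if not rows:
        return [], []
    if first_row_is_header:
        header, data = rows[0], rows[1:]
    else:
        # No header row: generate placeholder column names (an assumption).
        header = [f"column_{i + 1}" for i in range(len(rows[0]))]
        data = rows
    return header, data[:max_rows]


sample = "id;name\n1;Zoë\n".encode("latin-1")
header, data = preview(sample, delimiter=";", encoding="latin-1")
```

Parsing the same sample with the wrong delimiter or encoding yields a single merged column or garbled characters, which is why these settings are worth checking before you add a file to a project.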
- To create a connection, click Create connections.
- Enter the name of the connection and optionally a description.
- To create a new HDFS connection:
- From the Choose connection menu, select HDFS.
- Specify the name, and optionally a description.
- Select the file system from which you want to import metadata, either WebHDFS or HttpFS.
- Specify the host that provides a REST HTTP gateway that supports the HDFS file system operations. The host is on the name node in WebHDFS, and on the name node or the edge node in HttpFS.
- Specify the credentials to access the file system.
- Optional: Specify the port number. If you don't specify the port, the following ports are used:
- If you don't select Use SSL, the port 50070 is used for WebHDFS, and the port 14000 is used for HttpFS.
- If you select Use SSL, the port 50470 is used for WebHDFS, and the port 14443 is used for HttpFS.
- Optional: Select Use SSL to use Secure Sockets Layer (HTTPS).
- Optional: Select Use Kerberos to use Kerberos authentication.
In the Service principal field, specify the Kerberos service principal name (SPN) that you want to use for the host. Use this property if the realm of the host is different from the realm of the user. This property is used for authenticating across all of your domains. When you specify the service principal for the web application server, you must specify the fully qualified domain name (FQDN) principal of the WLE server with the realm. For example, HTTP/testmach.austin.ibm.com.
When you select the Use keytab option, in the Keytab field, specify the path and name of the keytab file to be used instead of the password. For example, /etc/security/keytabs/user1.keytab.
- Optional: Select Use custom URL to use a custom URL instead of the one that is generated based on the values that you specify for Use SSL, Host, and Port. In the Custom URL field, specify the base URL for the server, either HTTP or HTTPS.
- Click Test connection. You receive a confirmation message if your connection is successful.
- If your connection is successful, click Save connection.
Your connection is ready.
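The way the Use SSL, Host, Port, and Use custom URL settings combine can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the connector's actual logic: the function name `base_url` and its parameters are hypothetical, and only the default port numbers are taken from the steps above.

```python
from typing import Optional

# Default ports from the steps above: (file system, Use SSL) -> port.
DEFAULT_PORTS = {
    ("WebHDFS", False): 50070,
    ("WebHDFS", True): 50470,
    ("HttpFS", False): 14000,
    ("HttpFS", True): 14443,
}


def base_url(file_system: str, host: str, use_ssl: bool = False,
             port: Optional[int] = None,
             custom_url: Optional[str] = None) -> str:
    """Return the base URL implied by the connection settings (hypothetical helper)."""
    if custom_url:
        # A custom URL replaces the URL generated from Use SSL, Host, and Port.
        return custom_url.rstrip("/")
    if (file_system, use_ssl) not in DEFAULT_PORTS:
        raise ValueError(f"Unsupported file system: {file_system!r}")
    if port is None:
        port = DEFAULT_PORTS[(file_system, use_ssl)]
    scheme = "https" if use_ssl else "http"
    return f"{scheme}://{host}:{port}"


# Example host names are placeholders.
print(base_url("WebHDFS", "namenode.example.com"))
print(base_url("HttpFS", "edge.example.com", use_ssl=True))
```

With these inputs, the sketch prints `http://namenode.example.com:50070` and `https://edge.example.com:14443`. If Test connection fails, comparing the URL that you expect with the host and port that your cluster actually exposes is a reasonable first check.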
For more information about HDFS connections, see the File connector data connection prerequisites and parameters topic.
- To create a new Db2 connection:
- From the Choose connection menu, select Db2.
- Create a name for your data connection, enter the JDBC host, and the credentials to access the database.
- Click Test connection. You receive a confirmation message if your connection is successful.
- If your connection is successful, click Save connection.
Your connection is ready.
- To create a new Oracle connection:
- From the Choose connection menu, select Oracle.
- Enter a service name, the JDBC host, and the credentials to access the database.
- Click Test connection. You receive a confirmation message if your connection is successful.
- If your connection is successful, click Save connection.
Your connection is ready.
- To create a new Microsoft SQL Server connection:
- From the Choose connection menu, select SQL Server.
- Create a name for your data connection, enter an instance name, the JDBC host, and the credentials to access the database.
- Click Test connection. You receive a confirmation message if your connection is successful.
- If your connection is successful, click Save connection.
Your connection is ready.
- To create a new Teradata connection:
- From the Choose connection menu, select Teradata.
- Create a name for your data connection, enter the database name, the JDBC host, and the credentials to access the database.
- Click Test connection. You receive a confirmation message if your connection is successful.
- If your connection is successful, click Save connection.
Your connection is ready.
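The JDBC-based procedures above differ only in the values that each data source asks for. The following Python sketch summarizes those values as a checklist that you might run before you click Test connection. It is a hedged illustration: the field keys, the split of credentials into `username` and `password`, and the function `missing_fields` are assumptions made for this example and are not part of the product.

```python
# Values requested in the steps above, per data source.
# Key names are hypothetical; "username" and "password" stand in for
# "the credentials to access the database".
REQUIRED_FIELDS = {
    "Db2": ["name", "host", "username", "password"],
    "Oracle": ["service_name", "host", "username", "password"],
    "SQL Server": ["name", "instance_name", "host", "username", "password"],
    "Teradata": ["name", "database_name", "host", "username", "password"],
}


def missing_fields(source_type, values):
    """Return the required fields that are absent or empty for a source type."""
    required = REQUIRED_FIELDS[source_type]
    return [field for field in required if not values.get(field)]


# Placeholder values for illustration only.
draft = {"service_name": "ORCL", "host": "db.example.com", "username": "analyst"}
print(missing_fields("Oracle", draft))
```

For the draft shown, the sketch reports `['password']`, which indicates that the password still needs to be supplied before the connection can be tested.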