Defining a connection
You can access the local file system on the engine tier or access a Hadoop Distributed File System (HDFS) by using the WebHDFS API or the HttpFS API.
About this task
To access HDFS, you must define a connection that specifies the server name, server port number, user name, and password. Alternatively, you can specify a connection URL instead of the server name and port number.
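To illustrate how a server name and port number relate to a connection URL, the following minimal sketch builds a request URL in the shape defined by the public WebHDFS REST API, which HttpFS also exposes under the same `/webhdfs/v1` prefix. The helper name `webhdfs_url`, the host `namenode.example.com`, the port, the path, and the user `dsadm` are all hypothetical placeholders. The sketch is not a description of how the File connector builds its requests internally.

```python
from urllib.parse import quote, urlencode


def webhdfs_url(host, port, path, op="LISTSTATUS", user=None, scheme="http"):
    """Build a WebHDFS/HttpFS REST URL for one file system operation.

    Hypothetical helper for illustration only. No network call is made.
    """
    query = {"op": op}
    if user:
        # Simple (non-Kerberos) authentication passes the user as a query parameter.
        query["user.name"] = user
    # quote() keeps "/" intact and escapes characters that are unsafe in a path.
    return f"{scheme}://{host}:{port}/webhdfs/v1{quote(path)}?{urlencode(query)}"


# Placeholder values: replace them with the values for your own cluster.
print(webhdfs_url("namenode.example.com", 9870, "/user/dsadm", user="dsadm"))
```

The port depends on the cluster configuration. For example, 9870 is a common NameNode HTTP port on Hadoop 3, and 14000 is the usual default for HttpFS.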
You can use Kerberos authentication to connect to HDFS through the WebHDFS API or the HttpFS API. If you use Kerberos authentication, specify the Kerberos principal in the User name property and the password in the Password property. Optionally, you can include the realm of the principal in the User name property, for example, principal@realm. Instead of the password, you can use a Kerberos keytab file.
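As an illustration of the principal@realm format, the following sketch splits a user name value into its principal and realm parts and falls back to a default realm when none is given. The function name `split_principal` and the sample values are hypothetical. How the File connector itself interprets the value is not described here.

```python
def split_principal(user_name, default_realm=None):
    """Split 'principal@realm' into (principal, realm).

    Hypothetical helper for illustration. If no realm is present,
    default_realm is returned in its place.
    """
    name, sep, realm = user_name.rpartition("@")
    if not sep:
        # No "@" in the value: the whole string is the principal.
        return user_name, default_realm
    return name, realm


print(split_principal("dsadm@EXAMPLE.COM"))
print(split_principal("dsadm", default_realm="EXAMPLE.COM"))
```

When the realm is omitted, Kerberos clients typically use the default realm from the krb5.conf file.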
When Kerberos authentication is used, the File connector uses the krb5.conf file. Specify the Kerberos Key Distribution Center (KDC) host name and the default realm in the krb5.conf file. For more information about the krb5.conf file, see the Kerberos documentation.
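The two krb5.conf settings named above can be sketched as follows. This minimal example renders a krb5.conf file that contains only a default realm and one KDC host, using the standard `[libdefaults]` and `[realms]` sections. The function name `render_krb5_conf`, the realm `EXAMPLE.COM`, and the host `kdc.example.com` are hypothetical placeholders. A production file usually contains more settings.

```python
def render_krb5_conf(default_realm, kdc_host):
    """Return the text of a minimal krb5.conf file.

    Hypothetical helper for illustration: it sets only the default realm
    and a single KDC host for that realm.
    """
    return (
        "[libdefaults]\n"
        f"    default_realm = {default_realm}\n"
        "\n"
        "[realms]\n"
        f"    {default_realm} = {{\n"
        f"        kdc = {kdc_host}\n"
        "    }\n"
    )


# Placeholder values: replace them with your own realm and KDC host name.
print(render_krb5_conf("EXAMPLE.COM", "kdc.example.com"))
```

Realm names are conventionally written in uppercase, and the realm in the `[realms]` section must match the `default_realm` value exactly.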
Procedure
Option | Procedure |
---|---|
Access the local file system | |
Access a file system on HDFS by using the WebHDFS API or the HttpFS API | |