Apache HDFS Connection

The Apache HDFS connection type enables connection to Apache HDFS.

Default Connection IDs

Web HDFS Hook uses parameter webhdfs_conn_id for Connection IDs and the value of the parameter as webhdfs_default by default.

Configuring the Connection

Host

The host to connect to, it can be local, yarn or an URL. For Web HDFS Hook it is possible to specify multiple hosts as a comma-separated list.

Port

Specify the port in case of host be an URL.

Login

Effective user for HDFS operations (non-Kerberized).

Extra (optional, connection parameters)

Specify the extra parameters (as json dictionary) that can be used in Web HDFS connection. The following extra parameters can be used to configure SSL for Web HDFS Hook:

Was this entry helpful?