Connecting to a WebHDFS Location

Hadoop Distributed File System (HDFS) is a core component of Apache Hadoop, a framework for distributed storage and processing of very large data sets. Internally, it uses the WebdHDFS REST API, which is built into the default Hadoop installation of Hortonworks HDFS.

File name The address of the namenode server of the HDFS. It starts with the protocol prefix “hdfs://” followed by the hostname and the port.

File name example: hdfs://12.34.56.78:8020

Host Path of the file hosts on Windows: ../system32/drivers/etc/hosts
Path of the file hosts on Linux: /etc/hosts

Path of file host to external URLs: <IP>sandbox-hdp.hortonworks.com
Example: 13.93.26.183 sandbox-hdp.hortonworks.com

Port Port for the host connection.

Supported distributions

See Systems Supported by Jedox Integrator for currently supported distributions. If you are interested in other distributions or other versions, please contact Jedox Support.

By default, HDFS runs in non-secure mode, which requires no actual authentication. Secure mode, whereby each user and service has to be authenticated by Kerberos, is currently not supported in the Hadoop package for Jedox Integrator.

Updated September 25, 2024