Refining data on the Hadoop cluster in Data Refinery
Refine data directly on the Hadoop cluster to take advantage of Hadoop's support for large data sets.
These Execution Engine for Hadoop connections are supported for refining data in a Hadoop environment:
- HDFS via Execution Engine for Hadoop: Hadoop Distributed File System (HDFS) files
- Hive via Execution Engine for Hadoop: data that is stored in tables in a Hive warehouse
- Impala via Execution Engine for Hadoop: data that is stored in tables in Impala on the Hadoop cluster
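
As a rough illustration of the list above, the mapping from the kind of Hadoop data to the connection type can be sketched as a simple lookup. This is not part of Data Refinery or any product API: the keys (`hdfs_file`, `hive_table`, `impala_table`) and the helper `connection_for` are hypothetical names chosen for this example, and only the connection names themselves come from the list.

```python
# Hypothetical lookup table: the keys are illustrative labels invented for
# this sketch, not product identifiers. The values are the connection names
# from the list above.
CONNECTION_FOR_SOURCE = {
    "hdfs_file": "HDFS via Execution Engine for Hadoop",
    "hive_table": "Hive via Execution Engine for Hadoop",
    "impala_table": "Impala via Execution Engine for Hadoop",
}


def connection_for(source_kind: str) -> str:
    """Return the connection name for a kind of Hadoop data source.

    Raises ValueError for a kind that is not in the lookup table.
    """
    try:
        return CONNECTION_FOR_SOURCE[source_kind]
    except KeyError:
        supported = ", ".join(sorted(CONNECTION_FOR_SOURCE))
        raise ValueError(
            f"Unsupported source kind {source_kind!r}; expected one of: {supported}"
        ) from None


if __name__ == "__main__":
    # Example: data stored in tables in a Hive warehouse.
    print(connection_for("hive_table"))
```

A lookup table keeps the three supported cases in one place, so an unsupported kind fails with a clear error instead of silently picking a connection.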