Creating a Hive data source

You can create Hive data sources to work with Apache Hadoop, an open source software framework used to reliably manage large volumes of structured and unstructured data.

Before you begin

Make sure you have defined the Hive driver library JAR files so that QMF can connect to Hive data sources.

About this task

Hive is a data warehouse infrastructure that provides data summarization and ad hoc querying. Hive data sources are accessed through special JDBC drivers. The current JDBC interface for Hive supports only running queries and fetching results. For the steps to add drivers and connect as a relational data source, see Creating the JDBC driver configuration file.

To create a Hive data source:

Procedure

  1. Select File > New > Other > Repository > Hive Data Source. The Create New Hive Data Source wizard opens.
  2. On the availability type selection page, select Personal or Shared, and then click Next.
  3. Type the data source name in the Data Source Name field.
  4. Specify the necessary parameters in the Connection Parameters area. You must set the Host name, Port number, and Database name.
  5. Click Set User Information to specify the necessary user parameters.
  6. Click Advanced to select the advanced parameters supported by the installed Hive driver.
  7. Optional: In the Description field, enter a description of the data source.
  8. Click Finish to create the Hive data source and close the Create New Hive Data Source wizard.
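
For orientation, the three required connection parameters from step 4 (Host name, Port number, and Database name) are the same pieces that typically make up a Hive JDBC URL. The following is a minimal sketch, not QMF code: the class and method names are hypothetical, and the jdbc:hive2:// URL form assumes the Apache Hive JDBC driver for HiveServer2. The driver installed in your environment might expect a different scheme, so check its documentation.

```java
import java.util.Objects;

// Hypothetical helper, not part of QMF. It only illustrates how the wizard's
// Host name, Port number, and Database name could combine into a JDBC URL,
// assuming the Apache Hive (HiveServer2) URL form jdbc:hive2://host:port/database.
final class HiveUrlSketch {

    private HiveUrlSketch() {
        // Utility class; no instances.
    }

    static String buildUrl(String host, int port, String database) {
        Objects.requireNonNull(host, "host");
        Objects.requireNonNull(database, "database");
        if (host.isEmpty() || database.isEmpty()) {
            throw new IllegalArgumentException("host and database must not be empty");
        }
        if (port < 1 || port > 65535) {
            throw new IllegalArgumentException("port out of range: " + port);
        }
        return "jdbc:hive2://" + host + ":" + port + "/" + database;
    }

    public static void main(String[] args) {
        // Placeholder values: hive.example.com is an example host, and 10000
        // is the usual default HiveServer2 port.
        System.out.println(buildUrl("hive.example.com", 10000, "default"));
    }
}
```

With the placeholder values shown in main, the sketch produces jdbc:hive2://hive.example.com:10000/default. In QMF itself you do not write this URL by hand; the wizard collects the individual fields.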