Creating a Hive data source
You can create Hive data sources to work with Apache Hadoop, an open source software framework for reliably managing large volumes of structured and unstructured data.
Before you begin
About this task
Hive is a data warehouse infrastructure built on Hadoop that provides data summarization and ad hoc querying. Hive data sources are accessed through special JDBC drivers. The current JDBC interface for Hive supports only running queries and fetching results.
To create a Hive data source:
Procedure
- In the Repositories view, on the Data Sources tab, click Create a Data Source. The Create New Data Source wizard opens.
- In the wizard, select Hive and click Next.
- Type the data source name in the Data Source Name field.
- Specify the necessary parameters in the Connection Parameters area. You must set the Host name, Port number, and Database name.
- Click Set User Information to specify the necessary user parameters.
- Click Advanced to select the advanced parameters supported by the installed Hive driver.
- In the Comment field, optionally enter a description of the data source.
- Click Finish to create the Hive data source and close the Create Hive Data Source wizard.
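For orientation, the sketch below shows how the three required connection parameters (host name, port number, and database name) typically combine into a JDBC URL, and what "running queries and fetching results" looks like through plain JDBC. It is a minimal illustration, not a description of what the wizard does internally. It assumes the Apache Hive JDBC driver for HiveServer2 is on the classpath and that the driver accepts the jdbc:hive2:// URL scheme. The class name HiveJdbcSketch, its method names, and the host hive.example.com are hypothetical placeholders.

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.SQLException;
import java.sql.Statement;
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch only: the class and method names are hypothetical.
class HiveJdbcSketch {

    // Combine the three required connection parameters into a JDBC URL.
    // Assumption: the installed driver uses the Apache "jdbc:hive2://" scheme.
    static String buildUrl(String host, int port, String database) {
        if (host == null || host.isEmpty()) {
            throw new IllegalArgumentException("host name is required");
        }
        if (port < 1 || port > 65535) {
            throw new IllegalArgumentException("port number must be between 1 and 65535");
        }
        if (database == null || database.isEmpty()) {
            throw new IllegalArgumentException("database name is required");
        }
        return "jdbc:hive2://" + host + ":" + port + "/" + database;
    }

    // Run one query and fetch the first column of every row as text.
    // Requires a reachable Hive server and a Hive JDBC driver on the classpath.
    static List<String> fetchFirstColumn(String url, String user, String password, String sql)
            throws SQLException {
        List<String> values = new ArrayList<>();
        try (Connection conn = DriverManager.getConnection(url, user, password);
             Statement stmt = conn.createStatement();
             ResultSet rs = stmt.executeQuery(sql)) {
            while (rs.next()) {
                values.add(rs.getString(1));
            }
        }
        return values;
    }

    public static void main(String[] args) {
        // Placeholder values; substitute the values entered in the wizard.
        System.out.println(buildUrl("hive.example.com", 10000, "default"));
    }
}
```

Running main only prints the assembled URL and does not contact a server. Calling fetchFirstColumn with that URL, the credentials entered under Set User Information, and a query such as SHOW TABLES would return the first column of each result row, provided the server is reachable.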