Creating a Spark data source
You can create Spark data sources to reliably manage large volumes of structured and unstructured data.
Before you begin
- Make sure that you have defined the Hive driver library JAR files so that QMF can connect to Spark data sources. Use the Hive driver library JAR files of version 1.2.1 or later.
- Ensure that the Spark Thrift server is up and running.
About this task
Spark data source is a data warehouse infrastructure that provides data summarization and ad hoc querying. Spark data sources are accessed using special drivers for JDBC. The current JDBC interface for Spark only supports running queries and fetching results. For steps to add drivers and connecting as a relational data source, see Creating the JDBC driver configuration file.
To create a Spark data source: