Developing Db2 Big SQL applications in your Hadoop environment

Db2 Big SQL provides a way to efficiently query data that is stored on a Hadoop cluster.

The IBM® Big SQL server has the following attributes:

  • Takes advantage of HCatalog to support a wide variety of data source formats, such as delimited files, ORC, Parquet, Avro, JSON, and sequence files.
  • Shares catalogs with Hive by using the Hive Metastore, and therefore Db2 Big SQL can query Hive tables and Hive can query Db2 Big SQL tables.
  • Supports HBase table manipulation.

Db2 Big SQL statements are run by the Db2 Big SQL server on your cluster against data on your cluster. The server transforms the specification of a result table into a sequence of internal operations that optimize data retrieval. The server is multi-threaded and is limited only by the available memory and processors.

Db2 Big SQL includes support for SQL Procedural Language (SQL PL), stored procedures, and functions. It includes the IBM Db2 Big SQL JDBC Driver V3.0.0 for a JDBC connection. It uses advanced IBM SQL compiler and runtime technologies that improves performance.