Db2 Big SQL

Version: 8.2.2

Experience: Cloud Pak for Data

Description

Db2 Big SQL is a cloud-native, elastic, scalable SQL engine optimized for workloads on data stored in object stores or in Hadoop Distributed File System (HDFS).

Db2 Big SQL can query data stored on legacy Hadoop clusters, using the configurations of open source components, such as:

  • HDFS
  • Hive metastore
  • Ranger

Db2 Big SQL can be useful in the following situations:

  • You need to query large amounts of data residing on legacy Hadoop secured (Kerberized) or unsecured clusters and on private or public cloud object storage.
  • You need highly optimized queries for multiple open source data formats, including Parquet, ORC, Avro, and CSV.

Licensing information

This service is included in the following licenses:

  • IBM Cloud Pak® for Data Enterprise Edition
  • IBM Cloud Pak for Data Standard Edition

For more information, see Licenses and entitlements.

Quick links

Integrated services

Table 1. Related services. The following related services are often used with this service and provide complementary features, but they are not required.
Service Capability
IBM Db2 Data Management Console Administer, monitor, manage, and optimize the performance of your IBM Db2 databases.
Runtime 24.1 on Python 3.11 for GPU Access compute environments for Jupyter Notebooks that use GPU-accelerated Python 3.11 libraries.
Watson Studio Prepare, analyze, and model data in a collaborative environment with tools for data scientists, developers, and domain experts.