Db2 Big SQL on Cloud Pak for Data
Version: 7.5.3 | Included | IBM
Description
Db2® Big SQL is a cloud-native, elastic, scalable SQL engine optimized for workloads on data stored in object stores or HDFS.
Db2 Big SQL can query data stored on legacy Hadoop clusters by using the configurations of the clusters' open source components, such as:
- HDFS
- Hive metastore
- Ranger
Using Db2 Big SQL with IBM Cloud Pak® for Data can be useful in the following situations:
- You need to query large amounts of data that reside on legacy Hadoop clusters, whether secured (Kerberized) or unsecured, and on private or public cloud object storage.
- You need highly optimized queries for multiple open source data formats, including Parquet, ORC, Avro, and CSV.
Quick links
- Architecture: View the components and software
- Prepare: Prepare to install the service
- Install: Install the service
- Set up: Set up the service after installation
- Upgrade: Upgrade the service
- Administer: Manage and maintain the service
- Use: Work with the service
- What's new: See a list of new features
- Known issues: View limitations
- Troubleshoot: Find solutions to problems
Integrated services
Service | Capability |
---|---|
IBM® Db2 Data Management Console | Administer, monitor, manage, and optimize the performance of your IBM Db2 databases. |
Runtime 22.2 on Python 3.10 for GPU | Access compute environments for Jupyter Notebooks that use GPU-accelerated Python 3.10 libraries. |
Watson™ Studio | Prepare, analyze, and model data in a collaborative environment with tools for data scientists, developers, and domain experts. |