What is IBM Db2 Big SQL?
IBM Db2® Big SQL is an enterprise-grade, hybrid, ANSI-compliant SQL-on-Hadoop engine that delivers massively parallel processing (MPP) and advanced data querying. Db2 Big SQL offers a single database connection or query for disparate sources such as Hadoop HDFS and WebHDFS, RDBMS, NoSQL databases and object stores. Benefit from low latency, high performance, security, SQL compatibility and federation capabilities for ad hoc and complex queries.
Announcement: Announcing IBM Db2 Big SQL v5.04 on Cloudera’s CDH v6.2 Platform
Read the latest blog: Making the connection: how SQL on Hadoop brings together data for deeper insight
Product benefits

Access, query and analyze data across storage platforms
Db2 Big SQL understands commonly used ANSI SQL syntax, so you can query batch and real-time data across Hadoop, object stores and data warehouses.

Scale with hybrid cloud-ready flexibility
Shift workloads within public and private cloud and on-premises environments based on your application requirements.

Drive real-time analytics with Apache Spark integration
Run Hadoop SQL queries concurrently across Hive, HBase and Spark, using a single database connection or even a single query.
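To make the "single query" idea concrete, here is a minimal sketch that builds one ANSI SQL statement joining a Hive-backed table to an HBase-backed table. The function name, table names and column name are hypothetical and chosen only for illustration. The sketch only assembles the SQL text and does not connect to a database.

```python
def build_cross_source_query(hive_table: str, hbase_table: str, join_key: str) -> str:
    """Return one ANSI SQL statement joining two tables on a shared key.

    All names passed in are hypothetical; in practice they would be
    whatever tables are already defined in the catalog.
    """
    for name in (hive_table, hbase_table, join_key):
        # Accept only simple, optionally schema-qualified identifiers,
        # so the string formatting below cannot inject arbitrary SQL.
        if not all(part.isidentifier() for part in name.split(".")):
            raise ValueError(f"unsafe identifier: {name!r}")
    return (
        f"SELECT h.{join_key}, COUNT(*) AS event_count\n"
        f"FROM {hive_table} AS h\n"
        f"JOIN {hbase_table} AS b ON h.{join_key} = b.{join_key}\n"
        f"GROUP BY h.{join_key}"
    )


# Hypothetical example: web events stored in Hive, customer profiles in HBase.
query = build_cross_source_query(
    "sales.web_events", "sales.customer_profiles", "customer_id"
)
print(query)
```

The resulting statement is plain SQL, so it could be submitted over one database connection with whichever client driver is already in use. The storage format behind each table stays invisible to the query author.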

Connect your data scientists to their data
Use IBM Watson® Studio and existing Jupyter Notebooks to federate to RDBMS sources such as Oracle, Db2 and IBM Netezza®.

Access data where it resides
IBM federates data natively with the Db2 product family, including the Db2 AI database. When not all data can be moved into your Hadoop system, Db2 Big SQL can federate the data across the enterprise.

Improve query performance
Run all 99 TPC-DS queries at scales up to 100 TB with numerous concurrent users. Db2 Big SQL supports multiple workers per node for efficient CPU and memory utilization.