My IBM Log in

Unify and share data across Netezza and watsonx.data for new generative AI applications

21 June 2024

3 min read

In today’s data and AI-driven world, organizations are generating vast amounts of data from various sources. The ability to extract value from AI initiatives relies heavily on the availability and quality of an enterprise’s underlying data. In order to unlock the full potential of data for AI, organizations must be able to effectively navigate their complex IT landscapes across the hybrid cloud.

At this year’s IBM Think conference in Boston, we announced the new capabilities of IBM watsonx.data, an open data lakehouse that enables enterprises to unlock value in their existing data by connecting to existing storage and analytical environments, regardless of where their data resides. It also allows them to prepare their data for AI use cases and cost-optimize workloads with multiple fit-for-purpose query engines and low-cost object storage.

From mobile banking applications to connected cars, clients rely on IBM® databases to store and analyze their most critical data across the hybrid cloud, powering applications and analytics that operate their business every single day. IBM Netezza® Performance Server is a cloud-native enterprise data warehouse designed to operationalize deep analytics, business intelligence and machine-learning (ML) workloads by making data unified, accessible and scalable, anywhere. Today, we are excited to announce the general availability of both IBM Netezza on-premises (Cloud Pak® for Data System) and IBM Netezza SaaS integrations with watsonx.data™.

Let’s explore how the integration of Netezza with watsonx.data can now help clients modernize their data management platform to drive actionable insights with other third party data and applications for generative AI.

Unifying data for AI

Traditional data warehousing approaches often require complex ETL (Extract, Transform, Load) processes, which can be time-consuming and costly. With Netezza and watsonx.data, you can unify data for AI across the hybrid cloud without the need for ETL. This is achieved through a shared metadata layer, open table formats, and cost- effective cloud object storage. With new support for open formats such as Parquet and Apache Iceberg, Netezza empowers data engineers, data scientists and data analysts to run complex workloads without additional ETL or data movement over cloud object storage.

This integration enables Netezza on-premise customers to read and write Apache Iceberg tables stored on cost effective S3 compatible object storage either on-premise or in cloud along with joining Netezza native tables (hot/warm data) and in-frequently accessed stored in Iceberg tables (cold data). Shared metadata, storage, and open table Iceberg data formats facilitate data access and sharing across multiple Netezza instances running on premises and/or Netezza on Cloud.

With greater access to watsonx.data’s Presto and Spark open-source query engines, this integration approach also allows Netezza customers to optimizes workloads for price-performance, pairing the right workload, with the right engine, for the right cost.

Watsonx.data brings new generative AI capabilities for Netezza customers

Watsonx.data’s built-in generative AI capabilities powered by the semantic layer as part of IBM Knowledge Catalog enable Netezza customers to prepare and simplify data for new AI applications using natural language. This reduces the complexity of data preparation and enables faster time-to-insight. Watsonx.data also announced the availability of their integrated vector database, based on open source Milvus. Now, Netezza clients can unify, curate and prepare vectorized embeddings for their generative AI applications at scale across their trusted, governed data within watsonx.data. This helps enhance the relevance and precision of AI outputs, including chatbots, personalized recommendation systems and image similarity search applications.

Deploy your Netezza data warehouse with watsonx.data integration anywhere

Netezza customers running workloads both on-premises with Netezza Cloud Pak for Data System as well as Netezza SaaS customers running on AWS and Azure can now leverage this integration with watsonx.data. For on-premises Netezza clients looking to move to a hybrid cloud architecture to achieve greater flexibility and scale for AI, Netezza makes it easy to modernize to hybrid and fully managed deployments on AWS and Azure with like-for-like compatibility.

Getting started

Netezza on-premises and Netezza SaaS integration with watsonx.data provides a powerful solution for scaling analytics and AI across the hybrid cloud. With zero ETL required, you can unify data for AI, access and share data across the hybrid cloud, and store and share data for AI using open formats and low-cost object storage. By leveraging built-in generative AI capabilities and native integrations, you can simplify data preparation and accelerate your AI journey.

 

Author

Brajesh Pandey

Chief Architect

IBM Data & AI

Hemant Suri

Program Director

Data & AI