IBM and Hortonworks partnership

IBM and Hortonworks Inc. are offering an enterprise-grade Hadoop distribution with data integration, federation and advanced querying tools, combining the best of Hortonworks Data Platform, Hortonworks Data Flow and Db2 Big SQL. This offers enterprise-grade scalability, security and governance, and an ability to federate data at rest and data in motion.

IBM and Hortonworks improve insight discovery, testing and ad hoc and near real-time queries, supporting predictive and prescriptive analytics.


Data accessibility iconography

Unify data from more sources and formats

Secure and enterprise-ready, Apache Hadoop distribution powers near real-time applications and analytics — on premises or in the cloud. Deploy, integrate and analyze massive volumes of structured, semi-structured and unstructured data.

Data preparation iconography

Federate and query virtually any data

The highly scalable, enterprise-grade SQL for Hadoop concurrently exploits Apache Hive, HBase and Spark, using a single query or database connection that reduces latency and supports ad hoc and complex queries.

Agility cycle iconography

Drive machine learning and advanced analytics

Create new analytic models quickly and easily in a collaborative environment. Build and train machine-learning models and prepare and analyze data in a flexible hybrid cloud environment.


Apache Hadoop

Manage large volumes and different types of data, with open-source Hadoop. Tap into unmatched performance, simplicity and standards compliance to use all data, regardless of where it resides.

Apache Spark

Build algorithms quickly, iterate faster and put analytics into action with Spark. Easily create models that capture insight from complex data, and apply that insight in time to drive outcomes.

Stream computing

Stream computing enables you to process data streams, which are always on and never cease. This helps them spot opportunities and risks across all data in time to effect change.

Governance and metadata tools

Governance and metadata tools enable you to locate and retrieve information about data objects, in addition to their meaning, physical location, characteristics and usage.


IBM Hosted Analytics with Hortonworks

A single solution to store, explore and score big data, consisting of the Hortonworks Data Platform (HDP), IBM Db2 Big SQL and IBM Watson® Studio



A hybrid SQL on Hadoop engine, providing low-latency support for ad hoc and complex queries and connecting disparate sources, using a single database connection

IBM Watson Studio

Tools for data scientists, application developers and business users to help them work collaboratively with data to build and train models at scale

IBM Big Replicate

Replicates data as it streams in — so files don’t need to be fully written or closed before transfer — with enterprise-class replication for Hadoop and object storage


Data lake: Taming the data dragon

Learn how to leverage the data lake infrastructure to take data from anywhere, govern it everywhere and create value for everyone.

Connect more data from more sources with a data lake

Learn more about the new types and sources of data that can be leveraged by integrating data lakes into your existing hybrid data management strategy.

Making Sense of Big Data

Learn about challenges confronting today’s enterprise architect, including working with new data sources, more data projects and platforms.

Engage with an expert

Schedule a no-cost, one-on-one call with an experienced IBM expert

Learn about the IBM products, solutions and services available to help you build and grow a successful data lake.