IBM Spark

Power of data. Simplicity of design. Speed of innovation.


Try IBM Analytics for Apache® Spark™ as a service now.

What is Spark?

Apache® Spark™ is an open-source cluster computing framework with in-memory processing to speed analytic applications up to 100 times faster compared to technologies on the market today. Developed in the AMPLab at UC Berkeley, Apache Spark can help reduce data interaction complexity, increase processing speed and enhance mission-critical applications with deep intelligence.

Highly versatile in many environments, Apache Spark is known for its ease of use in creating algorithms that harness insight from complex data. Spark was elevated to a top-level Apache Project in 2014 and continues to expand today.

IBM is committing to the Apache Spark project with investments in design-led innovation and broad-scale education programs to promote open source innovation and accelerate intelligence into every application.

Get the ebook to learn more about Apache Spark

Creating value with Spark and Hadoop

Apache Spark enables you to extend and complement your Apache® Hadoop™ investment. While Hadoop technology is designed to manage and store big data, Spark is the analytics engine that can be used to help you maximize your existing Hadoop investment. Using Spark and Hadoop, you can dramatically reduce time to analytics and generate previously hidden insights from your data.

Learn more about about Hadoop

Datapalooza – Rock Your Data

Coming soon to a city near you

Datapalooza is a 2.5-day immersive experience with a diverse community of data professionals, listen to some great music and create inspirational data products.

Learn more

IBM and Apache Spark: The start of something big in data and design

It’s not just about data access anymore. It’s about building algorithms that put analytics into action. It's about changing data science and driving intelligent apps fueled by data. Combining data, design and speed, IBM and Apache Spark are creating a new blueprint of innovation.

IBM and Spark. Power of data. Simplicity of design. Speed of innovation.

Watch the video

IBM and Spark

Power of data. Simplicity of design. Speed of innovation.

Power of data. Simplicity of design. Speed of innovation

Today is the start of something big in data and design. This is IBM and Apache Spark. Together, we are on fire and changing the role of data and analytics in organizations. It's not just about data access anymore, it's about building algorithms that put analytics into action.

As part of its commitment to Apache Spark, IBM will:

Get started

See what our customers are saying…

“IBM Analytics for Apache Spark provides us with a perfect sandbox for Spark development at Semblent. It's a managed service that enables us to quickly create Apache Spark applications. We got up and running within a day.”

- Sam Forster, CEO of Semblent Group

“With Analytics for Apache Spark, we’ll be able to work with IBM to develop promising new ways to analyze signal data as we hunt for evidence of intelligence elsewhere in the cosmos.  This is an exciting example of synergy in the service of science.”

- Dr. Seth Shostak, Senior Astronomer and Director of the Center for SETI Research

“Spark provided a one stop shop for data preparation and exploratory analytics which enabled our data scientists to conceive viable new product ideas in matter of weeks instead of months.”

- Nilesh Saratkar, Quest Diagnostics

Introducing a Universal Translator for Big Data and Machine Learning

IBM has committed its machine learning platform SystemML to the open source community. SystemML has been recognized as an official Apache Incubator project—giving it the name Apache SystemML. SystemML enables developers who don’t have expertise in machine learning to embed it in their applications once and use it in industry-specific scenarios on a wide variety of computing platforms, from mainframes to smartphones.

Learn more Read the white paper

A data scientist ponders over equations on a chalkboard.

IBM + Spark in the Community

Hear how IBM and Spark are driving insights and accelerating Spark innovation.

Panel Discussion

Lightning talks

IBM Spark Technology Center

Introducing the IBM Spark Technology Center in San Francisco, CA, the hub for the data science community to collaborate and accelerate Spark development.

IBM Business Partners

A partner ecosystem is essential to propel Apache Spark into the data science mainstream.

      1-877-426-3774 Priority code: Analytics Solutions

  • LP dyn button

  Follow us