Apache Spark

June 14, 2018

How to Layout Big Data in IBM Cloud Object Storage for Spark SQL

When you have vast quantities of rectangular data, the way you lay it out on object storage systems like IBM Cloud Object Storage (COS) makes a big difference to both cost and performance of SQL queries. However, this task is not as simple as it sounds. Here we survey some tricks of the trade.

Continue reading

June 5, 2018

Analyze and visualize open data with Apache Spark

Many government agencies and public administrations offer access to data, contributing to open data. Using IBM Watson Studio with Jupyter Notebooks and Apache Spark it is simple to retrieve, combine and analyze data from different sources. The result can be easily visualized. Learn what it takes with this IBM Cloud solution tutorial.

Continue reading

April 3, 2018

IBM Analytics Engine is now available in the London DC

The IBM Analytics Engine team is excited to announce the General Availability (GA) of IBM Analytics Engine, the next generation of IBM’s Apache Spark and Apache Hadoop cloud service in the London DC.

Continue reading

February 26, 2018

Putting the engine to work: how IBM Analytics Engine can help you harness Hadoop and Spark for business benefit

For many companies, the potential of big data analytics may seem both exciting and overwhelming. Technologies like Hadoop and Spark promise to unearth new sources of value from the vast mountains of unstructured data your business generates every day. The newfound opportunity to get insight from data that has been dormant for years could act as an energy source to power the next phase of business growth.

Continue reading

November 17, 2017

Hitting the ground running: how to get your data science initiatives off to a flying start

Data science is rapidly being established as the new frontier for analytics, as it moves from niche interest to the mainstream. Combining elements of statistics, computer science, applied mathematics and visualization, it offers a powerful new set of tools and techniques to enable more effective decision-making.

Continue reading

November 2, 2017

Accelerate analytics development and data science with IBM Analytics Engine

The launch of IBM® Analytics Engine marks the start of a new stage in the evolution of big data analytics—which makes it the perfect time for you to reconsider your analytics architecture. If you are struggling to transform big data into business insight, or your company’s adoption of Hadoop and Spark seem to be stalling, please read on to learn more about what IBM Analytics Engine can do for you.

Continue reading

November 2, 2017

Accelerate to AI, data-driven business with IBM Watson Data Platform

IBM announced a series of upgrades and new offerings to Watson Data Platform, an integrated set of tools, services and data in the IBM Cloud that enables data scientists, developers and business teams to gain intelligence from data.

Continue reading

September 19, 2017

IBM Analytics Engine beta goes live

We are excited to announce the beta of the IBM Analytics Engine, providing a single Hadoop and Spark service under the Watson Data Platform. It makes it easier for data engineers, data scientists and developers to develop and deploy analytics applications. With integration through Jupyter notebooks in Data Science Experience, IBM Analytics Engine provides the foundation for executing data science and machine learning workloads. The IBM Analytics Engine utilizes the Hortonworks Data Platform as the underlying Hadoop distribution, providing access to a market leading open source Hadoop distribution.

Continue reading