Getting started with Analytics Engine powered by Apache Spark

Use the Analytics Engine powered by Apache Spark to automatically spin up lightweight, dedicated Apache Spark clusters to run a wide range of workloads.

With Analytics Engine powered by Apache Spark, you can run jobs on a Spark cluster, run Jupyter notebooks and jobs from other tools in Watson Studio analytics projects by selecting a Spark environment runtime, run Spark SQL or jobs for data transformation, data science, or machine learning using Spark job APIs.

Checking whether the service is installed

An administrator must install Analytics Engine powered by Apache Spark.

To check whether the service is installed:

From the navigation menu, select Services > Services catalog.
Search for Analytics Engine powered by Apache Spark.

If the service is installed and ready to use, the tile in the catalog shows Ready to use.

If the service is installed but no service instances have been created, the tile in the catalog shows Ready to provision.

Important: Even if the service is Ready to use, you must be added to a service instance to use the service.

Accessing the service

Cloud Pak for Data watsonx™ Depending on the other services that are installed, Analytics Engine powered by Apache Spark is available from either of the IBM Cloud Pak® for Data experience or the IBM watsonx experience.

Learn more

To learn more about Analytics Engine powered by Apache Spark, see the following topics based on the experience that is available in your environment:

IBM Cloud Pak for Data documentation

Extending analytics using Spark

IBM watsonx documentation

Extending analytics using Spark