External Spark engines

External Spark engines run in an environment separate from the one where watsonx.data is deployed. You can deploy them in the following environments.

watsonx.data on IBM Software Hub

  • Spark instance on Cloud
  • Spark on Cloud Pak for Data
  • Spark on EMR

To connect to watsonx.data, select the section that matches the environment where your Spark engine is deployed: