Troubleshooting
Problem
This article discusses the trade-offs of processing data locally versus on another DSE Analytics node.
Background
One of the configurable scheduling properties in Apache Spark is spark.locality.wait.
When a job runs, Spark determines where to execute each task based on factors such as the memory and cores available on a node, where the data is located in the cluster, and which executors are available.
One of the most important features of DataStax Enterprise Analytics is its awareness of where data resides in the cluster. In the first instance, Spark will try to execute tasks local to where the data resides, to avoid the overhead of transferring the data across the network to an executor on another node.
By default, spark.locality.wait is set to 3 seconds. This is how long Spark will wait for an executor local to the data before launching a task. If no data-local executor becomes available within this period, Spark gives up on locality and launches the task on a less-local node.
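For reference, the property can be set cluster-wide in spark-defaults.conf; the value below is purely illustrative, not a recommendation:

```
# spark-defaults.conf -- illustrative value only
# Raise above the 3s default when shuffling data is costly,
# lower it (or set 0) when idle executors cost more than a shuffle.
spark.locality.wait    6s
```

The same property can also be supplied per job at submission time with a `--conf spark.locality.wait=6s` argument.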
Symptoms
SCENARIO 1 - Large data sets
Jobs are taking too long to complete, and investigation shows that tasks are running on non-local nodes. Large amounts of data are being transferred to remote nodes (shuffling), resulting in processing delays.
SCENARIO 2 - Small data sets
Despite jobs processing small amounts of data, they still take too long because only one or two tasks run on data-local nodes while the rest of the cluster sits mostly idle.
Tuning
When configuring a value for spark.locality.wait, remember that there is a penalty associated with setting it either too high or too low.
In the first scenario above, transferring large amounts of data across the network is expensive. Data locality should be the primary consideration, so increase the wait time so that tasks wait to launch on data-local nodes.
In the second scenario, where the data sets are small, shuffling data around the cluster is relatively inexpensive compared with leaving nodes idle. It makes sense to reduce the wait time so tasks are distributed across the rest of the cluster (higher parallelism).
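Spark also exposes per-locality-level overrides (spark.locality.wait.process, spark.locality.wait.node, spark.locality.wait.rack) that default to the value of spark.locality.wait when unset, so the two scenarios can be tuned at different granularities. The values below are illustrative only:

```
# spark-defaults.conf -- illustrative per-level overrides
spark.locality.wait.process   3s   # wait for a process-local slot (same executor JVM)
spark.locality.wait.node      3s   # wait for a node-local slot
spark.locality.wait.rack      1s   # wait for a rack-local slot before going anywhere
```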
As a general rule, long-running jobs benefit from a higher wait time, since the cost of waiting has less impact on the job's overall completion time. Jobs that only take a short time (e.g., 0.5-2 seconds) are better off with a small wait time. In some circumstances, very short jobs should use a wait time of zero so they execute immediately on the next available node.
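The trade-off described in the two scenarios can be sketched with a toy model. This is not Spark's scheduler logic, just an illustration of the reasoning: a task either gets a data-local slot within the wait budget, or pays the wait plus a transfer cost to run remotely. All numbers are made up for the example.

```python
# Toy model of the locality-wait trade-off (illustrative only; not Spark code).

def task_time(wait, local_free_in, run_time, transfer_cost):
    """Seconds to finish one task given a locality wait budget.

    wait          -- seconds the scheduler waits for a data-local slot
    local_free_in -- seconds until a data-local executor frees up
    run_time      -- task compute time, the same on any node
    transfer_cost -- extra seconds to pull the data to a remote node
    """
    if local_free_in <= wait:
        # A local slot opened in time: pay only the wait, skip the transfer.
        return local_free_in + run_time
    # Gave up on locality: pay the full wait plus the network transfer.
    return wait + transfer_cost + run_time

# Scenario 1: large data set -- the transfer dominates, so waiting pays off.
print(task_time(wait=6, local_free_in=5, run_time=10, transfer_cost=30))  # 15
print(task_time(wait=0, local_free_in=5, run_time=10, transfer_cost=30))  # 40

# Scenario 2: small data set -- the transfer is cheap, so waiting is wasted time.
print(task_time(wait=3, local_free_in=5, run_time=1, transfer_cost=0.2))  # 4.2
print(task_time(wait=0, local_free_in=5, run_time=1, transfer_cost=0.2))  # 1.2
```

In the first case a 6-second wait beats shipping the data; in the second, a zero wait wins because idle time costs more than the tiny shuffle.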
Ultimately, determine the best wait time by balancing the cost of shuffling data across the cluster against the parallelism of tasks across the cluster.
See also
Apache Spark - Spark Configuration - Scheduling properties
Reviewed: 2023/12/20
Document Location
Worldwide
Historical Number
ka0Ui0000000L1FIAU
Document Information
Modified date:
30 January 2026
UID
ibm17258656