Setting consumers and resource groups for an instance group
Optionally, modify the default consumer and resource group that the instance group uses.
Before you begin
- Ensure that you meet the requirements to create an instance group. See Prerequisites for an instance group.
- If you have a local environment with a mixed cluster that uses both Linux and Linux on POWER, the Jupyter notebook packages for Linux must be in a different resource group than the packages for Linux on POWER, because the packages differ between the two platforms.
About this task
Each instance group is assigned a top-level consumer. When the instance group is created, new consumers are created by default under this top-level consumer for the core components of the instance group: Spark drivers, executors, and batch master. Depending on your configuration, consumers might also be created for the shuffle, notebook, and history services.
The default top-level consumer is a consumer with the same name as the instance group (for example, if your instance group name is ABC, then the default top-level consumer is /ABC). The top-level consumer represents the entire cluster and all its resources. The default resource group for an instance group is the ComputeHosts resource group. You can change the default top-level consumer and resource group for the instance group.
You can also change the consumer for each component in the instance group. For information on how to configure your own resource settings to avoid resource competition and unwanted resource reclaims, see Best practices for resource group configuration.
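The default naming described above can be sketched in a few lines of Python. This is only an illustration of the hierarchy, not product code: the /ABC top-level consumer and the ComputeHosts resource group come from the text, while the function names and the child consumer path pattern (/ABC/executors and so on) are hypothetical placeholders. The actual child consumer names that the product generates might differ.

```python
# Illustrative sketch only. Component names follow the description above;
# the child consumer path pattern is a hypothetical placeholder.
CORE_COMPONENTS = ["drivers", "executors", "batch-master"]
OPTIONAL_COMPONENTS = ["shuffle", "notebook", "history"]

# Default resource group named in the text above.
DEFAULT_RESOURCE_GROUP = "ComputeHosts"


def default_top_level_consumer(instance_group_name):
    """Return the default top-level consumer, for example ABC -> /ABC."""
    return "/" + instance_group_name


def default_consumers(instance_group_name, optional=()):
    """Map each component to a consumer under the top-level consumer.

    Core components always get a consumer. Optional services get one
    only if they are listed in `optional`.
    """
    top = default_top_level_consumer(instance_group_name)
    components = CORE_COMPONENTS + [c for c in OPTIONAL_COMPONENTS if c in optional]
    # Hypothetical naming: <top-level consumer>/<component>
    return {component: top + "/" + component for component in components}


if __name__ == "__main__":
    print(default_top_level_consumer("ABC"))
    for component, consumer in default_consumers("ABC", optional=["notebook"]).items():
        print(component, consumer, DEFAULT_RESOURCE_GROUP)
```

Changing the top-level consumer in this sketch corresponds to passing a different name, and changing a component's consumer corresponds to overriding one entry in the returned mapping.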
When your instances (instance groups, Anaconda distribution instances, and application instances) are deployed to a shared file system, the shuffle service is disabled by default. To use the shuffle service, you must enable it and complete additional configuration. See Enabling and configuring the Spark shuffle service.