Setting Docker/cgroup container definitions for an instance group
Optionally set up the instance group to run within containers, either Docker or control groups (cgroups).
Before you begin
- You can only set Docker/cgroup container definitions for an instance group with certain Spark versions. Spark versions not supported: 1.5.2.
- Based on your requirements, ensure that you meet the requirements to create an instance group. See Prerequisites for an instance group.
- If you are using Docker, see the Docker image requirements and limitations section in Docker overview.
About this task
You can enable Spark drivers and executors in an instance group to run within Docker or cgroup
containers. You can also run Spark services in Docker containers.
- With Docker, you enable the Spark drivers, executors, and services of an instance group to run in separate Docker containers. A Docker container holds everything that an instance group needs to run, including an operating system, user-added files, metadata, and other dependencies.
- With cgroups, you enable per-resource limits (such as CPU shares and memory) for Spark drivers and executors in an instance group. You can also set resource limits for Docker containers through Spark parameters so that Docker containers run within those CPU and memory bounds.
Procedure
Results
What to do next
- Optionally, to add any extra packages that the instance group requires, see Adding dependent packages. To add data connectors to the instance group, see Adding data connectors.
- Create and deploy the instance group.
- Click Create and Deploy Instance Group to create the instance group and deploy its packages simultaneously. In this case, the new instance group appears on the Instance Groups page in the Ready state. Verify your deployment and then start the instance group.
- Click Create Only to create the instance group but manually deploy its packages later. In this case, the new instance group appears on the Instance Groups page in the Registered state. When you are ready to deploy packages, deploy the instance group and verify the deployment. Then, start the instance group.