This collection of AWS CloudFormation resource types allows Databricks to be controlled using AWS CloudFormation.
There is also developer documentation available if you are interested in building or contributing.
Resource | Description | Documentation |
---|---|---|
Databricks::Clusters::Cluster | This resource type manages a Databricks Cluster | /Databricks-Clusters-Cluster |
Databricks::Clusters::Job | This resource type manages a Databricks Cluster Job | /Databricks-Clusters-Job |
To get started:
- Sign in to the AWS Management Console with your account and navigate to CloudFormation.
- Select "Public extensions" from the left-hand pane and filter Publisher by "Third Party".
- Use the search bar to filter by the "Databricks" prefix.
  Note: All official Databricks resources begin with Databricks:: and specify that they are Published by Databricks.
- Select the desired resource name to view more information about its schema, and click Activate.
- On the Extension details page, specify:
  - Extension name
  - Execution role ARN
  - Automatic updates for minor version releases
  - Configuration
- In your terminal, use the SetTypeConfiguration operation to specify the configuration data for the registered Databricks CloudFormation resource type in the given account and region.
For example:
$ aws cloudformation set-type-configuration \
--region us-west-2 --type RESOURCE \
--type-name Databricks::Clusters::Job \
--configuration-alias default \
--configuration "{ \"DatabricksAccess\":{\"DatabricksInstance\":\"https://abc123.cloud.databricks.com\",\"Token\":\"YOURAPIKEY\"}}"
- After you have your resource configured, create your AWS stack that includes any of the activated Databricks resources.
For more information about available commands and workflows, see the official AWS documentation.
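
The activation and configuration steps above can also be scripted. The following is a minimal sketch rather than an official workflow: the function names are hypothetical, the publisher ID and execution role ARN are placeholders you must supply yourself, and the JSON builder assumes that neither value contains a double quote or backslash. It relies on the `activate-type` and `set-type-configuration` commands of the AWS CLI.

```shell
#!/usr/bin/env bash
# Sketch only: these function names are hypothetical helpers,
# not part of any Databricks or AWS tooling.

# Activate a public third-party resource type in one region.
# Arguments: region, publisher ID (placeholder, look it up in the registry),
#            type name, execution role ARN.
activate_databricks_type() {
  aws cloudformation activate-type \
    --region "$1" --type RESOURCE \
    --publisher-id "$2" \
    --type-name "$3" \
    --execution-role-arn "$4" \
    --auto-update
}

# Build the configuration JSON without hand-escaped quotes.
# Assumes neither argument contains a double quote or backslash.
build_databricks_config() {
  printf '{"DatabricksAccess":{"DatabricksInstance":"%s","Token":"%s"}}' "$1" "$2"
}

# Store the configuration for an activated type.
# Arguments: region, type name, Databricks instance URL, API token.
set_databricks_config() {
  aws cloudformation set-type-configuration \
    --region "$1" --type RESOURCE \
    --type-name "$2" \
    --configuration-alias default \
    --configuration "$(build_databricks_config "$3" "$4")"
}
```

For example, `set_databricks_config us-west-2 Databricks::Clusters::Job https://abc123.cloud.databricks.com "$DATABRICKS_TOKEN"` reads the token from an environment variable (an assumed name) instead of typing it into the command line.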
The Databricks CloudFormation resources are available on the CloudFormation Public Registry in the following regions:
Code | Name |
---|---|
us-east-1 | US East (N. Virginia) |
us-east-2 | US East (Ohio) |
us-west-1 | US West (N. California) |
us-west-2 | US West (Oregon) |
ap-south-1 | Asia Pacific (Mumbai) |
ap-northeast-1 | Asia Pacific (Tokyo) |
ap-northeast-2 | Asia Pacific (Seoul) |
ap-southeast-1 | Asia Pacific (Singapore) |
ap-southeast-2 | Asia Pacific (Sydney) |
ca-central-1 | Canada (Central) |
eu-central-1 | Europe (Frankfurt) |
eu-west-1 | Europe (Ireland) |
eu-west-2 | Europe (London) |
eu-west-3 | Europe (Paris) |
eu-north-1 | Europe (Stockholm) |
sa-east-1 | South America (São Paulo) |
Note: To privately register a resource in any other region, use the provided packages.
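
Private registration can be done with the AWS CLI `register-type` command. The sketch below assumes you have uploaded one of the provided packages to an S3 location of your own; the function name and the bucket and key in the usage comment are hypothetical.

```shell
#!/usr/bin/env bash
# Sketch only: register_private_type is a hypothetical helper name.

# Register a resource type privately in a region where it is not
# available on the public registry.
# Arguments: region, type name, S3 URL of the uploaded handler package.
register_private_type() {
  aws cloudformation register-type \
    --region "$1" --type RESOURCE \
    --type-name "$2" \
    --schema-handler-package "$3"
}

# Example (not run here; bucket and key are placeholders):
# register_private_type af-south-1 Databricks::Clusters::Job s3://my-bucket/databricks-clusters-job.zip
```

Registration is asynchronous: the command returns a registration token, which `aws cloudformation describe-type-registration` can use to check progress.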
---
AWSTemplateFormatVersion: '2010-09-09'
Description: Shows how to create a Databricks Cluster in Databricks
Resources:
  MySampleProject:
    Type: Databricks::Clusters::Cluster
    Properties:
      ClusterName: my-cluster
      SparkVersion: 7.3.x-scala2.12
      NodeTypeId: i3.xlarge
      AwsAttributes:
        Availability: ON_DEMAND
        ZoneId: eu-west-1c
        SpotBidPricePercent: 100
        FirstOnDemand: 1
        EbsVolumeSize: 1
        EbsVolumeThroughput: 125
        EbsVolumeCount: 0
        EbsVolumeType: GENERAL_PURPOSE_SSD
        EbsVolumeIops: 3000
      NumWorkers: 1
      DriverNodeTypeId: i3.xlarge
      AutoterminationMinutes: 0
      EnableElasticDisk: false
      ApplyPolicyDefaultValues: false
      EnableLocalDiskEncryption: false
---
AWSTemplateFormatVersion: '2010-09-09'
Description: Shows how to create a Job in Databricks
Resources:
  MySampleProject:
    Type: Databricks::Clusters::Job
    Properties:
      Name: mytestjobnextgen
      EmailNotifications:
        NoAlertForSkippedRuns: false
      TimeoutSeconds: 0
      Schedule:
        TimezoneId: Europe/London
        QuartzCronExpression: "20 30 * * * ?"
      MaxConcurrentRuns: 1
      ExistingClusterId: 1201-092121-123abcsd
      Format: SINGLE_TASK
      AccessControlList:
        - UserName: "[email protected]"
          PermissionLevel: IS_OWNER
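
Either template above can be saved to a file and turned into a stack from the command line. The following sketch assumes the template is stored locally and that the resource type has already been activated and configured in the target region; the function, file, and stack names are hypothetical.

```shell
#!/usr/bin/env bash
# Sketch only: create_databricks_stack is a hypothetical helper name.

# Validate a template file, then create a stack from it.
# Stops before creating the stack if validation fails.
# Arguments: region, stack name, path to the template file.
create_databricks_stack() {
  aws cloudformation validate-template \
    --region "$1" \
    --template-body "file://$3" >/dev/null || return 1
  aws cloudformation create-stack \
    --region "$1" \
    --stack-name "$2" \
    --template-body "file://$3"
}

# Example (not run here; names are placeholders):
# create_databricks_stack eu-west-1 databricks-cluster-sample cluster.yaml
```

Validating first catches YAML syntax problems, such as broken indentation, before CloudFormation starts provisioning.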