forked from databrickslabs/ucx
-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathlabs.yml
222 lines (192 loc) · 9.08 KB
/
labs.yml
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
---
name: ucx
description: Unity Catalog Migration Toolkit (UCX)
install:
script: src/databricks/labs/ucx/install.py
uninstall:
script: src/databricks/labs/ucx/uninstall.py
entrypoint: src/databricks/labs/ucx/cli.py
min_python: 3.10
commands:
- name: open-remote-config
description: Opens remote configuration in the browser
- name: workflows
description: Show deployed workflows and their state
table_template: |-
Step\tState\tStarted
{{range .}}{{.step}}\t{{.state}}\t{{.started}}
{{end}}
- name: logs
description: Show logs from the latest job run
flags:
- name: workflow
description: Name of the workflow to show logs for - assessment, table-migration, etc
- name: installations
description: Show installations by different users on the same workspace
table_template: |-
Path\tDatabase\tWarehouse
{{range .}}{{.path}}\t{{.database}}\t{{.warehouse_id}}
{{end}}
- name: skip
description: Create a skip comment on a schema or a table
flags:
- name: schema
description: Schema Name to Skip
- name: table
description: Table Name to Skip
- name: sync-workspace-info
is_account_level: true
description: upload workspace config to all workspaces in the account where ucx is installed
- name: report-account-compatibility
is_account_level: true
description: aggregation of UCX output of multiple workspaces in the account.
If --workspace-ids is not provided, it will use all workspaces present in the account.
flags:
- name: workspace-ids
description: List of workspace IDs to create account groups from.
- name: manual-workspace-info
description: only supposed to be run if cannot get admins to run `databricks labs ucx sync-workspace-info`
- name: create-table-mapping
description: create initial table mapping for review
- name: ensure-assessment-run
description: ensure the assessment job was run on a workspace
- name: validate-external-locations
description: validates and provides mapping to external table to external location and shared generation tf scripts
- name: repair-run
description: Repair Run the Failed Job
flags:
- name: step
description: name of the step
- name: revert-migrated-tables
description: remove notation on a migrated table for re-migration
flags:
- name: schema
description: Schema to revert (if left blank all schemas in the workspace will be reverted)
- name: table
description: Table to revert (if left blank all tables in the schema will be reverted). Requires schema parameter to be specified.)
- name: delete_managed
description: Revert and delete managed tables
- name: move
description: move tables across schema/catalog withing a UC metastore
flags:
- name: from-catalog
description: from catalog name
- name: from-schema
description: schema name to migrate.
- name: from-table
description: table names to migrate. enter * to migrate all tables
- name: to-catalog
description: target catalog to migrate schema to
- name: to-schema
description: target schema to migrate tables to
- name: alias
description: |
alias tables across schema/catalog withing a UC metastore
create a view pointing to the "from" table
if a view is aliased, recreates the same view in the target schema/catalog
flags:
- name: from-catalog
description: from catalog name
- name: from-schema
description: from schema
- name: from-table
description: table names to alias. enter * to migrate all tables
- name: to-catalog
description: target catalog to migrate schema to
- name: to-schema
description: target schema to migrate tables to
- name: principal-prefix-access
description: For azure cloud, identifies all storage account used by tables in the workspace, identify spn and its
permission on each storage accounts. For aws, identifies all the Instance Profiles configured in the workspace and
its access to all the S3 buckets, along with AWS roles that are set with UC access and its access to S3 buckets.
The output is stored in the workspace install folder.
flags:
- name: subscription-id
description: Subscription to scan storage account in
- name: aws-profile
description: AWS Profile to use for authentication
- name: create-missing-principals
description: For AWS, this command identifies all the S3 locations that are missing a UC compatible role and
creates them. It takes single-role optional parameter.
If set to True, it will create a single role for all the S3 locations.
flags:
- name: aws-profile
description: AWS Profile to use for authentication
- name: single-role
description: (Optional) Create a single role for all the S3 locations (default:True)
- name: create-uber-principal
description: For azure cloud, creates a service principal and gives STORAGE BLOB READER access on all the storage account
used by tables in the workspace and stores the spn info in the UCX cluster policy. For aws,
it identifies all s3 buckets used by the Instance Profiles configured in the workspace.
flags:
- name: subscription-id
description: Subscription to scan storage account in
- name: aws-profile
description: AWS Profile to use for authentication
- name: validate-groups-membership
description: Validate groups to check if the groups at account level and workspace level have different memberships
table_template: |-
Workspace Group Name\tMembers Count\tAccount Group Name\tMembers Count\tDifference
{{range .}}{{.wf_group_name}}\t{{.wf_group_members_count}}\t{{.acc_group_name}}\t{{.acc_group_members_count}}\t{{.group_members_difference}}
{{end}}
- name: migrate-credentials
description: Migrate credentials for storage access to UC storage credential
flags:
- name: subscription-id
description: Subscription to scan storage account in
- name: aws-profile
description: AWS Profile to use for authentication
- name: create-account-groups
is_account_level: true
description: |
Creates account level groups for all groups in workspaces provided in --workspace-ids.
If --workspace-ids is not provided, it will use all workspaces present in the account.
flags:
- name: workspace-ids
description: List of workspace IDs to create account groups from.
- name: migrate-locations
description: Create UC external locations based on the output of guess_external_locations assessment task.
flags:
- name: subscription-id
description: Subscription to scan storage account in
- name: aws-profile
description: AWS Profile to use for authentication
- name: create-catalogs-schemas
description: Create UC external catalogs and schemas based on the destinations created from create_table_mapping command.
This command is supposed to be run before migrating tables to UC.
- name: cluster-remap
description: Re-mapping the cluster to UC
- name: revert-cluster-remap
description: Reverting the Re-mapping of the cluster from UC
- name: migrate-local-code
description: (Experimental) Migrate files in the current directory to be more compatible with Unity Catalog.
- name: show-all-metastores
is_account_level: true
description: Show all metastores available in the same region as the specified workspace
flags:
- name: workspace-id
description: (Optional) Workspace ID to show metastores for
- name: assign-metastore
is_account_level: true
description: Enable Unity Catalog features on a workspace by assign a metastore to it
flags:
- name: workspace-id
description: (Optional) Workspace ID to assign a metastore to
- name: metastore-id
description: (Optional) If there are multiple metastores in the region, specify the metastore ID to assign
- name: default-catalog
description: (Optional) Default catalog to assign to the workspace. If not provided, it will be hive_metastore
- name: migrate-tables
description: |
Trigger the migrate-tables workflow and, optionally, migrate-external-hiveserde-tables-in-place-experimental
workflow and migrate-external-tables-ctas workflow.
- name: migrate-dbsql-dashboards
description: Migrate DBSQL dashboards by replacing legacy HMS tables in DBSQL queries with the corresponding new UC tables.
flags:
- name: dashboard-id
description: (Optional) DBSQL dashboard ID to migrate. If no dashboard ID is provided, all DBSQL dashboards in the workspace will be migrated.
- name: revert-dbsql-dashboards
description: Revert DBSQL dashboards that have been migrated to their original state before the migration.
flags:
- name: dashboard-id
description: (Optional) DBSQL dashboard ID to revert. If no dashboard ID is provided, all migrated DBSQL dashboards in the workspace will be reverted.