This lab module focuses on Auto Data Quality uniqueness checks, where you specify a column and have AutoDQ check it for duplicates.
Successful completion of prior modules
5 minutes or less
Data Quality Overview
About Auto Data Quality
Use Auto Data Quality
- Understand options for data quality in Dataplex
- Practical knowledge of running the Auto Data Quality uniqueness checks feature
We will use the same table as in the Data Profiling lab module.
Familiarize yourself with the table by running the SQL below in the BigQuery UI:
SELECT * FROM oda_dq_scratch_ds.customer_master LIMIT 20
Click on the scan and define rules. Let's start with the recommendations from the Data Profiling results.
Let's select all the fields for the quality scan, then click "run now".
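To build intuition for what a uniqueness rule looks for, the sketch below runs a plain `GROUP BY ... HAVING COUNT(*) > 1` duplicate check against a toy table. It is only an illustration and not how Dataplex implements the check. It uses an in-memory SQLite table as a stand-in for `customer_master`; the `client_id` and `city` columns, the sample rows, and the `find_duplicates` helper are all hypothetical.

```python
import sqlite3


def find_duplicates(conn, table, column):
    """Return (value, occurrences) pairs for values that appear more than once."""
    # Identifiers cannot be bound as query parameters, so they are quoted inline.
    # Only pass trusted table and column names to this helper.
    query = (
        f'SELECT "{column}", COUNT(*) AS occurrences '
        f'FROM "{table}" '
        f'GROUP BY "{column}" '
        f'HAVING COUNT(*) > 1 '
        f'ORDER BY "{column}"'
    )
    return conn.execute(query).fetchall()


# Toy stand-in for customer_master; the columns and rows are made up.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customer_master (client_id TEXT, city TEXT)")
conn.executemany(
    "INSERT INTO customer_master VALUES (?, ?)",
    [("c1", "Austin"), ("c2", "Boston"), ("c2", "Chicago"), ("c3", "Denver")],
)

duplicates = find_duplicates(conn, "customer_master", "client_id")
print(duplicates)  # [('c2', 2)]
```

An empty result means every value in the column is unique, which is the condition a uniqueness rule expects. The same `GROUP BY` and `HAVING` pattern can be adapted for the BigQuery UI by substituting the real table and a real column name.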
This concludes the module. Proceed to the next module.