Skip to content

gregbeaumont/E2ESynapseDemo

 
 

Repository files navigation

Healthcare CMS Demo using Azure Synapse Analytics, Azure Data Factory and Power BI

UPDATE: As of 3/5/2021, Azure Synapse (SQL DW) has been upgraded to Synapse Analytics (current GA version) which has Synapse Workspace, Serverless Pool, On-demand SQL Pool and more!

UPDATE: As of 2/17/2021, a new and updated version of this solution is now available. It includes new 2018 data (now 148M total rows), optimized ADF data flows, a more efficient dimensional design, and a new Power BI report titled 'Medicare Part D Report V2.' More info about these changes is coming soon.

Audience

If you belong to one of the below categories, you have landed at the right place!

  1. If you are new to Azure Synapse and/or Azure Data Factory or just getting started on them.
  2. If you are looking to set up a Data Warehousing environment using Azure Synapse, Azure Data Factory and PowerBI in just four steps.
  3. If you are interested in reporting on Centers for Medicare and Medicaid Services (CMS) data with Azure Synapse and PowerBI.
  4. If you are interested in testing a combination of PowerShell script and ARM template that deploys multiple Azure Services for an end-to-end solution.
  5. If you want to explore a Power BI solution over Azure Synapse that uses a Composite Model with Aggregations and Materialized Views.

Purpose

The purpose of this project is to 1) automate the deployment of Azure Resources (Azure Resource Group, Azure Data Lake Storage, Azure Synapse and Azure Data Factory) 2) automate downloading CMS data from https://cms.gov and load into Azure Synapse Tables. This CMS data can then be used for reporting using Power BI.

Pre-requisites:

Azure Account with permissions to create Azure Resources (Storage, Azure Data Factory, Synapse). A tutorial of deploying the Azure ARM Template is here: https://youtu.be/-YnF2EHzTzs

4 Easy Steps to Deployment:

  1. Login to Azure Portal and open Cloud Shell from the top navigation bar. Make sure to select PowerShell prompt. Alternatively, you can use PowerShell from your local machine, click here.

  2. Clone this Git Repository and switch to the new directory (using commands below)

    git clone https://github.com/gregbeaumont/E2ESynapseDemo.git

    cd E2ESynapseDemo

  3. Start script and provide Resource Group Name and User credentials (See example below. Note: Don't use the same names as in example below!)

    PS /home/YOURUSERNAME/E2ESynapseDemo> ./setup.ps1

    Do you want to use any existing resource(s) or create all resources from scratch? Enter 1 for NEW or 2 for EXISTING: 1

    Enter New Resource Group Name: CMSDemo100

    Default Synapse Admin Username is: sqladminuser

    Enter Synapse Admin Password: Password0~

  4. Next step is to get the Power BI tempate and connect to the Synapse instance. A tutorial of deploying the parameterized Power BI Template file is here: https://youtu.be/_mslQZM7NrU

Note:If you are using Azure Portal Cloud Shell, you might see errors due to inactivity. Please ignore them as the script would continue to run in the background. Total time for end to end deployment is around 1 hour.

Architectures

Documentation for the Medicare Fee-For Service Provider Utilization & Payment Part D Prescriber Data can be found here

All related Architectures for this solution and PowerBI dashboard images can be found here

Script Explained

Step by step process how the deployment script performs is explained here

Contributing

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact [email protected] with any additional questions or comments.

License

Copyright (c) Microsoft Corporation. All rights reserved. Licensed under the MIT License. This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact [email protected] with any additional questions or comments.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • PowerShell 87.5%
  • TSQL 12.5%