This repository hold the ETL files for ZL.
For implementers, see Puppet
The recommended Java version is OpenJDK 8 JDK
You must have access to MySQL databases for the instances that you plan to use as sources. The recommendation is that these databases are replicas of production DBs, not the actual production instances, as a precaution to ensure no production data is inadvertently affected by the ETL process.
- You must have access to a SQL Server target instance into which to ETL from MySQL
- You can use the Docker instance described here.
- Create a directory to serve as your execution environment directory
- Install the jobs and datasources into this directory
- For developers, create a symbolic link to the datasources and jobs folders of this project
- For implementers, download the zip artifact from maven and extract it into this directory
- Install an application.yml file into this directory
- For developers, copy or create a symolic link to example-application.yml, and modify settings to match your database configuration
- For implementers, install the application.yml file, and ensure all of the database settings are setup correctly
- For more details on configuration options for application.yml, see the PETL project
- Install the PETL executable jar file
- For developers, you can clone and build the PETL application locally and create a symbolic link to target/petl-*.jar
- For developers or implementers, you can download the latest PETL jar from Bamboo
The directory structure should look like this:
configurations/datasources#
.
├── openmrs-cange.yml
├── openmrs-hinche.yml
├── openmrs-hiv.yml
├── openmrs-humci.yml
├── openmrs-lacolline.yml
├── openmrs-mirebalais.yml
├── openmrs-saint_marc_hsn.yml
├── openmrs-thomonde.yml
├── warehouse-hiv.yml
└── warehouse.yml
configurations/jobs#
- create-partitions.yml
- create-source-views-and-functions.yml
- import-to-table-partition.yml
- refresh-base-tables.yml
- refresh-hiv-data.yml
- refresh-humci-data.yml
- refresh-openmrs-data.yml
- sql folder
To execute the ETL pipeline
service petl start