-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #184 from georgetown-cset/data-airflow
Airflow pipeline creation updates
- Loading branch information
Showing
60 changed files
with
2,446 additions
and
484 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,22 @@ | ||
FROM ubuntu:20.04 | ||
|
||
# Set up system dependencies | ||
RUN apt -y update | ||
RUN apt-get -y update | ||
RUN apt-get install -y build-essential libssl-dev libffi-dev python3-dev python3-pip curl | ||
|
||
# Grab files we need to run | ||
ADD requirements.txt /parat/requirements.txt | ||
ADD parat_scripts/* /parat/ | ||
|
||
# install gsutil and put it on the path for airflow to use | ||
ENV CLOUDSDK_INSTALL_DIR /usr/local/gcloud/ | ||
RUN curl -sSL https://sdk.cloud.google.com | bash | ||
ENV PATH $PATH:/usr/local/gcloud/google-cloud-sdk/bin | ||
|
||
# Install python dependencies | ||
WORKDIR /parat | ||
ENV AIRFLOW_GPL_UNIDECODE=yes | ||
RUN pip3 install -r requirements.txt | ||
# Make sure the above config succeeded | ||
RUN python3 -m pytest test_aggregate_organizations.py -k test_add_location |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file was deleted.
Oops, something went wrong.
Oops, something went wrong.