ETL System

This module provides the ETL system for the backend: it acquires the raw gas station dataset, cleans it, and stores the data of interest in the database.

The dataset files are obtained from the following sites:

Additionally, this module uses open geographical data from DIVA-GIS to determine the city and state that a pair of coordinates falls in.
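
To illustrate the idea of finding a region from coordinates, here is a minimal pure-Python sketch based on the ray-casting point-in-polygon test. It is not this module's implementation, which presumably relies on a spatial library (the libspatialindex requirement suggests so). The names point_in_polygon, locate, and REGIONS are hypothetical, and the square "states" are made-up stand-ins for real boundary polygons.

```python
# Illustrative sketch only: all names and the toy polygons are assumptions.

def point_in_polygon(lon, lat, polygon):
    """Ray-casting test: True if (lon, lat) lies inside the polygon.

    polygon is a list of (lon, lat) vertices; the last vertex is
    implicitly connected back to the first.
    """
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Only edges that straddle the horizontal line through the point count.
        if (y1 > lat) != (y2 > lat):
            # Longitude at which the edge crosses that horizontal line.
            x_cross = x1 + (lat - y1) * (x2 - x1) / (y2 - y1)
            if lon < x_cross:
                inside = not inside
    return inside


def locate(lon, lat, regions):
    """Return the name of the first region containing the point, or None."""
    for name, polygon in regions.items():
        if point_in_polygon(lon, lat, polygon):
            return name
    return None


# Made-up square regions standing in for real state boundaries.
REGIONS = {
    "State A": [(0, 0), (10, 0), (10, 10), (0, 10)],
    "State B": [(10, 0), (20, 0), (20, 10), (10, 10)],
}

if __name__ == "__main__":
    print(locate(5, 5, REGIONS))
    print(locate(25, 5, REGIONS))
```

A linear scan over every polygon is fine for a handful of regions. With many regions and many points, a spatial index that first narrows the candidates by bounding box is the usual optimization.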

Requirements

Python 3.x, any version of pip, and virtualenv (or venv)

The PostgreSQL database must also exist and be running.

The libspatialindex-dev library must be installed in the environment.

Finally, the DATABASE_URL environment variable must be set to the connection string for the PostgreSQL database.
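
As a quick sanity check before running the ETL, a small script along these lines can confirm that DATABASE_URL is set. This is a sketch using only the standard library. The helper names get_database_url and describe_database_url are hypothetical and not part of this module, and the summary helper assumes the connection string is written in URL form.

```python
import os
from urllib.parse import urlsplit


def get_database_url(env=os.environ):
    """Return the value of DATABASE_URL, or raise if it is missing or empty."""
    url = env.get("DATABASE_URL", "").strip()
    if not url:
        raise RuntimeError("DATABASE_URL is not set")
    return url


def describe_database_url(url):
    """Summarize a URL-style connection string without credentials.

    Assumes the form scheme://user:password@host:port/dbname; falls back
    to port 5432, the PostgreSQL default, when no port is given.
    """
    parts = urlsplit(url)
    return f"{parts.scheme}://{parts.hostname}:{parts.port or 5432}{parts.path}"


if __name__ == "__main__":
    print("Using database:", describe_database_url(get_database_url()))
```

Printing the summary rather than the raw value keeps the password out of logs.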

Execution

Run python main.py or python3 main.py, depending on your system.