This module provides an ETL system for the backend. It acquires the raw Gas Station dataset, cleans it, and stores the data of interest in the database.
The dataset files are obtained from the following sites:
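- File 1 (places.xml): Listado de Estaciones de Servicio con Georreferencia (list of georeferenced service stations), XML from CRE
- File 2 (prices.xml): Listado de Precios Comerciales de Gasolina y Diesel por Estación de Servicio (list of commercial gasoline and diesel prices per service station), XML from CRE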
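As a rough illustration of how the two files could be combined, the sketch below parses a places document and a prices document and joins them on a station identifier. The element and attribute names (`place`, `place_id`, `location/x`, `location/y`, `gas_price`, `type`) and the sample values are assumptions made for this example and may not match the actual CRE files; the function names are hypothetical and are not taken from this module.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample documents; the real CRE schema may differ.
PLACES_XML = """<places>
  <place place_id="2039">
    <name>ESTACION EJEMPLO</name>
    <location><x>-116.9214</x><y>32.47641</y></location>
  </place>
</places>"""

PRICES_XML = """<places>
  <place place_id="2039">
    <gas_price type="regular">20.5</gas_price>
    <gas_price type="diesel">22.1</gas_price>
  </place>
</places>"""


def parse_places(xml_text):
    """Return {place_id: {"name", "lon", "lat"}} from a places document."""
    stations = {}
    for place in ET.fromstring(xml_text).iter("place"):
        stations[place.get("place_id")] = {
            "name": place.findtext("name", default="").strip(),
            # Assumes every station carries both coordinates.
            "lon": float(place.findtext("location/x")),
            "lat": float(place.findtext("location/y")),
        }
    return stations


def parse_prices(xml_text):
    """Return {place_id: {fuel_type: price}} from a prices document."""
    prices = {}
    for place in ET.fromstring(xml_text).iter("place"):
        prices[place.get("place_id")] = {
            p.get("type"): float(p.text) for p in place.findall("gas_price")
        }
    return prices


def merge(stations, prices):
    """Attach prices to each station; stations without prices get an empty dict."""
    return {
        pid: {**info, "prices": prices.get(pid, {})}
        for pid, info in stations.items()
    }
```

A real cleaning step would probably also need to handle stations with missing coordinates or non-numeric prices, which this sketch does not attempt.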
Additionally, this module uses open geographical data from DIVA-GIS to find the city and state that correspond to each station's coordinates.
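The coordinate lookup can be pictured as a point-in-polygon search over administrative boundaries. The following is a minimal, standard-library-only sketch of that idea, not this module's actual implementation: the `REGIONS` list, its square polygons, and the function names are invented for illustration, and a real lookup over DIVA-GIS boundaries would most likely rely on a spatial index rather than the linear scan used here.

```python
# Hypothetical, simplified regions: (state, city, polygon), where the polygon
# is a list of (lon, lat) vertices. Real boundaries are far more detailed.
REGIONS = [
    ("State A", "City 1", [(0.0, 0.0), (4.0, 0.0), (4.0, 4.0), (0.0, 4.0)]),
    ("State B", "City 2", [(4.0, 0.0), (8.0, 0.0), (8.0, 4.0), (4.0, 4.0)]),
]


def point_in_polygon(lon, lat, polygon):
    """Ray casting: count how many edges a ray cast to the right crosses."""
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Only edges that straddle the horizontal line through the point count.
        if (y1 > lat) != (y2 > lat):
            # Longitude at which the edge crosses that horizontal line.
            x_cross = x1 + (lat - y1) * (x2 - x1) / (y2 - y1)
            if lon < x_cross:
                inside = not inside
    return inside


def locate(lon, lat, regions=REGIONS):
    """Return (state, city) for the first region containing the point, else None."""
    for state, city, polygon in regions:
        if point_in_polygon(lon, lat, polygon):
            return state, city
    return None
```

For example, `locate(1.0, 1.0)` falls inside the first square and `locate(10.0, 10.0)` falls outside both, so the caller has to decide how to store stations that match no region.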
Running this module requires Python 3.x, any version of pip, and virtualenv (or venv).
The PostgreSQL database must also exist and be running.
The libspatialindex-dev library must be installed in the environment.
Finally, the DATABASE_URL environment variable must be set in the system and contain the connection string for the PostgreSQL database.
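A startup check along the following lines can make a missing or malformed DATABASE_URL fail early with a readable message. This is only a sketch under assumptions: the helper name `get_database_url` is hypothetical, and the accepted schemes (`postgres`, `postgresql`, optionally followed by a `+driver` suffix) are a guess at what the module's database layer expects.

```python
import os
from urllib.parse import urlsplit


def get_database_url(env=os.environ):
    """Return the connection string, or raise a clear error if it looks unusable."""
    url = env.get("DATABASE_URL")
    if not url:
        raise RuntimeError("DATABASE_URL is not set")

    parts = urlsplit(url)
    # Tolerate driver suffixes such as "postgresql+psycopg2" (assumed convention).
    scheme = parts.scheme.split("+")[0]
    if scheme not in ("postgres", "postgresql"):
        raise RuntimeError(f"Unexpected scheme in DATABASE_URL: {parts.scheme!r}")
    if not parts.path.strip("/"):
        raise RuntimeError("DATABASE_URL does not name a database")
    return url
```

Passing the environment as a parameter keeps the check easy to test without modifying the real process environment.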
Run the module with either python main.py or python3 main.py, depending on your system.