DengAI: Predicting Disease Spread

Overview

This project predicts local epidemics of dengue fever to help fight life-threatening pandemics. It was created as an entry to the DengAI: Predicting Disease Spread competition hosted by DrivenData.

The project predicts the number of dengue fever cases reported each week in two locations:

  • San Juan (Puerto Rico)
  • Iquitos (Peru)

Data

The predictor variables are environmental measurements describing changes in temperature, precipitation, vegetation, and more.

Figure: Total cases of dengue fever over time by city (San Juan and Iquitos)

Usage

To allow interactive visualisation of the total cases over time by city, we built a simple Streamlit app. Click here to discover patterns in the data!

Predictive Models

This is a time series project that uses Random Forest and Negative Binomial regression models to predict the total number of dengue fever cases over time in each city.
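Because the data is a weekly time series per city, one common way to validate such models is to hold out the most recent weeks of each city rather than a random sample. The sketch below illustrates that idea only and is not necessarily how the notebooks split the data. The function name `split_by_city`, the `holdout_fraction` parameter, and the column names (`city`, `year`, `weekofyear`, `total_cases`) are assumptions, and the rows are toy values rather than competition data.

```python
from collections import defaultdict


def split_by_city(rows, holdout_fraction=0.2):
    """Group weekly rows by city and split each series chronologically.

    Hypothetical helper: returns {city: (train_rows, validation_rows)}.
    """
    by_city = defaultdict(list)
    for row in rows:
        by_city[row["city"]].append(row)

    splits = {}
    for city, series in by_city.items():
        # Order by (year, week) so validation weeks always come after training weeks.
        ordered = sorted(series, key=lambda r: (r["year"], r["weekofyear"]))
        cut = len(ordered) - int(len(ordered) * holdout_fraction)
        splits[city] = (ordered[:cut], ordered[cut:])
    return splits


# Toy rows for illustration only (deliberately out of order); not real data.
rows = [
    {"city": city, "year": 2000, "weekofyear": week, "total_cases": 0}
    for city in ("sj", "iq")
    for week in range(10, 0, -1)
]

splits = split_by_city(rows)
train, valid = splits["sj"]
print(len(train), len(valid))
```

Splitting each city separately keeps the two series from leaking into each other, which matters if a separate model is fitted per city.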

We used the Mean Squared Error (MSE) metric to evaluate the models.
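For reference, MSE is the average of the squared differences between the observed and predicted weekly case counts, so large misses are penalised more heavily than small ones. The following is a minimal sketch of the calculation. The function name `mean_squared_error` and the sample values are illustrative assumptions, not the project's code or data.

```python
def mean_squared_error(y_true, y_pred):
    """Return the mean of squared differences between two equal-length sequences."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("at least one observation is required")
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


# Toy weekly case counts, for illustration only.
actual = [4, 5, 3, 0]
predicted = [3, 5, 5, 1]

# Squared errors are 1, 0, 4, 1, so the mean is 6 / 4 = 1.5.
score = mean_squared_error(actual, predicted)
print(score)
```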

Repository Structure

  • data/ - Contains the training and test datasets.
  • notebooks/ - Jupyter notebooks with exploratory data analysis and model development.
  • submissions/ - Prediction files ready for submission to the competition.
  • images/ - Visualizations generated during analysis, including the scatterplot of total cases by city.

Development

Clone the repository, install the required packages listed in requirements.txt, and run the Jupyter notebooks to replicate the analysis and predictions.

Contributions

Contributions are welcome. Please open an issue or pull request if you would like to contribute to the project.

License

This project is open-source and available under the MIT license.