- Two different datasets were used to train machine-learning models to predict the medical outcome of a COVID patient.
- Exploratory data analysis was performed using histograms and box plots on the datasets to understand the data distribution and detect outliers.
- The datasets were cleaned and missing values were imputed based on their specific distributions or modes. The 2 datasets were joined based on closed location.
- Grid Search parameter tuning with 5-fold cross-validation was performed to tune parameters for K-nearest-neighbor, Random Forest and ADA Boost classifiers.
- A model of Random Forest was trained on the training dataset using the best parameters. The model was tested on an unseen test dataset to achieve an accuracy of 81% over all outcomes.
-
Notifications
You must be signed in to change notification settings - Fork 0
zubayerkader/COVID-Outcome-Predictor
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Data Preprocessing
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published