Final project for CS 4661 - Introduction to Data Science.
Install uv which we use for project management.
This project uses data from Kaggle's Playground Series, Binary Prediction
of Smoker Status using Bio-Signals. After accepting the competition rules,
download the data and store the .csv files in a data/raw
directory:
└── data
└── raw <- The original, immutable data dump.
├── sample_submission.csv
├── test.csv
└── train.csv
Open a Jupyter notebook in the notebooks/
directory with:
uv run jupyter notebook notebooks
Store Jupyter notebooks in the notebooks/
directory. Follow the naming
convention for notebooks used by Cookiecutter Data Science.
You should add a cell at the top of notebooks with the following:
%load_ext autoreload
%autoreload 2
This should make code from the smoker_status
module importable.
Before commiting, use ruff
to format your Python code:
uvx ruff format