Skip to content

Final project for CS 4661 - Introduction to Data Science

License

Notifications You must be signed in to change notification settings

250MHz/smoker-status

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

48 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

smoker-status

Final project for CS 4661 - Introduction to Data Science.

Getting Started

Install uv which we use for project management.

This project uses data from Kaggle's Playground Series, Binary Prediction of Smoker Status using Bio-Signals. After accepting the competition rules, download the data and store the .csv files in a data/raw directory:

└── data
    └── raw            <- The original, immutable data dump.
        ├── sample_submission.csv
        ├── test.csv
        └── train.csv

Open a Jupyter notebook in the notebooks/ directory with:

uv run jupyter notebook notebooks

Developing

Store Jupyter notebooks in the notebooks/ directory. Follow the naming convention for notebooks used by Cookiecutter Data Science.

You should add a cell at the top of notebooks with the following:

%load_ext autoreload
%autoreload 2

This should make code from the smoker_status module importable.

Before commiting, use ruff to format your Python code:

uvx ruff format

About

Final project for CS 4661 - Introduction to Data Science

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •