Skip to content

This repository contains three machine learning projects: Sales Prediction, Titanic Survival Prediction, and Credit Card Fraud Detection. Full executive summaries can be accessed via google drive inside the repository.

Notifications You must be signed in to change notification settings

HiroshiJoe/CODSOFT

Repository files navigation

CODSOFT Data Science Projects Repository

By: Heroshi Joe Abejuela (Data Science Intern at CODSOFT)

Overview

This repository contains three key projects completed during my internship at CODSOFT. Each project applies various machine learning and statistical techniques to solve real-world problems. Below, you will find a brief description of each project along with links to the corresponding code and results. The full executive summaries of the projects are available for download here.

Projects

1. Sales Prediction Using Linear Regression

This project involves predicting product sales based on advertising expenditures across TV, radio, and newspapers. By applying linear regression, we explore the relationships between advertising channels and their impact on sales. The model was evaluated based on metrics such as Mean Squared Error and R-squared, providing insights into which advertising channel contributed most to sales.

Key Techniques:

  • Linear Regression Model
  • Model evaluation using MSE and R²

2. Titanic Survival Prediction

The Titanic survival prediction project uses machine learning algorithms to predict passenger survival during the Titanic disaster based on features like age, gender, and ticket class. We applied Logistic Regression, Random Forest, and Gradient Boosting models and compared their performance to identify the best-performing model.

Key Techniques:

  • Logistic Regression, Random Forest, and Gradient Boosting
  • Model evaluation using accuracy, F1-score, and Kappa score

3. Credit Card Fraud Detection

This project focuses on detecting fraudulent credit card transactions from highly imbalanced datasets. Various machine learning models such as Logistic Regression and Random Forest were applied, along with data balancing techniques to enhance fraud detection. The models were evaluated on accuracy, F1-score, and ROC AUC.

Key Techniques:

  • Logistic Regression, Random Forest
  • Data balancing techniques (oversampling and undersampling)
  • Model evaluation using ROC AUC

Contact

For any queries or feedback, feel free to reach out:

About

This repository contains three machine learning projects: Sales Prediction, Titanic Survival Prediction, and Credit Card Fraud Detection. Full executive summaries can be accessed via google drive inside the repository.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published