Udacity_Customer-Segmentation

Analysis of demographics data for customers of a mail-order sales company in Germany, including a comparison against demographics information for the general population. The goal was to identify the parts of the population that best describe the company's core customer base. The model was then applied to a third dataset with demographics information to predict the response rate of possible future customers.
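
To make the comparison between customers and the general population more concrete, here is a minimal sketch of one way to quantify it. It assumes that every person in both datasets has already been assigned a segment (cluster) label by some earlier step. The function names `cluster_shares` and `representation_ratio` are hypothetical and are not taken from the helper modules in this repository.

```python
from collections import Counter


def cluster_shares(labels):
    """Return the fraction of entries that fall into each cluster label."""
    counts = Counter(labels)
    total = len(labels)
    return {cluster: n / total for cluster, n in counts.items()}


def representation_ratio(population_labels, customer_labels):
    """Compare customer share to population share for each cluster.

    A ratio above 1 means the cluster is over-represented among customers,
    a ratio below 1 means it is under-represented.
    Only clusters that occur in the population are reported.
    """
    pop = cluster_shares(population_labels)
    cust = cluster_shares(customer_labels)
    return {c: cust.get(c, 0.0) / share for c, share in sorted(pop.items())}


# Toy labels for illustration only (not project data).
population = [0, 0, 1, 1]
customers = [0, 0, 0, 1]
ratios = representation_ratio(population, customers)
```

Clusters with a ratio well above 1 would be candidates for the core customer base under this assumption.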

1. Installation

The main analysis was conducted in a Jupyter Notebook running in a conda environment with a Python 3 kernel (version 3.6).

The main libraries used were:

NumPy (1.19.1)
threadpoolctl (2.1.0)
scikit-learn (0.23.2)
imbalanced-learn (0.7.0)
graphviz (0.14.1)
XGBoost (1.2.0)
pandas
Matplotlib
seaborn
pickle (part of the Python standard library)

Anaconda should provide the essential libraries. XGBoost, graphviz, and imbalanced-learn have to be installed via !pip. scikit-learn may have to be updated to the version listed above. To do so, run the following command in a notebook cell:

!pip install scikit-learn==0.23.2
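
After installing, it can help to confirm which versions are actually active in the kernel. The following is a minimal sketch, not part of the repository: the helper names `installed_version` and `version_report` are hypothetical, and the dictionary keys are the import names of the libraries listed above (for example `sklearn` for scikit-learn and `imblearn` for imbalanced-learn).

```python
import importlib

# Versions listed in this README, keyed by import name.
EXPECTED = {
    "numpy": "1.19.1",
    "threadpoolctl": "2.1.0",
    "sklearn": "0.23.2",
    "imblearn": "0.7.0",
    "graphviz": "0.14.1",
    "xgboost": "1.2.0",
}


def installed_version(module_name):
    """Return the module's __version__ string, or None if it is unavailable."""
    try:
        module = importlib.import_module(module_name)
    except Exception:
        # Covers a missing package as well as a package that fails to load.
        return None
    return getattr(module, "__version__", None)


def version_report(expected):
    """Map each module name to (installed, expected, status)."""
    report = {}
    for name, wanted in expected.items():
        found = installed_version(name)
        if found is None:
            status = "missing"  # not importable, or no __version__ attribute
        elif found == wanted:
            status = "ok"
        else:
            status = "differs"
        report[name] = (found, wanted, status)
    return report


if __name__ == "__main__":
    for name, (found, wanted, status) in version_report(EXPECTED).items():
        print(f"{name:15s} installed={found} expected={wanted} -> {status}")
```

A status of "differs" does not necessarily mean the notebook will fail; it only flags a deviation from the versions the analysis was run with.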

2. File Description and Folder Structure

├── 01_proposal
│   └── capstone_proposal.md -----------------------------# PROVIDES THE INITIAL PROPOSAL ABOUT THE ANALYSIS, STEPS, ALGORITHM USED,....
├── 02_images --------------------------------------------# PROVIDES THE IMAGES USED IN THE PROPOSAL AND IN THE PROJECT REPORT
│   ├── Customer_Distribution.PNG
│   ├── PCA_analysis.png
│   ├── XGBoost_trained.png
│   └── ....
├── 03_dataset information
│   ├── DIAS Attributes - Values 2017.xlsx ---------------# PROVIDES INFORMATION ON EACH COLUMN IN THE DATASET INCL. GIVEN ATTRIBUTES 
│   └── DIAS Information Levels - Attributes 2017.xlsx ---# PROVIDES A HIGH-LEVEL CLUSTERING ACROSS ALL >300 COLUMNS IN THE DATASETS
├── README.md --------------------------------------------# README FILE PROVIDING GENERAL INFORMATION ON REPOSITORY STRUCTURE AND INSTALLATION REQUIREMENTS
├── helper_classification.py -----------------------------# HELPER FUNCTIONS TO PROCESS, VISUALIZE AND TRAIN SUPERVISED LEARNING MODEL AND CONNECTED DATASETS
├── helper_segmentation.py -------------------------------# HELPER FUNCTIONS TO PROCESS, VISUALIZE AND TRAIN UNSUPERVISED LEARNING MODEL AND CONNECTED DATASETS
├── project_report.md ------------------------------------# DETAILED DOCUMENTATION OF ANALYSIS RESULTS INCL. VISUALIZATIONS AND INTERPRETATION 
└── Arvato Project Workbook.ipynb ------------------------# MAIN WORKBOOK FOR ANALYSIS, DOCUMENTATION OF ALL CODE, STEPS AND RESULTS

3. Results

The main findings of the analysis and the corresponding code can be found in project_report.md.
