This repository contains project materials for the Spring 2024 STAT 208 class, specifically for Team 8. All materials are the property of Team 8, University of California, Riverside, A. Gary Anderson School of Management. Please do not republish the materials without authors' consents.
This project is made possible with the support from Professor Brandon Wales UCR @Brandon-Wales
This project is also available to view on Kaggle.com, pay a visit and upvote for us!
For those who are new to this folder, the Project-Code.ipynb
and Project-Code.html
files are our main focus. This project is completed using Python 3.12.0. We are also attaching the documents which we will write for this project. The data is originally obtained from Kaggle.com, link will be attached below. Feel free to explore more options beyond this analysis report.
This project is centered around Customer Segmentation analysis. We aim to provide better insights into customer behaviors and preferences to help businesses tailor their strategies effectively.
This project will follow the template.
Section 1
: Introduction / Why This Topic / Pre-analysisSection 2
: Descriptive AnalysisSection 3
: Model Selection- Part 1: Predictive Modeling
- Part 2: Customer Segmentation Clustering
- Part 3: Customer Segmentation Prediction (Classification)
- Part 4: Feature Importance Analysis for Part 1 Predictive Modeling
Section 4
: Business Insights / SuggestionsSection 5
: Conclusion
Project-Code.ipynb
: Jupyter notebook containing the code for our customer segmentation analysis.Project-Code.html
: HTML export of the Jupyter notebook for easy viewing.Project-Proposals.pdf
: Documents that report our initial ideas, studies, and finding before launching the analysis.Data Folder
: Contains the datasets used for analysis. There aretrain.csv
andtest.csv
.train.csv
will be mainly used in descriptive analysis whiletest.csv
will be mainly used in building models like predictive modeling. Disclaimer: The data is obtained from Kaggle.com Customer Segmentation Dataset published by Abishek Sudarshan. All data are used for educational purposes only. Do not republish original author Sudarshan's work without approval. License: Data files are copyrighted by the original authors.Final-Reports.pdf
: Documentation and reports generated during the project.Final Presentations.pptx
: Slides and other materials for presenting our findings.
To get started with the project, clone this repository to your local machine using the following command:
git clone https://github.com/Jen-uis/Customer-Segmentation-Analysis
- Open the Project-Code.ipynb file in Jupyter Notebook.
- Follow the instructions in the notebook to run the analyses.
- Review the results and insights provided in the output cells.
This project is licensed under the MIT License. See the LICENSE file for more details.
We welcome contributions ONLY from our Team 8. If you are a member of Team 8, please follow these steps to contribute:
- Fork the repository.
- Create a new branch:
git checkout -b feature-branch
- Make your changes and commit them:
git commit -m 'Add some feature'
- Push to the branch:
git push origin feature-branch
- Create a new Pull Request.
Note: If you have successfully push your changes to the branch, Nathaniel will review the final request and merge the changes to main
if your file is approved.
If you have any questions or need further information, please contact our team at: [email protected]
Authors:
- Leader: Nathaniel Zhu @Jen-uis
- Youyi Fu (Chris)
- Xiaoya Wei (Shirley)
- Xujuan Liang (Teresa)