Skip to content

Analyzing music streaming service data & predicting churn users.

Notifications You must be signed in to change notification settings

kshitijjagatkar/Project_Sparkify

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Project_Sparkify

This project is about sparkify music streaming service which is a hypothetical concept of music streaming platform, user's can listen music on their devices, it is similar kind of service like spotify, pandora etc. Our main goal of the project is to find the user's who left the service or the user who is going to churn.

Read my blog on this project as well here

  1. Project Motivation
  2. Project Summary
  3. Project Components
  4. File Descriptions
  5. Installation & Run
  6. Results

1. Project Motivation

I recently gained some knowladge about Spark and it's libraries, I found out this project can help me build an overall intution about spark & it's environment. I applied these skills to analyze the data which proveded by the udacity for this project.

2. Project Summary

This project is about sparkify music streaming service which is a hypothetical concept of music streaming platform, user's can listen music on their devices, it is similar kind of service like spotify, pandora etc. we will be analyzing it's data which is pretty interesting. The data is about user's interaction with their service, like user's data contains their gender, location, which songs they listen to, which are the pages they visit, did they upgrade or downgrade & so on, again music data like songs we don't have much data related to songs, again we have log informatin of the user like, when did a user registered with the serive, sessions, is the errors they encounter then what are those errors, timestamp of each action, when did a perticular song was played and all, like this we have a lot explore from this data in this project.

Our main goal of the project is to find the user's who left the service or the user who is going to churn. For that we have all the information almost or we don't, we have to find that in this project pretty much exciting han? So, without the further go let's dive into it & start working on the data. To make our path easy we have set small milestones as follows which helps us understand the high level of the process.

3. Project Components

This project basically divided into sections as follows:

  • Business Understanding
  • Data Understanding
  • Data Preprocessing
  • Feature Engineering
  • Data preparation for Modeling
  • Modeling
  • Evaluating Model
  • Displaying Result
  • Justification & Conclusion

4. File Structure & Description

  • Data

    | - data/data_link.txt : link of the data is provided

    | - Project_Sparkify.ipynb : main notebook file.

    *README.md : this files has all description of the project.

5. Installation & Run

This project needs the spark installed on your machine if you are planning to run on your local machine along with that anaconda distribution for notebook functionality & that's all for the run pyspark & you are good to go on local env. I have also included google colab code if you are trying to run it on cloud, you will only need the data file.

About

Analyzing music streaming service data & predicting churn users.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published