Project_Sparkify

This project is about sparkify music streaming service which is a hypothetical concept of music streaming platform, user's can listen music on their devices, it is similar kind of service like spotify, pandora etc. Our main goal of the project is to find the user's who left the service or the user who is going to churn.

Read my blog on this project as well here

Project Motivation
Project Summary
Project Components
File Descriptions
Installation & Run
Results

1. Project Motivation

I recently gained some knowladge about Spark and it's libraries, I found out this project can help me build an overall intution about spark & it's environment. I applied these skills to analyze the data which proveded by the udacity for this project.

2. Project Summary

This project is about sparkify music streaming service which is a hypothetical concept of music streaming platform, user's can listen music on their devices, it is similar kind of service like spotify, pandora etc. we will be analyzing it's data which is pretty interesting. The data is about user's interaction with their service, like user's data contains their gender, location, which songs they listen to, which are the pages they visit, did they upgrade or downgrade & so on, again music data like songs we don't have much data related to songs, again we have log informatin of the user like, when did a user registered with the serive, sessions, is the errors they encounter then what are those errors, timestamp of each action, when did a perticular song was played and all, like this we have a lot explore from this data in this project.

Our main goal of the project is to find the user's who left the service or the user who is going to churn. For that we have all the information almost or we don't, we have to find that in this project pretty much exciting han? So, without the further go let's dive into it & start working on the data. To make our path easy we have set small milestones as follows which helps us understand the high level of the process.

3. Project Components

This project basically divided into sections as follows:

Business Understanding
Data Understanding
Data Preprocessing
Feature Engineering
Data preparation for Modeling
Modeling
Evaluating Model
Displaying Result
Justification & Conclusion

4. File Structure & Description

Data

| - data/data_link.txt : link of the data is provided

| - Project_Sparkify.ipynb : main notebook file.

*README.md : this files has all description of the project.

5. Installation & Run

This project needs the spark installed on your machine if you are planning to run on your local machine along with that anaconda distribution for notebook functionality & that's all for the run pyspark & you are good to go on local env. I have also included google colab code if you are trying to run it on cloud, you will only need the data file.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
Data		Data
Project_Sparkify.html		Project_Sparkify.html
Project_Sparkify.ipynb		Project_Sparkify.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project_Sparkify

1. Project Motivation

2. Project Summary

3. Project Components

4. File Structure & Description

5. Installation & Run

About

Releases

Packages

Languages

kshitijjagatkar/Project_Sparkify

Folders and files

Latest commit

History

Repository files navigation

Project_Sparkify

1. Project Motivation

2. Project Summary

3. Project Components

4. File Structure & Description

5. Installation & Run

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages