Skip to content

In this project, I used the OSEMN data science workflow to obtain, scrub, explore, model, and interpret a King County dataset with a multivariate linear regression to predict the sale price of houses.

License

Notifications You must be signed in to change notification settings

andrewwongls/dsc-1-final-project-online-ds-sp-000

 
 

Repository files navigation

###########################################

Author:

Andrew Wong

Email: [email protected]

Medium Blog: https://medium.com/human-science-ai

###########################################

I have leveraged on the data science workflow OSEMN (Obtain, Scrub, Model, Explore, and Interpret) as the basis of this project.

Making Sense of Houses Sale Price in King County

Key Questions

What are the predictive power of independent variables such number of house viewing, number of bedrooms, zipcode and others on the house sale price of a house?

The Behavioural Questions - In House Price Context

  1. What type of lifestyle choices of house buyers prefer?
  2. What are the highly desirable zipcode or geography in King County?
  3. What are the key house design characteristics that are highly attractive to house buyers in King County?

The Trend Questions - In House Price Context

  1. What are the latest house price trends in King County vs whole of US?
  2. What are the trends in housing construction?
  3. What is driving demand and supply of bigger or smaller house?

The Dataset

The King County House Sales dataset. We've modified the dataset to make it a bit more fun and challenging. The dataset can be found in the file "kc_house_data.csv", in this repo.

About

In this project, I used the OSEMN data science workflow to obtain, scrub, explore, model, and interpret a King County dataset with a multivariate linear regression to predict the sale price of houses.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 100.0%