Skip to content

This project utilizes Multiple Linear Regression (MLR) to predict house prices in King County, Washington, USA, using historical sales data. The dataset includes various features such as the number of bedrooms, square footage, views, and grades, with the goal of accurately predicting house prices.

Notifications You must be signed in to change notification settings

TeomanEgeSelcuk/R-Housing-Price-Predictions

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

R Project on KC House Sales Data Analysis

This repository contains the R project analysis of the KC House Sales dataset, aimed at predicting house sales in King County, Washington State, USA, utilizing Multiple Linear Regression (MLR). The data, sourced from Kaggle datasets under the name "KC_Housesales_Data", comprises historical data of houses sold between May 2014 to May 2015.

Data Source

The dataset can be found at: KC House Sales Data on Kaggle

Project Overview

The project involves the following key steps:

  • Data preprocessing and exploration to understand the dataset characteristics.
  • Exploratory Data Analysis (EDA) to identify patterns, outliers, and relationships between variables.
  • Data visualization to support EDA findings.
  • Building a Multiple Linear Regression model to predict house prices.
  • Evaluating model performance and comparing it with a one-layer forward neural network as a reference.

Libraries and Tools Used

  • tidyverse for data manipulation and visualization.
  • corrplot for visualizing correlations between variables.
  • lubridate for date-time manipulation.
  • caTools, GGally, caret, and leaps for various stages of model building and evaluation.

Instructions for Use

To replicate or extend the analysis:

  1. Download the dataset from Kaggle.
  2. Install the required R libraries mentioned above.
  3. Run the R Markdown file myproject.Rmd for step-by-step execution of the analysis.

Acknowledgments

Data provided by Kaggle datasets. Analysis and model building by Teoman Selcuk as part of the MTH 404 R Project.

*Note: For detailed understanding and insights, users are encouraged to go through the R Markdown file myproject.Rmd.

About

This project utilizes Multiple Linear Regression (MLR) to predict house prices in King County, Washington, USA, using historical sales data. The dataset includes various features such as the number of bedrooms, square footage, views, and grades, with the goal of accurately predicting house prices.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages