USF BSDS 100: Intro to Data Science with R

Case Study: SFO Air Traffic Passenger Statistics

Team Members

Introduction

This group project visualizes passenger statistics for San Francisco International Airport (SFO), using the dataset published by the SF Airport Commission through DataSF. We used R for scripting and ggplot2 for visualization. The project is part of the coursework for the BSDS 100: Intro to Data Science with R class at the University of San Francisco.

The dataset, sourced from DataSF, covers July 2005 through December 2017 and records destinations, origins, airlines, terminals, and passenger counts. It contains 17,959 rows and 12 columns. The dataset is available here and the data dictionary is available here.

Procedures

For this project, we first cleansed the data to fix incorrect and inconsistent entries. We then created several visualizations with ggplot2 that aim to provide insight into the following aspects of SFO:

  1. Average monthly passenger traffic between 2006 and 2017
  2. Passenger traffic by destination/origin region
  3. Overview of passenger traffic by domestic airline
  4. Passenger traffic carried by low-cost carriers
  5. Passenger traffic by airport terminal
  6. Passenger traffic for one selected domestic carrier
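
As a rough illustration of item 1, the sketch below computes an average monthly passenger count and draws it as a bar chart. It is not the project's actual code: the toy data frame, the column names `activity_period` and `passenger_count`, and the YYYYMM period format are all assumptions made for the example. The real steps are in the rmd source.

```r
# Toy stand-in for the DataSF extract. Column names and values are
# hypothetical and chosen only for illustration.
traffic <- data.frame(
  activity_period = c(200601, 200601, 200602, 200701, 200702, 200702),
  passenger_count = c(100, 50, 120, 200, 90, 30)
)

# Split the assumed YYYYMM period into year and month.
traffic$year  <- traffic$activity_period %/% 100
traffic$month <- traffic$activity_period %% 100

# Step 1: total passengers for each month of each year.
monthly_totals <- aggregate(passenger_count ~ year + month,
                            data = traffic, FUN = sum)

# Step 2: average those totals across years for each calendar month.
monthly_avg <- aggregate(passenger_count ~ month,
                         data = monthly_totals, FUN = mean)
names(monthly_avg)[2] <- "avg_passengers"

# Build the bar chart only if ggplot2 is installed.
if (requireNamespace("ggplot2", quietly = TRUE)) {
  library(ggplot2)
  p <- ggplot(monthly_avg,
              aes(x = factor(month, levels = 1:12, labels = month.abb),
                  y = avg_passengers)) +
    geom_col() +
    labs(x = "Month", y = "Average passengers")
  # Call print(p) to render the chart.
}
```

Summing within each year-month before averaging keeps multiple rows for the same month (for example, one per airline) from dragging the average down.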

Visualizations

  • Bar Chart: Monthly Average Passenger Traffic between 2006 and 2017

    Screenshot

  • World Map: Passenger Traffic by International Destinations

    Screenshot

  • Stacked Line Chart: Domestic Passenger Traffic by Airline

    Screenshot

  • Stacked Bar Chart: Annual Passenger Traffic on International Low-Cost Carriers

    Screenshot

  • Tree Map: Domestic Passenger Traffic by Airline and Terminal

    Screenshot

  • Heatmap: Passenger Traffic of United Airlines

    Screenshot

The detailed report can be viewed in PDF format. The R Markdown (rmd) source code is available here, and the presentation slides are available here.