Using PySpark to perform the Data Tranformations and Data Cleaning on the Soccer Data and finding the patterns in the data to answer the Analytical Questions.
- Who are the winners of the D1 division in the Germany Football Association (Bundesliga) between 2000–2010?
- Does Oktoberfest have any effect on the performance of the overall league?
- Which season of Bundesliga was the most competitive in the last decade?
- What's the best month to watch Bundesliga?