Alexa and Big Data
is an ETL and analysis project that:
- sets-up and utilizes cloud functionality from Amazon Web Services (AWS) for two production-ready tables
- analyzes 1.5M data points to evaluate Amazon's Vine reviews
Datasets from Amazon.
- Spark / Pyspark
- AWS (S3, RDS)
- Setting up AWS functionalities (buckets, RDS)
- Data preprocessing for cloud-based computing / analysis