Skip to content

Latest commit

 

History

History
23 lines (18 loc) · 488 Bytes

README.md

File metadata and controls

23 lines (18 loc) · 488 Bytes

e2e-analytics-streaming

Description

This project is an end to end streaming data pipeline fro an ecommerce data.

The tools used in this projects are

  • FastApI
  • Apache Kafka
  • Apache Spark
  • MongoDB
  • Streamlit
  • Docker

Starting the application

Download the data from https://www.kaggle.com/datasets/carrie1/ecommerce-data

  1. Start the docker service Run docker-compose up

  2. Run the jupyterlab notebook http://localhost:8888/

  3. Run the client python3 client.py