geewynn/e2e-analytics-streaming

e2e-analytics-streaming

Description

This project is an end-to-end streaming data pipeline for ecommerce data.

The tools used in this project are:

  • FastAPI
  • Apache Kafka
  • Apache Spark
  • MongoDB
  • Streamlit
  • Docker

Starting the application

Download the data from https://www.kaggle.com/datasets/carrie1/ecommerce-data

  1. Start the Docker services: run docker-compose up

  2. Run the JupyterLab notebook at http://localhost:8888/

  3. Run the client: python3 client.py
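
For orientation, here is a minimal sketch of what a client for this kind of pipeline might do: read rows from the downloaded CSV, convert them to typed JSON records, and POST each one to the API. It is not the repository's actual client.py. The endpoint URL, the function names, and the column names (taken from the Kaggle dataset's header as commonly published) are all assumptions and should be checked against the real code and data.

```python
import csv
import io
import json
import urllib.request

# Hypothetical endpoint: the path and port exposed by the FastAPI service may differ.
API_URL = "http://localhost:8000/invoiceitem"


def row_to_payload(row: dict) -> dict:
    """Convert one CSV row (all strings) into a typed, JSON-ready record.

    Column names are assumed from the Kaggle ecommerce dataset header.
    """
    return {
        "InvoiceNo": row["InvoiceNo"],
        "StockCode": row["StockCode"],
        "Description": row["Description"],
        "Quantity": int(row["Quantity"]),
        "InvoiceDate": row["InvoiceDate"],
        "UnitPrice": float(row["UnitPrice"]),
        "CustomerID": row["CustomerID"] or None,  # some rows have no customer
        "Country": row["Country"],
    }


def iter_payloads(lines):
    """Yield one payload per CSV row from any iterable of text lines."""
    for row in csv.DictReader(lines):
        yield row_to_payload(row)


def post_payload(payload: dict, url: str = API_URL) -> int:
    """POST a single record as JSON and return the HTTP status code."""
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return response.status


# Small in-memory sample so the sketch runs without the dataset or the services.
SAMPLE = io.StringIO(
    "InvoiceNo,StockCode,Description,Quantity,InvoiceDate,UnitPrice,CustomerID,Country\n"
    "536365,85123A,WHITE HANGING HEART T-LIGHT HOLDER,6,12/1/2010 8:26,2.55,17850,United Kingdom\n"
)
records = list(iter_payloads(SAMPLE))
print(json.dumps(records[0]))
```

To stream the real file, open the downloaded CSV in place of SAMPLE and call post_payload on each record once the Docker services are up. The file may need a non-UTF-8 encoding such as ISO-8859-1 when opened.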
