TDT4225 — Large, Distributed Data Volumes
This assignment looks at an open dataset of trajectories, and the repository contains code that sets up the MongoDB database together with different queries. The program is inspired by the social media workout application Strava, where users can track activities like running, walking, biking etc and post them online with stats about their workout.
- Python
- Docker
- Clone the repository
- Add the dataset from Blackboard to the
data
folder - Run
docker-compose up --build
in the root directory - Install the requirements with
pip install -r requirements.txt
- Run
src/migrate.py
to migrate the dataset to the database