PrestaCop

This project was realized within the course Functional Data Programming at Efrei Paris.

Problem Description

PrestaCop is a company that wants to create a drone service to help the police issue parking tickets.

Each drone regularly sends messages containing:

  • drone location
  • time
  • drone id

If a violation occurred, the message contains the following two additional fields:

  • violation code
  • image id

When a message's violation code indicates that human interaction is required (about 1% of the time), an alarm must be sent to a human operator.

With all this data, PrestaCop wants to compute statistics and improve its services. To enrich those statistics, historical NYPD data should be used. However, the NYPD side poses two constraints:

  • The computers are old and not very powerful
  • The data is stored in a large CSV

Architecture

The basic part of this project consists of the following five services, a stream, and a storage solution:

  • csv-to-stream: Reads the NYPD CSV and publishes the rows as messages to the stream
  • drone-simulator: Simulates a drone, sends messages of different forms to the stream
  • alert-system: Consumes stream messages, raises an alarm when human interaction is required
  • stream-to-storage: Consumes stream messages, stores them
  • analysis: Reads messages out of storage, performs the analysis

The following picture depicts how the components work together: [Architecture diagram]

The following picture depicts our current AWS architecture: [AWS architecture diagram]

Data Model

As described above, the drone sends messages with 3 or 5 fields. We use a Scala case class Message to represent this data format:

case class Message(
                    location: String,
                    time: String,
                    droneId: String,
                    violationCode: Option[String] = None,
                    violationImageId: Option[String] = None
                  )
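
For illustration, both message forms can be constructed like this (all field values below are made-up examples, not real data):

// Regular message with the 3 mandatory fields (values are made up)
val regular = Message(
  location = "40.7128,-74.0060",
  time = "2020-01-15T10:30:00Z",
  droneId = "drone-42"
)

// Violation message with the two additional fields filled in
val violation = regular.copy(
  violationCode = Some("21"),
  violationImageId = Some("img-1337")
)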

Set up development environment

We are implementing this project in Scala, using a functional programming approach.

Use command-line tools

Use an IDE, e.g. IntelliJ IDEA

  • Install a JDK, e.g. JDK 1.8
  • Download and install Intellij IDEA
  • Install Scala and SBT plugins
  • Import project as an SBT project (only available after installing the SBT plugin)
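
Whichever setup you use, the project builds with sbt. A minimal build.sbt sketch (the project name and versions here are assumptions, adjust them to the actual build):

name := "prestacop"
version := "0.1.0"
scalaVersion := "2.12.10"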

Set up AWS environment

  • Open your AWS console
  • Create a new IAM user that is able to write/read to/from Kinesis and S3
  • To your home directory, add a file named ".aws.properties" with the following content (add the credentials of the created user):
[default]
accessKey=addAccessKeyHere
secretKey=addSecretKeyHere
  • Create a Kinesis stream with one shard, called "prestacop"
  • Create an S3 bucket and change the code to use this bucket. Environment variables or command-line arguments to specify it will be added later.

Attention! Remember to always delete the Kinesis stream after you have finished testing, as keeping it running will incur costs.
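
As a sketch of how a service could pick up these credentials and talk to the stream (this assumes the AWS SDK for Java v1 on the classpath and the Paris region; it is not the project's actual wiring):

import java.io.File
import com.amazonaws.auth.{AWSStaticCredentialsProvider, PropertiesCredentials}
import com.amazonaws.services.kinesis.AmazonKinesisClientBuilder

// Read accessKey/secretKey from ~/.aws.properties (format shown above)
val credentials = new PropertiesCredentials(
  new File(System.getProperty("user.home"), ".aws.properties"))

// Build a client for the "prestacop" Kinesis stream (the region is an assumption)
val kinesis = AmazonKinesisClientBuilder.standard()
  .withCredentials(new AWSStaticCredentialsProvider(credentials))
  .withRegion("eu-west-3")
  .build()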

Set up Kafka environment

  • Download Kafka (link)
  • Start Kafka and Zookeeper
  • Create "test" stream

Thoughts on Terraform

We need to implement the following steps with Terraform:

  • Choose a region for everything
  • Create an IAM user that has DynamoDB, S3, and Kinesis full access
  • Store the AccessKey and SecretKey in a file called ".aws.properties" in the home folder of the user
  • Create a Kinesis stream with one shard
  • Create an S3 bucket with a unique name for archiving
  • Create a DynamoDB table with the primary key column "id" and TTL enabled
  • Create an SNS topic for our alert service; if possible, create a subscription with an email address
  • Create 3 EC2 instances for our services to be deployed on

Deploy our 3 services (Stream2Storage, Alert, Analysis) to AWS (probably onto the Terraform-created EC2 instances with a script); look at Ansible for the deployment.

The region, Kinesis stream name, SNS topic ARN, and DynamoDB table name must later be given as command-line parameters to our services!
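
A minimal sketch of how a service could read these parameters (the argument order and error message are assumptions, not the final interface):

// Expected arguments: <region> <kinesis-stream> <sns-topic-arn> <dynamodb-table>
def parseArgs(args: Array[String]): (String, String, String, String) = args match {
  case Array(region, stream, topicArn, table) => (region, stream, topicArn, table)
  case _ => sys.error("usage: <region> <kinesis-stream> <sns-topic-arn> <dynamodb-table>")
}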

To Do...

  • Slides for the final presentation (started: https://drive.google.com/open?id=1vLUkIZvWxExHNiCq2eZpneHbTwLoO8Mj)

  • Deployment of whole project to AWS: Terraform/ Ansible (Florian + Céline)

  • Create a manual on how to test the whole project for the prof (Florian + Céline)

  • Change Storage from S3 to DynamoDB (Lea)

  • Implement an alarm solution: email via SNS (Lea). DONE: if you want to receive emails, subscribe to the prestacop topic

  • Refactor all services, extract methods/classes and such (Céline, Florian)

  • Provide proper data cleansing on CSV (I think this is not necessary atm)

  • Add command line parameters or environment variables to all our services except Analysis (not started yet) (Lea)

  • Update all readmes with information on the command line parameters and functionality

  • Implement the Spark Analysis (Colombe)

  • Think about analysis that can be performed on the data (All)

  • Think about an archiving mechanism (Florian) --> no longer planned

Questions to ask

  • Can we do some services in a language other than Scala? Example: the alert service (AWS Lambda can't be used in Scala)

    -> Everything can be done with Scala + Terraform. If we really have a problem regarding a service, we can send him an email.

    -> If not in Scala, in Java then (compile Scala into Java).

  • Ask about data cleaning

    -> The program should not crash if the CSV isn't clean

    -> Do not send the bad lines

  • Question: At what interval should our drone simulator send which messages? It says 1% of messages are alarms.

    -> We are not required to use a fixed interval. A simulator sending 20 messages with 1 alarm is enough (see the sketch after this list).

  • Question: Should the CSV-to-Stream be deployed to AWS?

    -> Nope, one-time things are OK

  • Question: is displaying the alerts in an EC2 terminal enough?

    -> He will send email himself

    -> There is a log analytics service (CloudWatch) that might work for EC2

  • Question: is it OK to ssh into the EC2 instance and look at the results of the Spark analysis?

    -> use EMR...

  • Question: is it OK if the user sshes into the EC2 instance and starts the Spark analysis himself?

    -> use EMR, and the user will launch the analysis from the AWS website

  • Presentation: should we show that Terraform is working?

    -> show that the AWS setup is clean, run Terraform, and 5 minutes later show that the whole architecture has been created

  • Presentation: can we run the services locally instead of on EC2?

    -> not locally, on the cloud

  • Firehose OR a self-created service that does stream-to-storage is fine
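
Following up on the simulator question above, a minimal sketch of such a run, reusing the Message case class from the Data Model section (field values and the 1-in-20 ratio are illustrative only):

import scala.util.Random

object DroneSimulatorSketch extends App {
  (1 to 20).foreach { i =>
    // Base message with the 3 mandatory fields (values are made up)
    val base = Message(
      location = "40.7128,-74.0060",
      time = java.time.Instant.now.toString,
      droneId = "drone-42")
    // Roughly 1 message in 20 carries a violation and the two extra fields
    val msg =
      if (Random.nextInt(20) == 0)
        base.copy(violationCode = Some("21"), violationImageId = Some(s"img-$i"))
      else base
    println(msg) // the real service would publish this to the stream instead
  }
}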
