Dummy Data Creator

This project creates dummy data for seeding machine learning pipelines before scaling up. It is designed to be deployed on AWS using Fargate.

The project consists of:

- An API and schemas built with FastAPI
- Python scripts for creating dummy data. Supported data types:
  - Numerical Range
  - Float Range
  - Date Range
  - Categorical Random Choice
  - Random Text Generation
- Unit tests and API tests using pytest
- A Dockerfile
- A CI/CD script for CircleCI, which requires the following variables to be defined:
  - AWS_ACCESS_KEY_ID
  - AWS_ACCOUNT_ID
  - AWS_DEFAULT_REGION
  - AWS_SECRET_ACCESS_KEY
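
To make the supported data types concrete, here is a minimal sketch of how one generator per type might look. It uses only the Python standard library and is not the project's actual code: the function names, the field names, and `make_records` are hypothetical, and the `MAX_RECORDS` cap is an assumption based on the 10,000 record limit mentioned under Future Improvements.

```python
import random
import string
from datetime import date, timedelta

# Assumed cap, inferred from the README's mention of a 10,000 record limit.
MAX_RECORDS = 10_000


def numerical_range(rng, low, high):
    """Numerical Range: a random integer between low and high, inclusive."""
    return rng.randint(low, high)


def float_range(rng, low, high):
    """Float Range: a random float between low and high."""
    return rng.uniform(low, high)


def date_range(rng, start, end):
    """Date Range: a random date between start and end, inclusive."""
    span_days = (end - start).days
    return start + timedelta(days=rng.randint(0, span_days))


def categorical_choice(rng, choices):
    """Categorical Random Choice: one value picked from a fixed list."""
    return rng.choice(choices)


def random_text(rng, length):
    """Random Text Generation: a string of random lowercase letters."""
    return "".join(rng.choices(string.ascii_lowercase, k=length))


def make_records(n, seed=None):
    """Build n dummy records, one field per supported data type."""
    if not 0 < n <= MAX_RECORDS:
        raise ValueError(f"n must be between 1 and {MAX_RECORDS}")
    rng = random.Random(seed)  # a seed makes the output reproducible
    return [
        {
            "age": numerical_range(rng, 18, 90),
            "score": float_range(rng, 0.0, 1.0),
            "signup": date_range(rng, date(2020, 1, 1), date(2020, 12, 31)),
            "plan": categorical_choice(rng, ["free", "pro", "team"]),
            "note": random_text(rng, 12),
        }
        for _ in range(n)
    ]
```

Calling `make_records(5, seed=1)` returns a list of five dictionaries. Passing the same seed again gives the same records, which helps when the data feeds repeatable pipeline tests.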

Future Improvements:

- Specific text generation
- Record limit increase above 10,000
- DB audit trails
- Latency improvements

Repository contents:

- .circleci
- app
- .gitignore
- Dockerfile
- Makefile
- README.md
- requirements.txt
- run.sh

matthewgalloway/dummy-data-API
