This tool is designed for academic research purposes only. It collects common questions and concerns from the Kubernetes community in order to understand trends in security concerns that need to be researched and addressed or to identify other areas for further research in this field.
These community discussions are collected across the following web properties:
- Stackoverflow (kubernetes tag)
- ServerFault (kubernetes tag)
- Kubernetes Community Forum
The data is collected and stored in csv files found in this repository.
This is a Python project configured with Poetry.
In order to start using this tool you can install all the dependencies and run the scripts as specified below.
- Run
poetry install
- Run
poetry shell
- Run
python ./k8s_scrape/main.py
- A browser may pop up prompting for a captcha, solve this captcha in < 30 seconds
- Be patient and let it work