Skip to content

stackql/pystackql

Repository files navigation

"stackql logo"

PyStackQL - Python Wrapper for StackQL

Documentation Status PyPI PyPI - Downloads

StackQL is an open source developer tool which allows you to query and interact with cloud and SaaS provider APIs using SQL grammar. StackQL can be used for cloud inventory analysis, cloud cost optimization, cloud security and compliance, provisioning/IaC, assurance, XOps, and more.

PyStackQL is a Python wrapper for StackQL which allows you to use StackQL within Python applications and to use the power of Python to extend StackQL. PyStackQL can be used with pandas, matplotlib, plotly, jupyter and other Python libraries to create powerful data analysis and visualization applications.

For detailed documentation, including the API reference, see Read the Docs.

Installing PyStackQL

PyStackQL can be installed with pip as follows:

pip install pystackql

You can install from source by cloning this repository and running a pip install command in the root directory of the repository:

git clone https://github.com/stackql/pystackql
cd pystackql
pip install .

Using PyStackQL

The following example demonstrates how to run a query and return the results as a pandas.DataFrame:

from pystackql import StackQL
import pandas as pd
region = "ap-southeast-2"
stackql = StackQL()

query = """
SELECT instanceType, COUNT(*) as num_instances
FROM aws.ec2.instances
WHERE region = '%s'
GROUP BY instanceType
""" % (region)

res = stackql.execute(query)
df = pd.read_json(res)
print(df)

Using PyStackQL with Jupyter Notebook

To use the integrated Jupyter magic commands provided by PyStackQL:

  1. Load the Extension:
%load_ext pystackql
  1. Execute a Query Using Line Magic:
%stackql SHOW SERVICES IN azure
  1. Or Using Cell Magic:
%%stackql
SELECT status, count(*) as num_instances
FROM google.compute.instances
WHERE project = '$project'
AND zone = '$zone'
GROUP BY status

You can find more examples in the stackql docs or the examples in readthedocs.

Supported Operating Systems

PyStackQL (and StackQL) are supported on:

  • MacOS (arm and amd)
  • Linux
  • Windows

Supported Python Versions

PyStackQL has been tested on:

  • Python 3.7
  • Python 3.8
  • Python 3.9
  • Python 3.10
  • Python 3.11
  • Python 3.12 (MacOS and Linux only)

Licensing

PyStackQL is licensed under the MIT License. The license is available here

Building the docs

To build the docs, you will need to install the following packages:

pip install sphinx sphinx_rtd_theme sphinx-autodoc-typehints

Then, from the root directory of the repository, run:

cd docs
make html

The docs will be built in the docs/build/html directory.

Building the package

To build the package, you will need to install the following packages:

pip install setuptools wheel twine

Then, from the root directory of the repository, run:

rm -rf dist/*
python3 setup.py sdist bdist_wheel

The package will be built in the dist directory.

Testing Locally

Before testing, ensure you have all the required packages installed:

pip install -r requirements.txt
pip install psycopg

Once the dependencies are installed, you can run the tests using the provided script:

sh run_tests

This script sets up the necessary environment variables and then runs the unit tests.

Note: Make sure to set up the environment variables in the tests/creds/env_vars/test.env file or supply them in another way before running the tests. The tests may require specific configurations or access keys to connect to services.

For better isolation and reproducibility, consider using a virtual environment:

python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt

Once you're done testing, you can deactivate the virtual environment:

deactivate

Publishing the package

To publish the package to PyPI, run the following command:

twine upload --config-file .pypirc dist/*