Skip to content

Latest commit

 

History

History
67 lines (35 loc) · 3.66 KB

README.md

File metadata and controls

67 lines (35 loc) · 3.66 KB

Trino and Backblaze B2

Getting Started with Trino and Backblaze B2

Welcome to the Getting Started with Trino and Backblaze B2 Cloud Storage tutorial repository!

Currently, this repository contains two tutorials with preconfigured Docker Compose environments:

We anticipate adding more environments showing different aspects of integrating Trino with Backblaze B2, for example, using the Iceberg table format with the Iceberg REST catalog rather than the Hive metastore.

Why Trino and Backblaze B2?

Trino is an open-source distributed SQL query engine designed to query large data sets distributed over one or more heterogeneous data sources. As such, Trino can query data lakes containing files of a variety of formats, including Parquet and ORC, residing in any of a number of storage systems, including cloud object stores such as Backblaze B2 Cloud Storage, using either of the Hive or Iceberg table formats.

As a S3-compatible cloud object store, Backblaze B2 combines with Trino to provide cost-effective, high performance, analytics for large data sets.

Prerequisites

In order to use this repository you need to have Docker installed to run your service containers. Check if you have Docker installed by running docker --version. If Docker isn't found, please install Docker before trying to run these tutorials. If you're on macOS, Linux or Windows, you can install Docker Desktop. Alternatively, if you're on Linux, you can install the Docker Engine.

Layout

The first level of directories in this repo are generally organized by connector. The second level of directories contain the actual environment and documents. In order to run the environment, you need to be in one of the directories that has a docker-compose.yml file in it.

Helpful Docker commands

Start Services

docker compose up -d

Stop Services

docker compose stop

Clean Services

cleans images, containers, and network

docker system prune --all --force

cleans volumes

docker volume prune --force

Show Service Images

docker images

Login to Container

docker container exec -it <container_id> /bin/bash

Show Service Logs

docker logs <container_id>

List Services

docker container ls

List Service Process information

docker compose ps

Trino trademark and other legal notices.