Google Cloud Storage Diagnostics Shipping

HM-Diag can periodically ship hotspot diagnostic statistics to Google Cloud Storage for centralised fleet management and feedback to services such as the Nebra Dashboard. Once the statistics have been ingested into Google Cloud Storage, DataFlow can be used to marshal them into BigQuery for easy consumption with an SQL-like query language.

Configure Google Cloud Storage Buckets

  • Login to your Google Cloud Project and navigate to Google Cloud Storage.
  • Create a new bucket with a sensible, descriptive name, e.g. "hotspot-diagnostics-ingestion", for storing inbound hotspot diagnostics data, with the following properties (a scripted sketch of these steps follows this list):
    • Location Type: A single region is probably fine for development purposes. Multi-region should be considered for production deployments.
    • Storage Class: Standard
    • Access Control: Uniform
    • Protection Tools: None
  • Once the bucket has been created, open it and head to the Lifecycle tab. Add a new rule to delete objects with an age-based condition of 3 days. This auto-deletes ingested data after 3 days; by that time DataFlow should have marshalled the data into BigQuery, so it is no longer required within the GCS bucket.
  • Switch to the Permissions tab. Diagnostics ingestion is currently performed without credentials, so the bucket must be publicly accessible: give the allUsers principal the Storage Legacy Bucket Writer role to allow the hotspot to write to the bucket. The allUsers principal should have no other permissions (e.g. read), otherwise anonymous users could read diagnostics data for all hotspots.
  • Create another bucket for storing DataFlow ingestion scratch space, e.g. "hotspot-ingestion-scratch". This bucket does not require any specific ACL / roles. Google Cloud will deal with this for you later when you configure DataFlow.
  • Upload the contents of the bigquery folder of this repository (bq_funcs.js & bq_schema.json) into the DataFlow scratch space bucket. This will be used later by the DataFlow service.
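
The console steps above can also be scripted. The following is a minimal sketch using the google-cloud-storage Python client; the project ID, bucket names, and location are illustrative placeholders, so adjust them to match your environment:

```python
from google.cloud import storage

# Placeholders -- substitute your own project, bucket names, and location.
PROJECT = "my-gcp-project"
INGESTION_BUCKET = "hotspot-diagnostics-ingestion"
SCRATCH_BUCKET = "hotspot-ingestion-scratch"
LOCATION = "europe-west2"

client = storage.Client(project=PROJECT)

# Ingestion bucket: standard storage, uniform access, delete objects after 3 days.
bucket = storage.Bucket(client, name=INGESTION_BUCKET)
bucket.storage_class = "STANDARD"
bucket.iam_configuration.uniform_bucket_level_access_enabled = True
bucket.add_lifecycle_delete_rule(age=3)
bucket = client.create_bucket(bucket, location=LOCATION)

# Grant allUsers write-only access via the Storage Legacy Bucket Writer role.
policy = bucket.get_iam_policy(requested_policy_version=3)
policy.bindings.append(
    {"role": "roles/storage.legacyBucketWriter", "members": {"allUsers"}}
)
bucket.set_iam_policy(policy)

# Scratch bucket for DataFlow, plus the BigQuery helper files from this repo.
scratch = client.create_bucket(SCRATCH_BUCKET, location=LOCATION)
for path in ("bigquery/bq_funcs.js", "bigquery/bq_schema.json"):
    scratch.blob(path.split("/")[-1]).upload_from_filename(path)
```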

Define GCS Bucket in Balena Fleet

  • Open the fleet of hotspots you'd like to submit diagnostics for and add the following variable to enable diagnostics shipping.
    • DIAGNOSTICS_GCS_BUCKET: the name of the ingestion bucket created above, e.g. helium-miner-data
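
For reference, HM-Diag reads this variable from the device environment. A minimal sketch of the kind of unauthenticated upload that the Legacy Bucket Writer role above permits is shown below; the payload and object name are purely illustrative:

```python
import json
import os

import requests

bucket = os.environ["DIAGNOSTICS_GCS_BUCKET"]
payload = {"serial_number": "0123456789abcdef", "uptime": 3600}  # illustrative record

# Simple (media) upload via the GCS JSON API. No credentials are sent;
# this only works because allUsers holds Storage Legacy Bucket Writer.
response = requests.post(
    f"https://storage.googleapis.com/upload/storage/v1/b/{bucket}/o",
    params={"uploadType": "media", "name": "0123456789abcdef.json"},
    headers={"Content-Type": "application/json"},
    data=json.dumps(payload),
)
response.raise_for_status()
```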

Configure BigQuery Data Set

  • Use the left-hand drawer to navigate to the BigQuery service.
  • In the BigQuery explorer, click the actions menu next to your project's name and select "Add Data Set". Configure your BigQuery data set as follows (a scripted equivalent follows this list):
    • Data Set ID: Something sensible, e.g. "hotspot_diagnostics_data"
    • Data Location: The preferred region for the data set.
    • Encryption: Google Managed Key
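
Equivalently, the data set can be created with the google-cloud-bigquery client; the project and data set IDs below are placeholders:

```python
from google.cloud import bigquery

client = bigquery.Client(project="my-gcp-project")  # placeholder project ID

dataset = bigquery.Dataset("my-gcp-project.hotspot_diagnostics_data")
dataset.location = "europe-west2"  # match the region used for the GCS buckets
# Encryption defaults to a Google-managed key, so nothing to configure here.
client.create_dataset(dataset, exists_ok=True)
```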

Configure Cloud Function for Ingestion

  • Use the left-hand drawer to navigate to the Cloud Functions service.
  • Click the Create Function button.
  • Complete the form, giving the function suitable parameters:
    • Name - Ingest Diagnostics GCS to BigQuery
    • Region - Should be the same region as the data set and GCS buckets
    • Trigger Type - Cloud Storage
    • Event Type - Finalise / Create
    • Bucket - helium-miner-data
  • Click Save and Next
  • On the function page select:
    • Runtime - Python 3.9
    • Entry Point - import_diagnostics_data
    • Copy the contents of import_script.py into the main.py file in the inline editor (a hedged sketch of the entry point's shape follows this list).
      • Update the GCS bucket name and BigQuery data set as required.
    • Copy the contents of requirements.txt to the requirements.txt file in the inline editor.
  • Click Deploy
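
The real ingestion logic lives in import_script.py in this repository. Purely as orientation, a background Cloud Function entry point of this shape might look like the sketch below; the table ID is a placeholder, and each uploaded object is assumed to be a single JSON record:

```python
import json

from google.cloud import bigquery, storage

# Placeholder -- point this at your own project, data set, and table.
BQ_TABLE = "my-gcp-project.hotspot_diagnostics_data.diagnostics"


def import_diagnostics_data(event, context):
    """Triggered by a GCS finalise/create event for each uploaded object."""
    blob = storage.Client().bucket(event["bucket"]).blob(event["name"])
    row = json.loads(blob.download_as_bytes())

    errors = bigquery.Client().insert_rows_json(BQ_TABLE, [row])
    if errors:
        raise RuntimeError(f"BigQuery insert failed: {errors}")
```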

Check Functionality

  • Wait for miners to submit diagnostics data to the bucket.
  • Observe the Cloud Function's logs to ensure there are no tracebacks or exceptions.
  • Check BigQuery to ensure data is inserted as expected (a quick query sketch follows).
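
A quick way to confirm rows are landing is a count query against the table; the project, data set, and table IDs below are placeholders:

```python
from google.cloud import bigquery

client = bigquery.Client(project="my-gcp-project")  # placeholder project ID
rows = client.query(
    "SELECT COUNT(*) AS row_count "
    "FROM `my-gcp-project.hotspot_diagnostics_data.diagnostics`"
).result()
print(f"Diagnostics rows ingested: {next(iter(rows)).row_count}")
```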