Right now load_dag scans the data in all partitions every day. In particular, the enrich transactions SQL https://github.com/blockchain-etl/bitcoin-etl-airflow/blob/master/dags/resources/stages/enrich/sqls/transactions.sql has to join inputs with outputs, which requires scanning all past data.
An alternative is to enrich transactions in export_dag using https://github.com/blockchain-etl/bitcoin-etl#enrich_transactions.
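To illustrate what enriching at export time means, here is a minimal sketch, not the actual bitcoin-etl implementation. It assumes transactions are plain dicts, and that each input carries `spent_transaction_hash` and `spent_output_index` fields pointing at the output it spends; these field names, and the `known_outputs` parameter, are assumptions made for the example. A real job would also have to look up outputs from earlier blocks, which the hypothetical `known_outputs` dict stands in for.

```python
from typing import Any, Dict, List, Optional, Tuple

OutputRef = Tuple[str, int]  # (transaction hash, output index)


def enrich_transactions(
    transactions: List[Dict[str, Any]],
    known_outputs: Optional[Dict[OutputRef, Dict[str, Any]]] = None,
) -> List[Dict[str, Any]]:
    """Copy value and addresses from each spent output onto the input that spends it."""
    # Index every output we know about by (transaction hash, output index).
    # known_outputs is a hypothetical stand-in for outputs of earlier blocks.
    outputs_by_ref = dict(known_outputs or {})
    for tx in transactions:
        for output in tx.get("outputs", []):
            outputs_by_ref[(tx["hash"], output["index"])] = output

    enriched = []
    for tx in transactions:
        new_inputs = []
        for tx_input in tx.get("inputs", []):
            ref = (
                tx_input.get("spent_transaction_hash"),
                tx_input.get("spent_output_index"),
            )
            spent = outputs_by_ref.get(ref)
            new_input = dict(tx_input)  # do not mutate the caller's data
            if spent is not None:
                new_input["value"] = spent["value"]
                new_input["addresses"] = spent["addresses"]
            new_inputs.append(new_input)
        enriched.append({**tx, "inputs": new_inputs})
    return enriched
```

With the join done once per exported batch like this, the load step would only need to append already-enriched rows instead of re-joining all past inputs and outputs in BigQuery.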
This will reduce the BigQuery costs significantly.
This might also require changing the timestamp field type from int to ISO 8601 in the export jobs, so that the raw tables can be partitioned by this field. This is a breaking compatibility change, so the version will be bumped to 2.*. Currently the raw tables are not partitioned, which makes the enrich job scan the whole table.
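As a rough sketch of the timestamp change, the export job could convert the integer Unix timestamp to an ISO 8601 string before writing each row. The function name `to_iso8601` is hypothetical, and the example assumes the int field holds seconds since the epoch in UTC.

```python
from datetime import datetime, timezone


def to_iso8601(unix_seconds: int) -> str:
    """Convert a Unix timestamp in seconds (assumed UTC) to an ISO 8601 string."""
    return datetime.fromtimestamp(unix_seconds, tz=timezone.utc).isoformat()


# Timestamp of the Bitcoin genesis block.
print(to_iso8601(1231006505))  # prints 2009-01-03T18:15:05+00:00
```

Whether this exact string form is the best fit for the load job is left open here; the point is only that the field carries an explicit date that a partitioned table can use.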