test stable branch git fix up #687

morgsmccauley · 2024-04-22T21:38:40Z

Utility to fetch blocks for tests. Wildcard tests for Coordinator indexer rules matching with wildcards.
Allow snake case blocks to be returned from block server
Fixed timestamp field for lag log.
Refactor: Introduce Editor context refactoring (Refactor: Introduce Editor context refactoring #98)
Updated indexed historical data folder
Limit unindexed block processing to two hours of blocks.
DPLT-1021 feat: enable graphiql explorer plugin ([DPLT-1021] feat: enable graphiql explorer plugin #94)
DPLT-1028 feat: Fork Your Own indexer modal (DPLT-1028 feat: Fork Your Own indexer modal #99)
DPLT-936 message per block (DPLT-936 message per block #101)
DPLT-975 feat: support for code-exporter (DPLT-975 feat: support for code-exporter #100)
fix: fix build for no named component error
DPLT-927 Write current_historical_block_height during historical backfill (DPLT-927 Write current_historical_block_height during historical backfill #102)
DPLT-1014 feat: add historical block processing column (DPLT-1014 feat: add historical block processing column #104)
feat: Stream blocks while executing (feat: Stream blocks while executing #87)
DPLT-1019 feat: store debug list in LocalStorage ([DPLT-1019] feat: store debug list in LocalStorage #90)
DPLT-929 historical filtering (DPLT-929 historical filtering #81)
ci: Fix rust CI check workflow
DPLT-1009 feat: contract validation with wild cards ([DPLT-1009] feat: contract validation with wild cards #93)
fix: contract name
DPLT-1007 feat: Add link to QueryApi Docs ([DPLT-1007] feat: Add link to QueryApi Docs #95)
DPLT-1002 Publish current block height of indexer functions to CloudWatch (DPLT-1002 Publish current block height of indexer functions to CloudWatch #88)
DPLT-1020 Fetch indexing metadata file to determine last_indexed_block. (DPLT-1020 Fetch indexing metadata file to determine last_indexed_block. #96)
Fixed timestamp field for lag log.
Refactor: Introduce Editor context refactoring (Refactor: Introduce Editor context refactoring #98)
Updated indexed historical data folder
Limit unindexed block processing to two hours of blocks.
DPLT-1021 feat: enable graphiql explorer plugin ([DPLT-1021] feat: enable graphiql explorer plugin #94)
DPLT-1028 feat: Fork Your Own indexer modal (DPLT-1028 feat: Fork Your Own indexer modal #99)
DPLT-936 message per block (DPLT-936 message per block #101)
Shell script for test blocks, updated test for block level matching result instead of action level.
cargo fmt
Updated readme with test block info.
Fixed merge error
Refactor S3 and SQS operations into their own modules.
S3 list operation now handles continuation tokens and directory listings (via delimeter and common prefixes). MOCK queue_url logs instead of sending SQS messages.
Small adjustments from PR feedback
S3 list methods for wildcards and comma separated contracts.
[DPLT-1042] Add support for backend only mutations from Runner ([DPLT-1042] Add support for backend only mutations from Runner #109)
Full support for wildcard and CSV contract matching.
[DPLT-1044] Write latest near.org post block height to Grafana ([DPLT-1044] Write latest near.org post block height to Grafana #112)
Clippy recommended fixes
The fmt of the clippy
chore: Stub commit to trigger lambda deploy (chore: Stub commit to trigger lambda deploy #115)
DPLT-1051 Do not retry errors, send them straight to the DLQ. (DPLT-1051 Do not retry errors, send them straight to the DLQ. #117)
[DPLT-1044] Fix broken import in latest post metrics writer ([DPLT-1044] Fix broken import in latest post metrics writer #119)
Fixed log order of operations
[DPLT-1044] Handle non JSON responses from api.near.social ([DPLT-1044] Handle non JSON responses from api.near.social #120)
[DPLT-1044] Pass required JSON content type header ([DPLT-1044] Pass required JSON content type header #121)
Temporarily log block lag
Removed temp logging
chore: Add CODEOWNERS (chore: Add CODEOWNERS #125)
feat: Enable backend_only mutations by default (DPLT-1042 Enable backend_only mutations by default #124)
Chore: Update explorer plugin (Chore: Update explorer plugin #123)
feat: only allow lowercase titles in title (feat: only allow lowercase titles in title #116)
DPLT-1043 chore: update hasura links to ems links (DPLT-1043 chore: update hasura links to ems links #113)
DPLT-1034 feat: allow to edit indexer when forking (DPLT-1034 feat: allow to edit indexer when forking #127)
Correct first block request for provisioning in historical processing.
cleanup some unused variables
DPLT-1044 Expose context method for writing custom Grafana metrics (DPLT-1044 Expose context method for writing custom Grafana metrics #126)
DPLT-1044 Change HISTORICAL dimension to EXECUTION_TYPE (DPLT-1044 Change HISTORICAL dimension to EXECUTION_TYPE #130)
DPLT-1044 Expose fetch to VM runner (DPLT-1044 Expose fetch to VM runner #131)
DPLT-1055 Log multiple console.log / context.log arguments
Improvements to integration test handling of execution options.
DPLT-1044 Add context.fetchFromSocialApi (DPLT-1044 Add context.fetchFromSocialApi #134)
docs: update metatdata (docs: update metatdata #135)
DPLT-1044 Create scheduled Lambda to write lag behind near social (DPLT-1044 Create scheduled Lambda to write lag behind near social #137)
cargo fmt
DPLT-1049 Provision separate DB per user (DPLT-1049 Provision separate DB per user #132)
DPLT-1049 Fix issues with separate user DB provisioning (DPLT-1049 Fix issues with separate user DB provisioning #139)
DPLT-1049 Increase Lambda timeout to cater for separate user DB provisioning (DPLT-1049 Increase Lambda timeout to cater for separate user DB provisioning #140)
DPLT-1049 Revert separate user DB provisioning (DPLT-1049 Revert separate user DB provisioning #143)
Docs/frontend (Docs/frontend #128)
DPLT-1053 Feat: implement Following feed (DPLT-1053 Feat: implement Following feed #138)
feat: pagination Following Feed (feat: pagination Following Feed #146)
Updated integration tests with mock metrics
DPLT-1066 feat: add fallback to old feed (DPLT-1066 feat: add fallback to old feed #148)
chore: Remove backend_only mutation default (chore: Remove backend_only mutation default #150)
DPLT-1049 Provision separate DB per user (DPLT-1049 Provision separate DB per user #144)
fix: Separate user db automated & script provisioning (fix: Separate user db automated & script provisioning #151)
fix: Provision same named schema to ensure same named mutation (fix: Provision same named schema to ensure same named mutation #152)
fix: Use correct schema name in is provisioned check (fix: Use correct schema name in is provisioned check #153)
DPLT-1068 feat: likes on comments (DPLT-1068 feat: likes on comments #149)
DPLT-1124 Create script to preemptively provision user DBs (DPLT-1124 Create script to preemptively provision user DBs #155)
S3 & Historical processing errors are bubbled up to single handler. RPC block date lookup handles missing blocks.
Handle new index file folder structure that is optimized for sub-account wildcards.
cargo fmt
DPLT-1074 Queue real-time messages on Redis Streams (DPLT-1074 Queue real-time messages on Redis Streams #157)
Revert "DPLT-1074 Queue real-time messages on Redis Streams" (Revert "DPLT-1074 Queue real-time messages on Redis Streams" #160)
Revert "Revert "DPLT-1074 Queue real-time messages on Redis Streams"" (Revert "Revert "DPLT-1074 Queue real-time messages on Redis Streams"" #161)
Matches latest index files structure. Additional debugging information in logs.
cargo fmt of format
DEC-1372 feat: add moderation list support for feed (DEC-1372 feat: add moderation list support for feed #163)
DPLT-1076 create node worker to process real time streams (DPLT-1076 create node worker to process real time streams #159)
DPLT-1084 Add support for Runner infrastructure (DPLT-1084 Add support for Runner infrastructure #166)
DPLT-999 feat: fetch activeTab from localstorage to persist-tabs (DPLT-999 feat: fetch activeTab from localstorage to persist-tabs #162)
DEC-1373 flag button on posts/comments (DEC-1373 flag button on posts/comments #164)
DPLT-1083 Add Docker Compose file (DPLT-1083 Add Docker Compose file #168)
DPLT-1084 Add Hasura CLI config (DPLT-1084 Add Hasura CLI config #167)
DPLT-1129 Publish unprocessed stream messages count to Grafana (DPLT-1129 Publish unprocessed stream messages count to Grafana #169)
Revert "DPLT-1129 Publish unprocessed stream messages count to Grafana" (Revert "DPLT-1129 Publish unprocessed stream messages count to Grafana" #171)
DPLT-1129 Publish Runner metrics to Grafana (DPLT-1129 Publish Runner metrics to Grafana #170)
fix: Use correct PG env vars when adding hasura datasource (fix: Use correct PG env vars when adding hasura datasource #172)
fix: Local development & add steps in README.md (fix: Local development & add steps in README.md #173)
fix: Fetch cargo packages from registry in Docker build (fix: Fetch cargo packages from registry in Docker build #174)
ci: Add GitHub Action for Runner CI (ci: Add GitHub Action for Runner CI #175)
feat: add replacement map feature (feat: add replacement map feature #176)
chore: Remove unused transaction cache (chore: Remove unused transaction cache #182)
DPLT-1077 Process historical messages via Redis Streams (DPLT-1077 Process historical messages via Redis Streams #181)
feat: Generate insert and select methods for context object under db (feat: Generate insert and select methods for context object under db #177)
feat: add temp mint (feat: add temp mint #183)
mint nft url (mint nft url #184)
fix: Avoid writing misleading failed/skipped duration metrics (fix: Avoid writing misleading failed/skipped duration metrics #187)
chore: Revert hackathon minting code (chore: Revert hackathon minting code #186)
DPLT-1022 feat: store code in local storage (DPLT-1022 feat: store code in local storage #180)
Add darunrs.near to whitelist (Add darunrs.near to whitelist #185)
fix: Improve Scope of Table Name Regex to Resolve Errors
DPLT-1136: Implement update, upsert, and delete on context.db (DPLT-1136: Implement update, upsert, and delete on context.db #189)
DPLT-1118 Parallelize stream processing with worker threads (DPLT-1118 Parallelize stream processing with worker threads #191)
fix: Skip missing blocks in manual filtering (fix: Skip missing blocks in manual filtering #195)
DPLT-1121: Generate and Support Strongly Typed Objects for DB Methods (DPLT-1121: Generate and Support Strongly Typed Objects for DB Methods #193)
feat: Add indexer_log_entries indexes to hasura migrations (feat: Add indexer_log_entries indexes to hasura migrations #238)
Update issue templates
Update issue templates
feat: add aggregation fields by default (feat: add aggregation fields by default #265)
feat: put toggle + disclaimer (feat: put toggle + disclaimer #237)
hot-fix: change back external react url
Store Real Time Streamer Messages in Redis (Store Real Time Streamer Messages in Redis #241)
chore: add prod graphql link (chore: add prod graphql link #272)
fix: Handle errors emitted within worker threads (fix: Handle errors emitted within worker threads #270)
feat: add registeration button (feat: add registeration button #285)
fix: fix playground issue + update readme for env vars for react app (fix: fix playground issue + update readme for env vars for react app #284)
feat: remove logs when toggle pressed
hot-fix: skip context.db (hot-fix: skip context.db #294)
ignore all runner errors in V1 rather than sending them to the DLQ (ignore all runner errors in V1 rather than sending them to the DLQ #295)
Update bug_report.md
fix: upgrade eslint-config-next from 13.1.6 to 13.5.3
fix: upgrade @next/font from 13.1.6 to 13.5.3
fix: upgrade @types/node from 18.13.0 to 18.18.1
fix: upgrade @types/react from 18.0.28 to 18.2.23
fix: upgrade eslint from 8.34.0 to 8.50.0
Automatically close issues marked 'done' (Automatically close issues marked 'done' #320)
fix: Rename close-completed-issues to close-completed-issues.yml (fix: Rename close-completed-issues to close-completed-issues.yml #327)
fix: Use Histogram to prevent stale metrics being scraped (fix: Use Histogram to prevent stale metrics being scraped #332)
fix: frontend/package.json to reduce vulnerabilities
chore: Remove V1/V2 toggle (chore: Remove V1/V2 toggle #339)
feat: Use V2 endpoint to calculate social lag metric (feat: Use V2 endpoint to calculate social lag metric #344)
fix: Expose HASURA_ENDPOINT_V2 env var to Lambda pipeline (fix: Expose HASURA_ENDPOINT_V2 env var to Lambda pipeline #349)
chore: Trigger Lambda deployment (chore: Trigger Lambda deployment #350)
fix: Write UNPROCESSED_STREAM_MESSAGES metric on both failure/success (fix: Write UNPROCESSED_STREAM_MESSAGES metric on both failure/success #352)
fix: Ensure historical invocations are configured correctly (fix: Ensure historical invocations are configured correctly #353)
refactor: Aggregate worker metrics in background (refactor: Aggregate worker metrics in background #355)
refactor: Remove SQS references from Coordinator (refactor: Remove SQS references from Coordinator #365)
feat: Cancel historical backfill process before starting anew (feat: Cancel historical backfill process before starting anew #363)
fix: Ensure missing historical blocks are handled correctly (fix: Ensure missing historical blocks are handled correctly #373)
feat: Pre-Fetch Streamer Messages (feat: Pre-Fetch Streamer Messages #269)
fix: Ensure Historical Stream is cleared before pushing new messages (fix: Ensure Historical Stream is cleared before pushing new messages #375)
Revert "feat: Pre-Fetch Streamer Messages" (Revert "feat: Pre-Fetch Streamer Messages" #377)
Revert "Revert "feat: Pre-Fetch Streamer Messages"" (Revert "Revert "feat: Pre-Fetch Streamer Messages"" #378)
refactor: Speed up Historical Backfill process (refactor: Speed up Historical Backfill process #379)
fix: Ensure historical invocations are configured correctly (fix: Ensure historical invocations are configured correctly #385)
fix: Incorrect test imports (fix: Incorrect test imports #386)
refactor: Remove async indexer_rules_engine (refactor: Remove async indexer_rules_engine #387)
fix: Ensure Indexer Config Updates Are Read in Runner (fix: Ensure Indexer Config Updates Are Read in Runner #384)
fix: Coordinator Not Updating Redis Indexer Config (fix: Coordinator Not Updating Redis Indexer Config #398)
QueryAPI Logging Features + UI/UX changes (QueryAPI Logging Features + UI/UX changes #388)
chore: hot fixes for logs feature (chore: hot fixes for logs feature #411)
Fix Connection Unavailable Error (Fix Connection Unavailable Error #413)
Replace block wait metric with Histogram (Replace block wait metric with Histogram #424)
fix: Try to resolve heap memory errors (fix: Try to resolve heap memory errors #435)
fix: Fix bugs in stream message handling and minor logging improvements (fix: Fix bugs in stream message handling and minor logging improvements #438)
feat: Create initial Block Streamer service (feat: Create initial Block Streamer service #428)
[Snyk] Security upgrade node from 18.17 to 18.18.2 ([Snyk] Security upgrade node from 18.17 to 18.18.2 #383)
feat: Expose endpoint to control streams (feat: Expose endpoint to control streams #430)
test: Make block-streamer unit-testable and add tests (test: Make block-streamer unit-testable and add tests #442)
feat: support multiple contract filters (feat: support multiple contract filters #451)
refactor: Extract registry types in to own crate (refactor: Extract registry types in to own crate #453)
feat: add reload table button (feat: add reload table button #450)
Fix type gen bug (Fix type gen bug #449)
feat: Make log/state Hasura tables backend_only (feat: Make log/state Hasura tables backend_only #462)
fix: Prune Unnecessary Logs (fix: Prune Unnecessary Logs #465)
feat: Runner gRPC endpoint (feat: Runner gRPC endpoint #446)
feat: Create function to validate types and format from SQL schema
refactor: Reduce the error states and remove unnecessary Alerts
refactor: Debounce the schema validation
chore: Create tests folder for improved folder structure
feat: Add created/updated at block heights to registry (feat: Add created/updated at block heights to registry #458)
remove side effects from reformatAll function
refactor: make codeValidation reusable and separate concerns in reformatAll function
refactor: separate concern con useEffect, and react separately to changes on the indexerDetails object. - Show the schema even if it fails the validations
fix: remove dead code
refactor: Replace requestLatestBlockHeight with direct getLatestBlockHeight call
refactor: Replace requestLatestBlockHeight with direct getLatestBlockHeight call
refactor: only validate schemas different from the default
refactor: remove log
fix: fix issue when generating types, based on response type from astify method
refactor: decouple ResizableLayourEditor from Editor
chore: Add description to function
fix: change reference to variable
refactor: Make error messages constants
fix: Solve an issue on the reformatAll function, also added real-time validation when user is changing the code
fix: Reset errors if code/schemar are ok when reloading
feat: Create rust GRPC client for Runner (feat: Create rust GRPC client for Runner #491)
feat: Create initial Coordinator V2 service (feat: Create initial Coordinator V2 service #444)
Improve code/schema validation before registering (Improve code/schema validation before registering #495)
refactor: Remove hard-coded shard count (refactor: Remove hard-coded shard count #502)
fix: Ensure array is returned to Promise.all (fix: Ensure array is returned to Promise.all #504)
refactor: Configure coordinator/block-streamer via environment (refactor: Configure coordinator/block-streamer via environment #503)
feat: Toggle Runner Version (feat: Toggle Runner Version #488)
fix: Resolve Proto File Not Found Build Error (fix: Resolve Proto File Not Found Build Error #514)
feat: Capture errors thrown within Coordinator (feat: Capture errors thrown within Coordinator #515)
feat: Only start indexers set within the allowlist (feat: Only start indexers set within the allowlist #518)
feat: Add Dockerfile for Coordinator V2 (feat: Add Dockerfile for Coordinator V2 #519)
feat: Support Deployment of Block Streamer (feat: Support Deployment of Block Streamer #516)
feat: Enable Block Streams Start from V1 Interruption (feat: Enable Block Streams Start from V1 Interruption #517)
feat: Logging & Error updates (feat: Logging & Error updates #526)
fix: Prevent Coordinator from stopping V1 executors (fix: Prevent Coordinator from stopping V1 executors #544)
fix: Resolve duplicate processing of messages in Runner (fix: Resolve duplicate processing of messages in Runner #545)
fix: Executors would crash when DmlHandler.create times out (fix: Executors would crash when DmlHandler.create times out #547)
feat: Auto migrate indexers to Control Plane (feat: Auto migrate indexers to Control Plane #527)
fix: Crashed Runner Executors would continue to display RUNNING (fix: Crashed Runner Executors would continue to display RUNNING #550)
fix: Various Control Plane migration fixes (fix: Various Control Plane migration fixes #552)
feat: Add "StartBlock::Continue" to registry contract/types (feat: Add "StartBlock::Continue" to registry contract/types #548)
fix: Labels showing up in Grafana and Composed Runner fails due to Region Missing
feat: Log Worker crashes in Indexer logs
feat: Add metrics for memory footprint
fix: Set current block height for listed V1 indexers and fix rust formatting
feat: Remove executor metrics on stop (feat: Remove executor metrics on stop #557)
fix: Prevent skipping blocks from Redis Stream (fix: Prevent skipping blocks from Redis Stream #558)
feat: Handle StartBlock options within Coordinator & Block Streamer (feat: Handle StartBlock options within Coordinator & Block Streamer #553)
fix: Various block streamer issues (fix: Various block streamer issues #556)
fix: Remove throwing of error during intended retry loop (fix: Remove throwing of error during intended retry loop #563)
feat: Improve Schema Error Diplay with Glyphs (feat: Improve Schema Error Diplay with Glyphs #562)
feat: Support Star Contract Filter in Block Streamer (feat: Support Star Contract Filter in Block Streamer #572)
feat: Update frontend to handle new registry types (feat: Update frontend to handle new registry types #566)
feat: Remove allowlist from Coordinator V2 (feat: Remove allowlist from Coordinator V2 #570)
fix: view and monaco glyph interactions with debug and diffviedw (fix: view and monaco glyph interactions with debug and diffviedw #571)
feat: Add metrics to Block Streamer (feat: Add metrics to Block Streamer #579)
Feat: added CORS matching all routes (feat: added CORS matching all routes #578)
chore: Remove legacy code (chore: Remove legacy code #576)
Feat: Support Wildcard and * contract filters (feat: Support Wildcard and * contract filters #567)
feat: Cache Streamer Message from Block Streamer (feat: Cache Streamer Message from Block Streamer #582)
fix: Increase timeout of VM run function (fix: Increase timeout of VM run function #584)
541 update the frontend to dis allow use of this filter (fix: update middleware for CORS #583)
feat: Cache Database Credentials (feat: Cache Database Credentials #585)
feat: fixed reload button, unmount grid (feat: fixed reload button, unmount grid #592)
feat: Support Adding of Owners through add_user (feat: Support Adding of Owners through add_user #590)
feat: Reduce Runner Logs and DB Writes (feat: Reduce Runner Logs and DB Writes #589)
fix: Block Streamer not updating last published block (fix: Block Streamer not updating last published block #596)
fix: Cap Block Streamer Caching and Merge Redis Impl (fix: Cap Block Streamer Caching and Merge Redis Impl #599)
feat: Instrument Runner Service (feat: Instrument Runner Service #602)
Support WHERE col IN (...) in context.db.table.select and delete (Support WHERE col IN (...) in context.db.table.select and delete #606)
feat: Include indexer name in context db build failure warning (feat: Include indexer name in context db build failure warning #611)
Cache provisioning status (Cache provisioning status #607)
Fix ESLint on DmlHandler (Fix ESLint on DmlHandler #612)
fix: Substitution 'node-sql-parser' with a forked version until April 1st (Next Release) (fix: Substitution 'node-sql-parser' with a forked version until April 1st (Next Release) #597)
feat: Add pgBouncer to QueryApi (feat: Add pgBouncer to QueryApi #615)
feat: Expose near-lake-primitives to VM (feat: Expose near-lake-primitives to VM #613)
feat: Schedule log partition jobs during provisioning (feat: Schedule log partition jobs during provisioning #625)
test: Add integration tests for Indexer (test: Add integration tests for Indexer #627)
feat: Support Case Sensitive Schemas (feat: Support Case Sensitive Schemas #624)
fix: DmlHandler using wrong port (fix: DmlHandler using wrong port #630)
fix: Configure cron db from environment (fix: Configure cron db from environment #628)
chore: Disable new logs table while still in development (chore: Disable new logs table while still in development #632)
Introducing Logging Class (Disabled Usage of Logger) (feat: Introducing Logging Class (Disabled Usage of Logger) #608)
feat: Avoid unnecessary status updates (feat: Avoid unnecessary status updates #637)
feat: Code for Set Status and Blockheight through Postgres (feat: Code for Set Status and Blockheight through Postgres #634)
fix: Unresolved comments in feat: Introducing Logging Class (Disabled Usage of Logger) #608 (fix: Unresolved comments in #608 #640)
feat: Provision logs for existing users (feat: Provision logs for existing users #636)
feat: Add GCP compatible logging format to Block Streamer (feat: Add GCP compatible logging format to Block Streamer #655)
Introduce provisioning of Logs Table for new and existing users (feat: Introduce provisioning of Logs Table for new and existing users #643)
refactor: Convert IndexerConfig to Class (refactor: Convert IndexerConfig to Class #646)
feat: Conditionally provision metadata table (feat: Conditionally provision metadata table #658)
fix: type generation on load (fix: type generation on load #648)
Enable Logging functionality to both new and old Log Tables (feat: Enable Logging functionality to both new and old Log Tables #657)
feat: Count S3 get requests made by near-lake-framework (feat: Count S3 get requests made by near-lake-framework #662)
fix: fix cron in provisioning (fix: fix cron in provisioning #669)
feat: Enable Metadata Table Writes (feat: Enable Metadata Table Writes #659)
fix: Use compatible versions across inter-dependant crates (fix: Use compatible versions across inter-dependant crates #671)
fix: Reduce requests made to Near Lake S3 (fix: Reduce requests made to Near Lake S3 #665)
feat: Add more metrics for Lake Cache (feat: Add more metrics for Lake Cache #672)
feat: Retry Tracking and Permissions for Tables in Hasura (feat: Retry Tracking and Permissions for Tables in Hasura #663)
feat: Rename logs and metadata tables (feat: Rename logs and metadata tables #677)
fix: Continue Replacement of Logs/Metadata tables after untracking failures (fix: Continue Replacement of Logs/Metadata tables after untracking failures #675)
fix: Remove deletion of old logs and metadata tables (fix: Remove deletion of old logs and metadata tables #679)
feat: Write GCP compatible logs from Runner (feat: Write GCP compatible logs from Runner #680)
fix: Write to winston instead of console (fix: Write to winston instead of console #681)
fix: Specify missing log levels (fix: Specify missing log levels #682)
fix: Correct context.set graphql query (fix: Correct context.set graphql query #683)
Use new Metadata Table for status and block height (Use new Metadata Table for status and block height #676)
fix: Add back yarn frontend (fix: Add back yarn frontend #685)
feat: Expose log count metric from Runner (feat: Expose log count metric from Runner #684)

Runner is lacking instrumentation. It is responsible for many things, and it has become hard to understand which tasks contribute to the overall latency of an indexer. In addition, we are now at a point where we need to drive down latencies to facilitate new * indexer use cases such as access keys. I've chosen to instrument Runner with OpenTelemetry. Tracing generally requires three components: an instrumented service, a trace collector, and a trace visualizer. The service is responsible for collecting and transmitting trace data to the collector. The collector should be able to receive trace data with little fuss to prevent performance impacts to the instrumented service. The collector then processes the trace data and transmits the processed data to the visualizer. The visualizer displays trace data and allows for filtering on traces. The benefit of OpenTelemetry over other options like Zipkin and Jaeger is that GCP already supports ingesting OpenTelemetry data. As such, we don't need to provision a collector ourselves, and can instead leverage GCP's existing collector and visualizer, the Tracing service. For local development, traces can be output to the console, to a Zipkin all-in-one container, or to GCP (which requires the Cloud Trace Agent role and specifying a project ID). This is done by simply initializing the NodeSDK differently. In addition, we do not want to enable traces in prod yet, so no exporter is specified there. This creates a No-Op Trace Exporter which won't attempt to record traces. No changes were made to the code execution path. All tests pass with no changes, aside from having to replace snapshots due to changes in the tabbing of mutation strings. I have manually verified that the mutation strings are still the same by stripping whitespace and checking against the originals.
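As a rough illustration of "initializing the NodeSDK differently", the choice of exporter could be driven by the environment. This is only a sketch: the `resolveTracingConfig` function and the `TRACING_EXPORTER`, `ZIPKIN_ENDPOINT` and `GCP_PROJECT_ID` variable names are assumptions made for this example, not Runner's actual configuration. The result would then be used to pick which exporter (if any) is handed to the SDK.

```typescript
// Hypothetical sketch: decide which trace exporter to use from the environment.
// All environment variable names here are illustrative assumptions.
type TracingConfig =
  | { exporter: 'none' }
  | { exporter: 'console' }
  | { exporter: 'zipkin'; url: string }
  | { exporter: 'gcp'; projectId: string };

function resolveTracingConfig(env: Record<string, string | undefined>): TracingConfig {
  switch (env['TRACING_EXPORTER']) {
    case 'console':
      return { exporter: 'console' };
    case 'zipkin':
      // Default collector endpoint of a local Zipkin all-in-one container.
      return { exporter: 'zipkin', url: env['ZIPKIN_ENDPOINT'] ?? 'http://localhost:9411/api/v2/spans' };
    case 'gcp': {
      const projectId = env['GCP_PROJECT_ID'];
      if (projectId === undefined || projectId === '') {
        throw new Error('GCP_PROJECT_ID is required for the gcp exporter');
      }
      return { exporter: 'gcp', projectId };
    }
    default:
      // Nothing specified (e.g. prod for now): tracing stays a no-op.
      return { exporter: 'none' };
  }
}
```

Keeping the "no exporter" case as the default means a deployment that sets nothing records no traces, which matches the prod behaviour described above.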

Add support for `context.db.Table.select({column_name: ['a.near', 'b.near']})`. The same support is added for `delete`. Frontend support is added. I also improved parameter naming to reflect SQL statements like `where`, `values` and `set`.
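A minimal sketch of how an array value in the filter object could be translated into a `WHERE col IN (...)` clause. The `buildWhereClause` name and the `$n` placeholder style are assumptions for illustration; this is not the actual DmlHandler code.

```typescript
type Scalar = string | number;
type WhereValue = Scalar | Scalar[];

interface WhereClause {
  sql: string;
  params: Scalar[];
}

// Hypothetical helper: arrays become `col IN ($1, $2, ...)`, scalars become `col = $n`.
function buildWhereClause(where: Record<string, WhereValue>): WhereClause {
  const params: Scalar[] = [];
  const conditions = Object.keys(where).map((column) => {
    const value = where[column];
    if (Array.isArray(value)) {
      // An empty list can match nothing, and `IN ()` is not valid SQL.
      if (value.length === 0) return 'FALSE';
      const placeholders = value.map((item) => {
        params.push(item);
        return `$${params.length}`;
      });
      return `${column} IN (${placeholders.join(', ')})`;
    }
    params.push(value);
    return `${column} = $${params.length}`;
  });
  return {
    sql: conditions.length > 0 ? `WHERE ${conditions.join(' AND ')}` : '',
    params,
  };
}
```

For example, `buildWhereClause({ account_id: ['a.near', 'b.near'], block_height: 100 })` yields the SQL fragment `WHERE account_id IN ($1, $2) AND block_height = $3` with the three values passed separately as parameters.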

context.db build failures are only logged rather than blocking execution, so that indexers with complex schemas can still run with only GraphQL calls available. However, these logs are output repeatedly and are not tagged with an indexer name. This PR adds the indexer name to the log to aid debugging.

Checking provisioning status through Hasura takes 70-100ms on Dev. This PR caches the provisioning status inside of `Provisioner` so that it does not make extra requests to Hasura on every run.
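The caching idea can be sketched as below. The `ProvisioningStatusCache` class and the `checkRemote` callback are hypothetical stand-ins for the Hasura lookup, not the real `Provisioner` API. Only positive results are cached here, on the assumption that an unprovisioned indexer should be re-checked on the next run.

```typescript
// Hypothetical stand-in for the 70-100ms Hasura lookup.
type ProvisionCheck = (account: string, schema: string) => Promise<boolean>;

class ProvisioningStatusCache {
  private readonly provisioned = new Set<string>();

  constructor(private readonly checkRemote: ProvisionCheck) {}

  async isProvisioned(account: string, schema: string): Promise<boolean> {
    const key = `${account}/${schema}`;
    // Cache hit: skip the remote request entirely.
    if (this.provisioned.has(key)) return true;

    const result = await this.checkRemote(account, schema);
    // Remember only successful checks so "not yet provisioned" is re-evaluated later.
    if (result) this.provisioned.add(key);
    return result;
  }
}
```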

… 1st (Next Release) (#597) This is a temporary change to our codebase: it replaces node-sql-parser with kevin-node-sql-parser until April 1st. The official node-sql-parser has not yet published a release that includes the additional SQL statements our project requires. In the interim, this forked version addresses these shortcomings and allows us to incorporate the required SQL statements. We plan to revert to the official node-sql-parser version after April 1st, once the required features are officially available. See last comment for details.

QueryApi has experienced issues with Postgres connections since the introduction of * indexers, because QueryApi creates these connections through application-level connection pools. Since we can't use one pool for all workers, I've introduced PgBouncer as middleware to serve as an additional connection pooler in front of the DB.

- Upgrade `near-lake-primitives` to `0.2.0`, which includes `borsh`
- Expose entire `near-lake-primitives` library to VM via `primitives`, e.g. borsh can be accessed via `primitives.borsh.fromBorsh()`

This PR expands provisioning to also schedule the cron jobs for adding/deleting log partitions. It assumes:
1. The `cron` database exists and has `pg_cron` enabled (near/near-ops#1665)
2. The `__logs` table exists and has the partition functions defined (#608)

In relation to this flow, the high-level steps are:
1. Use an admin connection to the `cron` database to grant the required access to the user
2. Use a user connection to the `cron` database to schedule the jobs

The cron job is executed under the user which schedules the job, therefore the user _must_ schedule the job, as they are the only one with access to their schemas. If the admin were to schedule the job, the job itself would fail as it doesn't have the required access. Merging this before assumption 2 (#608) is in place is fine; the jobs will just fail, but should start to succeed once it has been implemented.
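The two steps could look roughly like the following SQL builders. This is a sketch under several assumptions: the partition function names (`create_logs_partition`, `delete_logs_partition`), the job names and the schedules are invented for illustration, and the use of pg_cron's `cron.schedule_in_database` to target the user's own database is my assumption rather than something stated in the PR.

```typescript
// Quote a Postgres identifier / string literal for interpolation into SQL text.
function quoteIdent(name: string): string {
  return `"${name.replace(/"/g, '""')}"`;
}

function quoteLiteral(value: string): string {
  return `'${value.replace(/'/g, "''")}'`;
}

// Step 1: run on an ADMIN connection to the `cron` database.
function grantCronAccessSql(userName: string): string {
  return `GRANT USAGE ON SCHEMA cron TO ${quoteIdent(userName)};`;
}

// Step 2: run on the USER's connection to the `cron` database, so that the
// jobs execute as the user who owns the schemas.
function scheduleLogPartitionJobsSql(schemaName: string, databaseName: string): string[] {
  // Hypothetical partition functions assumed to be defined alongside `__logs`.
  const createCmd = `SELECT ${quoteIdent(schemaName)}.create_logs_partition()`;
  const deleteCmd = `SELECT ${quoteIdent(schemaName)}.delete_logs_partition()`;
  const schedule = (job: string, cron: string, command: string): string =>
    `SELECT cron.schedule_in_database(${quoteLiteral(job)}, ${quoteLiteral(cron)}, ` +
    `${quoteLiteral(command)}, ${quoteLiteral(databaseName)});`;

  return [
    schedule(`${schemaName}_logs_create_partition`, '0 1 * * *', createCmd),
    schedule(`${schemaName}_logs_delete_partition`, '0 2 * * *', deleteCmd),
  ];
}
```

Splitting the grant and the scheduling into separate statements mirrors the admin/user connection split described above: the first string is meant for the admin connection, the second list for the user connection.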

This PR adds a very basic integration test for `Indexer`. It uses `testcontainers` to stand up both `postgres` and `hasura` so that `Indexer` can talk to real components rather than mocks. The test uses `Indexer` directly, which means S3/Redis are still somewhat mocked/ignored. We can add those in later if need be. This is essentially just the scaffolding for integration testing, which can be expanded over time. The suite includes only 1 very basic test, which if successful should provide a fair amount of confidence that things are working as expected. The flow includes: provisioning, writing data to Postgres, and then asserting its existence via GraphQL. All errors bubble up from `Indexer`, so this test should catch most problems. This PR points to #625 so that I could test the `pg_cron` flow via this integration test :)

Indexer schemas can have quoted or unquoted table and column names. However, QueryApi always quotes table names and does not quote column names during SQL query construction for context.db. This is because the AST generated from parsing the schema does not include whether the identifier was quoted or not. However, recent updates to the parsing library have added this functionality. I've updated QueryApi to quote both the table name and the column names if they were quoted originally, and leave them unquoted otherwise. In addition, I've replaced kevin-node-sql-parser with the original package now that the 5.0 update has been released. I've also added a TypeScript examples folder for convenience, as well as a script to clear the local postgres database.
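The quoting rule can be sketched as follows. The `Identifier` shape and the function names are assumptions for illustration and do not reflect the parser's actual AST. The distinction matters because PostgreSQL folds unquoted identifiers to lowercase, so `"AccountId"` and `AccountId` refer to different columns.

```typescript
// Hypothetical shape: a name plus whether it was quoted in the original schema.
interface Identifier {
  name: string;
  quoted: boolean;
}

// Re-emit the identifier the way it appeared in the schema.
function formatIdentifier(id: Identifier): string {
  return id.quoted ? `"${id.name.replace(/"/g, '""')}"` : id.name;
}

function buildSelect(table: Identifier, columns: Identifier[]): string {
  const columnList = columns.map(formatIdentifier).join(', ');
  return `SELECT ${columnList} FROM ${formatIdentifier(table)}`;
}
```

With a quoted table `Posts`, a quoted column `AccountId` and an unquoted column `block_height`, `buildSelect` produces `SELECT "AccountId", block_height FROM "Posts"`.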

DmlHandler was using port number handed to it by Hasura, which is 5432. We want it to use 6432 which is the port specified by the env variable. 6432 points to pgBouncer.

The admin/cron connection is currently hard-coded to the `cron` database, but this needs to be configurable so that we can use the default DB (`postgres`) locally. Additionally, this PR combines the `pgbouncer` / `pg_cron` init scripts and uses the combined output in both docker compose and integration tests.

Prototype draft: integrated the new logs schema. We are writing to default/public/indexer_logs_entries and also provisioning a new table for new indexers under the user's provisioned database and respective schema. https://www.loom.com/share/ff21d7099cac403d9152c905f7e4ddcc?sid=5828ae99-377b-4510-ac8c-76c02fd232f2

Quick & dirty PR which short-circuits updating the status via GraphQL when it is unchanged. We don't need to check whether block height has changed, as that is updated in a separate call later on. I've also updated the integration tests to assert the output of status/logs.
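The short-circuit can be sketched as a small wrapper that remembers the last status it sent and skips the call when nothing changed. `StatusWriter` and its `send` callback are hypothetical stand-ins for the GraphQL call, and the status strings used with it are illustrative only.

```typescript
// Hypothetical wrapper around whatever function performs the status update.
class StatusWriter {
  private lastStatus: string | undefined;
  private readonly send: (status: string) => Promise<void>;

  constructor(send: (status: string) => Promise<void>) {
    this.send = send;
  }

  async setStatus(status: string): Promise<void> {
    // Skip the network call entirely when the status is unchanged.
    if (status === this.lastStatus) return;
    await this.send(status);
    // Only remember the status once the update has succeeded,
    // so a failed update is retried on the next call.
    this.lastStatus = status;
  }
}
```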

I introduce the code necessary to perform status and last processed block height writes through Postgres. I also refactored DmlHandler and its usage in Indexer as caching of the database credentials allows for a simplification of its constructor.

Feat: created logEntry class and test cases
Chore: relocated createLogs to abstracted func
Chore: renamed schema idx to prefix with '__'

The provisioning flow will not be run for existing Indexers, so this PR adds a separate provisioning check/step which sets up the partitioned logs table for existing users. I've opted for an in-code approach as a "manual" migration script requires specific timing, i.e. we'd need to deploy the logs change, ensuring all new Indexers are provisioned correctly, and then migrate all existing users to ensure that no Indexers are missed. But since the logs provisioning change is coupled with the logging itself, existing Indexers would fail to log until the migration is complete. My only concern with this approach is a "thundering herd". After this is deployed, all Indexers will attempt to provision their logs table at the same time - I'll monitor this in Dev. As this code is temporary, I didn't bother adding instrumentation/unit-tests, nor worry about the performance impact. It will be removed promptly. This is dependent on #608 and should be merged after.

This PR adds [tracing-stackdriver](https://github.com/NAlexPear/tracing-stackdriver), which outputs logs in a GCP compatible JSON format. To enable this, the `GCP_LOGGING_ENABLED` environment variable must be set. Further, I've added additional context to errors to aid debugging. near/near-ops#1695

The first few commits are from branch #640, which this branch is based on. This PR introduces the creation of the logs table by provisioning the logsSchema and the following cron jobs for new users, but does not use or write logs to the new logs table itself. If this is merged by itself, new users will have an unused logs table, but the provisioning will occur. To provision existing users - #636

Migrating any data related to the Indexer into a common class to simplify data interactions with things like AccountId, which are common. I've also added an integ test for context DB.

Enable conditional provisioning of metadata table.

The object post astify() we receive from node-sql-parser changed. We can think about Version Control in the future. Quick fix here: returning the object to how it was and adding an additional layer on Editor to ensure mounting of types

Uncommented functionality so we actually start writing logs to the new tables that were provisioned in #643. The old logging implementation remains untouched and still functions (although it has been renamed from writeLog -> writeLogOld). We are writing to both log tables.

### 1. Provisioning and Logging (to both tables) for a new Indexer

https://www.loom.com/share/3ad6d6ea3368412e8896340a74759ffb?sid=4d5379e8-5401-41bf-9e38-d0f8e8c4eca5

### 2. Logging (to both tables) for an existing Indexer

https://www.loom.com/share/4ba411f2bcb740e1842650f695ffb347?sid=253ced68-9d4c-459f-871b-b0a3ee00cd91

### 3. Provisioning and Logging new logs table for an existing Indexer (that does not have logs table)

https://www.loom.com/share/2aa7c0cc882f4dbdb9e51fc2a9e9b7b9?sid=1aa511fe-3054-4d27-9996-2b9fddc44ed8

Depends on near/near-lake-framework-rs#102

This PR exposes a new metric which counts the number of Get requests made to S3 by `near-lake-framework`. I wanted to start tracking this metric _before_ I merge the change which reduces them, so I can measure the impact of that change. The easiest way to track these requests was to pass a custom `S3Client` to `near-lake-framework`, so we can hook in to the actual requests made. The custom `S3Client` (`LakeS3Client`) is exactly the same as the default implementation in `near-lake-framework` itself, but with the added metric. This is essentially part 1 for #419, as the "reduction" in requests will build on this custom client, adding caching/de-duplication.

The cron statement functions were attempting to access a non-existent scoped property. Added syntax for dynamic SQL generation so the property is traversed properly. Tested by setting cron fn_create_partition to trigger every 30 seconds. Previously we would not see the Non Trackable functions and we would get the original error message below. Now we are able to view the 2 Non Trackable functions and the row succeeds.

<img width="686" alt="Screenshot 2024-04-16 at 7 39 10 PM" src="https://github.com/near/queryapi/assets/42101107/b67e6e49-2f66-46d8-a41e-e1b51a6a2f06"> <img width="1378" alt="Screenshot 2024-04-16 at 8 02 24 PM" src="https://github.com/near/queryapi/assets/42101107/fbb4946e-8f23-4fc3-8765-0fad676897d1">

Original error: `ERROR: function fn_delete_partition(unknown, date, unknown, unknown) does not exist LINE 1: SELECT fn_delete_partition('kevin33_near_component_01.__logs... ^ HINT: No function matches the given name and argument types. You might need to add explicit type casts.`

Enable writes of Status and Last Processed Block Height to Metadata table. Reorganizes provisioning to ensure writing of PROVISIONING status. Ensures IndexerMeta is available for writing error logs.

- fix: Use compatible types across inter-dependant crates
- fix: Clippy

Each `BlockStream` uses its own dedicated `near-lake-framework` instance, and hence manages its own connection with S3. This leads to many duplicate S3 requests, particularly across the large majority of Indexers which follow the network tip, and therefore request the same block data at the same time.

This PR introduces a shared S3 client to be used across all `near-lake-framework` instances. `SharedLakeS3Client` ensures that duplicate requests made within a short time-frame, including those made in parallel, result in only a single request to S3.

## Cache Strategy

This implementation will mostly impact `BlockStream`s following the network tip, i.e. `From Latest`. These streams wait for new data in Near Lake S3 and all request it at the same time, as soon as it is available. Therefore, it would not be enough to cache the result alone: by the time we actually prime the cache, all other requests would have missed it and fired a request of their own. Locking while the request is in-flight is also not feasible, as this would force _every_ request to execute in sequence.

Instead of caching the result of the request, we cache its computation. The first request initiates the request and stores its `Future`, then all subsequent requests retrieve that `Future` from cache and `await` its result, ensuring only one underlying request at most.

## Performance Impact

My main concern with this implementation is the impact it will have on performance. Each request made must block to check the cache, introducing contention/delays. The lock is only held while checking the cache, and not while the request is being made, so my hope is that it does not impact performance too much. This may be something that needs to be iterated on over time. From local testing the impact seemed to be negligible, but that was with 5 Indexers, and it may be worse with many. I've added a metric to measure lock wait time, to determine whether this contention is becoming a problem.
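The "cache the computation, not the result" idea from the Cache Strategy section can be illustrated with a TypeScript analogue, where a `Promise` plays the role of the stored `Future`. This is a sketch of the technique only and not the Rust `SharedLakeS3Client`: `InflightCache` and `fetcher` are hypothetical names, no lock is needed because JavaScript runs this code on a single thread, and successful entries are never evicted here, whereas a real cache would need to bound its size.

```typescript
// Deduplicates concurrent requests by caching the in-flight Promise itself.
class InflightCache<V> {
  private readonly inflight = new Map<string, Promise<V>>();
  private readonly fetcher: (key: string) => Promise<V>;

  constructor(fetcher: (key: string) => Promise<V>) {
    this.fetcher = fetcher;
  }

  get(key: string): Promise<V> {
    // Later callers reuse the Promise stored by the first caller,
    // even if the underlying request has not completed yet.
    const existing = this.inflight.get(key);
    if (existing !== undefined) return existing;

    // First caller: start the request and store its Promise immediately,
    // before awaiting it, so parallel callers cannot miss the cache.
    const pending = this.fetcher(key);
    this.inflight.set(key, pending);

    // Drop failed requests so that a later call can retry.
    pending.catch(() => {
      if (this.inflight.get(key) === pending) this.inflight.delete(key);
    });
    return pending;
  }
}
```

Because the entry is stored before the request resolves, callers that arrive while the request is in flight all await the same `Promise`, which is the property a result-only cache lacks.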

Logs Table and Metadata Table are necessary for important functions of QueryApi. Hasura often invisibly fails to track tables and add permissions. These operations need to be successful, so I added a check which verifies tracking and permissions are correct, and reattempts them if not. When successful, the result is cached. In a successful case, the expensive hasura calls (getTableNames and getTrackedTablesWithPermissions) are done at most twice. I also combined the conditional provisioning functions since they are verified to work already.
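The verify, reattempt, then cache flow described above might look roughly like the sketch below. `TrackingBackend` and `TrackingVerifier` are hypothetical names introduced for illustration. They are not the Hasura client's real methods, which the description refers to as `getTableNames` and `getTrackedTablesWithPermissions`.

```typescript
// Hypothetical abstraction over the calls that inspect and fix tracking.
interface TrackingBackend {
  isTrackedWithPermissions(table: string): Promise<boolean>;
  trackAndAddPermissions(table: string): Promise<void>;
}

class TrackingVerifier {
  // Tables already confirmed as tracked with the correct permissions.
  private readonly verified = new Set<string>();
  private readonly backend: TrackingBackend;

  constructor(backend: TrackingBackend) {
    this.backend = backend;
  }

  async ensure(table: string): Promise<void> {
    // Cached success: skip the expensive checks entirely.
    if (this.verified.has(table)) return;

    if (!(await this.backend.isTrackedWithPermissions(table))) {
      // Tracking failed silently earlier, so reattempt and verify again.
      await this.backend.trackAndAddPermissions(table);
      if (!(await this.backend.isTrackedWithPermissions(table))) {
        throw new Error(`Failed to track table with permissions: ${table}`);
      }
    }
    // Only successful verification is cached; failures are rechecked next run.
    this.verified.add(table);
  }
}
```

In this sketch a successful run performs the check at most twice per table (once before and once after the reattempt), and never again once the result is cached.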

The logs and metadata tables were created with a `__` prefix. Unfortunately, it turns out that this prefix is reserved by Hasura. So, we are renaming the tables to be prefixed with `sys_`, which is not reserved, to the best of my knowledge. The specific process for the migration is:

1. Delete and recreate the cron DB in dev. This deletes the DB and any scheduled jobs.
2. Delete logs/metadata tables and any created partitions.
3. Create new tables.
4. Use new tables successfully.

…ilures (#675) Errors in provisioning should be logged to the machine as they can potentially be overwritten by errors in the finally block of the parent try catch. We ideally want to move the provisioning out to its own try catch but this is a simple fix for the time being. In addition the PRs to replace the logs/metadata tables failed due to untracking being partially successful. This PR allows untracking errors.

All tables have been migrated to using the new tables. This code hook for deleting old tables is no longer necessary.

This PR adds `winston` to introduce structured logging, and also write GCP compatible logs when `GCP_LOGGING_ENABLED` is set.

`context.set` was constructing an incorrect query #646 - this corrects that query.

Frontend now queries the new metadata table instead of the old one. <img width="1013" alt="image" src="https://github.com/near/queryapi/assets/22734869/4eae0f19-eda6-4e28-b244-cb78453aeeea"> Indexers which have not successfully run since the introduction of the new logs table will not have a last processed block height since they never processed a block successfully. So, I opted to set it as N/A and leave a tooltip on hover which says why it's N/A. <img width="1015" alt="image" src="https://github.com/near/queryapi/assets/22734869/38bc3565-41e1-4562-9966-371337e673fb">

My due diligence was insufficient. As it turns out, yarn.lock IS used by the frontend during development. Adding it back.

Creates a new `winston` transport method to count logs by level, and exposes a new prometheus metric to record this value. Additionally, metrics recorded on the main thread were not captured. This was not an issue as the majority of metrics were done within the worker. But since we also log on main thread, this PR updates metrics aggregation to expose main thread metrics.

darunrs and others added 30 commits March 15, 2024 13:56

Cache provisioning status (#607)

053483e

Checking provisioning status through Hasura takes 70-100ms on Dev. This PR caches the provisioning status inside of `Provisioner` so that it does not make extra requests to Hasura on every run.

Fix ESLint on DmlHandler (#612)

d4337a3

feat: Expose near-lake-primitives to VM (#613)

41ccc6a

- Upgrade `near-lake-primitives` to `0.2.0`, which includes `borsh`
- Expose entire `near-lake-primitives` library to VM via `primitives`, e.g. borsh can be accessed via `primitives.borsh.fromBorsh()`

fix: DmlHandler using wrong port (#630)

9810b0b

DmlHandler was using port number handed to it by Hasura, which is 5432. We want it to use 6432 which is the port specified by the env variable. 6432 points to pgBouncer.

chore: Disable new logs table while still in development (#632)

4e1ac2f

fix: Unresolved comments in #608 (#640)

89dadf5

Feat: created logEntry class and test cases
Chore: relocated createLogs to abstracted func
Chore: renamed schema idx to prefix with '__'

refactor: Convert IndexerConfig to Class (#646)

fe81959

Migrating any data related to the Indexer into a common class to simplify data interactions with things like AccountId, which are common. I've also added an integ test for context DB.

feat: Conditionally provision metadata table (#658)

e64bbfd

Enable conditional provisioning of metadata table.

fix: type generation on load (#648)

1db0cdd

The object post astify() we receive from node-sql-parser changed. We can think about Version Control in the future. Quick fix here: returning the object to how it was and adding an additional layer on Editor to ensure mounting of types

feat: Enable Metadata Table Writes (#659)

492d95c

Enable writes of Status and Last Processed Block Height to Metadata table. Reorganizes provisioning to ensure writing of PROVISIONING status. Ensures IndexerMeta is available for writing error logs.

fix: Use compatible versions across inter-dependant crates (#671)

bf0c121

- fix: Use compatible types across inter-dependant crates
- fix: Clippy

morgsmccauley and others added 12 commits April 18, 2024 20:46

feat: Add more metrics for Lake Cache (#672)

a9bd527

fix: Remove deletion of old logs and metadata tables (#679)

1a877eb

All tables have been migrated to using the new tables. This code hook for deleting old tables is no longer necessary.

feat: Write GCP compatible logs from Runner (#680)

6f46574

This PR adds `winston` to introduce structured logging, and also write GCP compatible logs when `GCP_LOGGING_ENABLED` is set.

fix: Write to winston instead of console (#681)

b168b2b

fix: Specify missing log levels (#682)

d89e00c

fix: Correct context.set graphql query (#683)

77f0ebd

`context.set` was constructing an incorrect query #646 - this corrects that query.

fix: Add back yarn frontend (#685)

c262f9f

My due diligence was insufficient. As it turns out, yarn.lock IS used by the frontend during development. Adding it back.

morgsmccauley requested a review from a team as a code owner April 22, 2024 21:41

morgsmccauley closed this Apr 22, 2024
