Prod Release 03/07/24 #851
Merged
Also runs `cargo fmt` (1st commit). Closes: #822
This PR introduces back pressure to the Redis Stream in Block Streamer, ensuring that the stream does not exceed a specified maximum length. This is achieved by blocking the `redis.publish_block()` call, intermittently polling the Stream length, and publishing once it falls below the configured limit.

To aid testing, the current `RedisClient` struct has been split into two:
- `RedisCommands` - a thin wrapper around Redis commands to make mocking possible.
- `RedisClient` - provides higher-level Redis functionality, e.g. "publishing blocks", utilising the above.

In most cases, `RedisClient` will be used; the split just allows us to test `RedisClient` itself.
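A minimal sketch of the back-pressure idea. The Block Streamer itself is Rust; this TypeScript/ioredis version, with its assumed stream key, limit, and poll interval, only illustrates the "poll until below the limit, then publish" approach:

```ts
import Redis from "ioredis";

const MAX_STREAM_LENGTH = 100;  // assumed configured limit
const POLL_INTERVAL_MS = 500;   // assumed polling interval

const redis = new Redis();

// Blocks until the stream has capacity, then publishes the block.
async function publishBlock(streamKey: string, blockHeight: number): Promise<void> {
  // Intermittently poll the stream length until it falls below the limit.
  while ((await redis.xlen(streamKey)) >= MAX_STREAM_LENGTH) {
    await new Promise((resolve) => setTimeout(resolve, POLL_INTERVAL_MS));
  }

  await redis.xadd(streamKey, "*", "block_height", blockHeight.toString());
}
```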
This PR adds a Node script to Runner for suspending Indexers due to inactivity. The script will:
1. Call Coordinator to disable the Indexer
2. Write to the Indexer's logs table to notify of the suspension

Note that as Coordinator is in a private network, you must tunnel to the machine to expose the gRPC server. This can be achieved by running the following in a separate terminal:
```sh
gcloud compute ssh ubuntu@queryapi-coordinator-mainnet -- -L 9003:0.0.0.0:9003
```

The following environment variables are required:
- `HASURA_ADMIN_SECRET`
- `HASURA_ENDPOINT`
- `PGPORT`
- `PGHOST`

All of these can be found in the Runner compute instance metadata:
```sh
gcloud compute instances describe queryapi-runner-mainnet
```

Usage: `npm run script:suspend-indexer -- <accountId> <functionName>`
… Separate Concerns (#830) Refactored the Editor component to TypeScript. This refactoring involved breaking the Editor file down into smaller chunks and separating concerns into distinct components. Also did some minor work converting the validators to TypeScript, as they are a major consumer within the Editor; this sets things up to later iterate on additional tests for the validators.
Promises without rejection handlers, i.e. `.catch` or `try`/`catch`, will throw "unhandled rejection" errors, which bubble up to the worker thread and cause it to exit. This PR adds handlers to the various `simultaneousPromises` triggered within the Executor, to avoid the behaviour described above.
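A minimal sketch of the pattern in TypeScript; the task names below are hypothetical stand-ins for the Executor's actual concurrent work:

```ts
// Hypothetical async tasks standing in for the Executor's concurrent work.
async function writeLogs(): Promise<void> { /* ... */ }
async function updateBlockHeight(): Promise<void> { /* ... */ }

async function runExecutorIteration(): Promise<void> {
  const simultaneousPromises: Promise<void>[] = [];

  // Each promise gets its own rejection handler, so a failure is logged
  // instead of surfacing as an "unhandled rejection" that exits the worker thread.
  simultaneousPromises.push(
    writeLogs().catch((err) => console.error("Failed to write logs", err)),
  );
  simultaneousPromises.push(
    updateBlockHeight().catch((err) => console.error("Failed to update block height", err)),
  );

  await Promise.all(simultaneousPromises);
}
```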
…843) The current methods for determining both Block Stream and Executor health are flawed. This PR addresses these flaws by adding new, more reliable metrics for use within Grafana.

### Block Streams

A Block Stream is considered healthy if `LAST_PROCESSED_BLOCK` is continuously incremented, i.e. we are continuously downloading blocks from S3. This is flawed for the following reasons:
1. When the Redis Stream is full, we halt the Block Stream, preventing it from processing more blocks
2. When a Block Stream is intentionally stopped, we no longer process blocks

To address these flaws, I've introduced a new dedicated metric: `BLOCK_STREAM_UP`, which:
- is incremented every time the Block Stream future is polled, i.e. the task is doing work. A static value means unhealthy.
- is removed when the Block Stream is stopped, so that it doesn't trigger the false positive described above

### Executors

An Executor is considered unhealthy if it has messages in the Redis Stream and no reported execution durations, the latter only being recorded on success. The inverse of this is used to determine "healthy". This is flawed for the following reasons:
1. We can't distinguish between a genuinely broken Indexer and one broken due to system failures
2. "Health" is only determined when there are messages in Redis, meaning we catch the issue later than we could

To address these, I have added the following metrics (see the sketch after this list):
1. `EXECUTOR_UP`, which is incremented on every Executor loop; like above, a static value means unhealthy.
2. `SUCCESSFUL_EXECUTIONS`/`FAILED_EXECUTIONS`, which track successful/failed executions directly, rather than inferring them from durations. This will be useful for tracking the health of specific Indexers, e.g. the `staking` indexer should never have failed executions.
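A minimal sketch of how such counters could be wired up with `prom-client` in Runner; the metric names follow the PR, but the label names, helper functions, and loop structure are assumptions rather than the actual implementation:

```ts
import { Counter } from "prom-client";

// Hypothetical stand-ins for the Runner's real message fetching and execution.
declare function fetchNextMessage(): Promise<object | undefined>;
declare function executeIndexerFunction(message: object): Promise<void>;

const EXECUTOR_UP = new Counter({
  name: "queryapi_runner_executor_up",
  help: "Incremented on every Executor loop; a static value means the loop has stalled",
  labelNames: ["indexer"],
});

const SUCCESSFUL_EXECUTIONS = new Counter({
  name: "queryapi_runner_successful_executions",
  help: "Count of successful executions per Indexer",
  labelNames: ["indexer"],
});

const FAILED_EXECUTIONS = new Counter({
  name: "queryapi_runner_failed_executions",
  help: "Count of failed executions per Indexer",
  labelNames: ["indexer"],
});

async function executorLoop(indexer: string): Promise<void> {
  while (true) {
    // Incremented unconditionally, so health can be judged even when
    // there are no messages in the stream.
    EXECUTOR_UP.labels(indexer).inc();

    const message = await fetchNextMessage();
    if (message === undefined) continue;

    try {
      await executeIndexerFunction(message);
      SUCCESSFUL_EXECUTIONS.labels(indexer).inc();
    } catch {
      FAILED_EXECUTIONS.labels(indexer).inc();
    }
  }
}
```

Incrementing `EXECUTOR_UP` before checking for messages is also what the follow-up change below ensures.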
We skip reporting metrics if there are no messages in the pre-fetch queue/Redis Stream. This is especially problematic for `EXECUTOR_UP`, as we won't increment the metric even though we are still processing. This PR moves the metrics logic so that metrics are always reported, even when there are no messages in the stream.
Set the "scroll past last line" flag in Monaco to true.
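For reference, the corresponding option in the Monaco editor API is `scrollBeyondLastLine`; a minimal sketch using the `monaco-editor` package directly (the container element and the other options here are assumptions):

```ts
import * as monaco from "monaco-editor";

// Allow the editor to scroll past the last line of the document.
const editor = monaco.editor.create(document.getElementById("editor")!, {
  value: "",
  language: "typescript",
  scrollBeyondLastLine: true,
});
```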
chore: Remove `println` (#838)
fix: Add `catch` blocks to prevent unhandled rejections (#842)