Add a proper / minimal health check endpoint #1933

colearendt · 2021-09-03T00:45:55Z

Environment

PostgreSQL version: v8.0.0
PostgREST version: v8.0.0
Operating system:

Description of issue

Related to #1565

When running PostgREST in Kubernetes (I have been toying around with a Helm chart), there is no option to use a HEAD request as suggested in #1565. Further, my own /rpc/health_check endpoint requires a round-trip to the database. It would be nice if PostgREST had its own configurable health endpoint (if the path is configurable, then you do not have to worry about conflicts with user-defined tables/functions).

This is particularly useful when the schema generated by postgrest (and thus the response at /) is large, and we want health checks to be minimal as far as overhead / responsiveness is concerned. This is the case for readiness probes in Kubernetes.

My current health check implementation uses an /rpc function that returns a simple {} with HTTP 200:

BEGIN;

--
-- Name: health_check(); Type: FUNCTION; Schema: public; Owner: dba
--

CREATE OR REPLACE FUNCTION "public"."health_check"() RETURNS jsonb
    LANGUAGE plpgsql
    AS $$
BEGIN

RETURN '{}'::jsonb;

END;
$$;


ALTER FUNCTION "public"."health_check"() OWNER TO dba;
GRANT ALL ON FUNCTION "public"."health_check"() TO dba;
GRANT EXECUTE ON FUNCTION "public"."health_check"() TO PUBLIC;

COMMIT;


steve-chavez · 2021-09-03T01:06:39Z

Hm, so Kubernetes doesn't do HEAD requests kubernetes/kubernetes#49937 (don't see any strong objections in the issue, maybe they could add the capability)

if the path is configurable, then you do not have to worry about conflicts with user-defined tables/functions

Can you add headers for GET requests on Kubernetes? Another option might be supporting a special header at the root / endpoint.

(Related: #1891)

colearendt · 2021-09-03T10:49:16Z

Thanks for the quick response!! Ahh nice - I had missed the httpHead issue. It would be nice if Kubernetes added that 😞 It's a pity that the issue rotted.

Yes, a header would be possible as well (I am already using this since my health check is not in my default schema 😉 ) - headers are configurable with something like this:

readinessProbe:
  httpGet:
    httpHeaders:
      - name: Accept-Profile
        value: myschema
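
For testing such a probe outside the cluster, here is a minimal sketch of a client that behaves roughly like an httpGet probe with custom headers. The function name `probe` and the default timeout are assumptions made for illustration. The success range of 200 to 399 mirrors how Kubernetes evaluates HTTP probes.

```python
import urllib.error
import urllib.request


def probe(url, headers=None, timeout=1.0):
    """Return True if a GET to `url` answers with a status in [200, 400).

    `headers` is an optional dict, e.g. {"Accept-Profile": "myschema"}.
    """
    request = urllib.request.Request(url, headers=headers or {}, method="GET")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return 200 <= response.status < 400
    except (urllib.error.URLError, OSError):
        # urlopen raises HTTPError (a URLError) for 4xx/5xx responses, and
        # URLError/OSError for refused connections and timeouts.
        return False
```

A call such as `probe("http://localhost:3000/rpc/health_check", {"Accept-Profile": "myschema"})` would then exercise the same request the probe sends. The host and port in that call are assumptions about the local setup.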

steve-chavez · 2021-09-03T18:15:52Z

Cool. Then we'd need to find a suitable header for this. It could be a custom media type like the one we use for a single json object - Accept: application/vnd.pgrst.object+json.

We could also take advantage of this header to do #1526.

wolfgangwalther · 2021-09-06T12:42:44Z

Then we'd need to find a suitable header for this. It could be a custom media type like the one we use for a single json object - Accept: application/vnd.pgrst.object+json.

What about using Prefer: return=minimal on the root endpoint for that purpose? That sounds more like what we're trying to do here - which is kind of a poor-mans-HEAD-request, right?

steve-chavez · 2021-09-06T16:17:17Z

What about using Prefer: return=minimal on the root endpoint for that purpose?

Fully agree. That would solve the issue and also be consistent with what we have now.

steve-chavez · 2021-09-22T01:43:26Z

Another thing about using HEAD at the root endpoint is that it executes a couple of complex queries to build the OpenAPI output.

So this minimal root endpoint should execute a simpler query like select 1 to check the connection. Since we have the listener worker running, another option would be to check on AppState to see if we have a healthy connection.
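
The `select 1` idea can be sketched as follows. This is not PostgREST's implementation (PostgREST is written in Haskell), only an illustration against a generic Python DB-API connection. The function name `connection_is_healthy` is hypothetical, and `sqlite3` stands in for a PostgreSQL driver so that the sketch runs without a server.

```python
import sqlite3


def connection_is_healthy(conn):
    """Run a trivial query on a DB-API connection; True only if it succeeds."""
    try:
        cursor = conn.cursor()
        cursor.execute("SELECT 1")
        row = cursor.fetchone()
        cursor.close()
        return row is not None and row[0] == 1
    except Exception:
        # Any driver error (closed connection, network failure) means unhealthy.
        return False


if __name__ == "__main__":
    # sqlite3 is used purely as a stand-in for a real PostgreSQL connection.
    demo = sqlite3.connect(":memory:")
    print(connection_is_healthy(demo))
    demo.close()
    print(connection_is_healthy(demo))
```

The check still costs one round-trip, but it avoids the heavier queries behind the root endpoint.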

steve-chavez · 2021-09-24T02:55:33Z

There was an idea on #1526 (comment) about using a second port.

One alternative would be to use a secondary port to host endpoints for liveness/metrics etc. This would avoid creating a breaking change where we reserve e.g. /internal or a similar base path, and trivially allow users to not expose these endpoints externally.

So we could have a management-port=3001 config that enables an internal management api. The health checks could be reachable at

GET localhost:3001/healthy

This would also let us have a consistent interface for metrics (#1526).

@wolfgangwalther @colearendt WDYT?
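
To make the second-port idea concrete, here is a rough sketch in Python rather than PostgREST's Haskell. The `AppState` class, the `make_management_server` helper, and the choice of 503 for the unhealthy case are all assumptions for illustration. Only the port 3001 and the /healthy path come from the proposal above.

```python
import threading
from http.server import BaseHTTPRequestHandler, ThreadingHTTPServer


class AppState:
    """Hypothetical shared state, updated by whatever watches the DB connection."""

    def __init__(self):
        self._lock = threading.Lock()
        self._db_healthy = False

    def set_db_healthy(self, healthy):
        with self._lock:
            self._db_healthy = healthy

    def is_db_healthy(self):
        with self._lock:
            return self._db_healthy


def make_management_server(state, host="127.0.0.1", port=3001):
    """Build (but do not start) a server exposing GET /healthy on its own port."""

    class Handler(BaseHTTPRequestHandler):
        def do_GET(self):
            if self.path != "/healthy":
                self.send_response(404)
            elif state.is_db_healthy():
                self.send_response(200)
            else:
                self.send_response(503)  # assumption: 503 signals "not healthy"
            self.send_header("Content-Length", "0")
            self.end_headers()

        def log_message(self, format, *args):
            pass  # keep probe traffic out of the logs

    return ThreadingHTTPServer((host, port), Handler)
```

Running `make_management_server(state).serve_forever()` in a background thread next to the main API keeps the health route off the public port, and reading the shared state avoids a database round-trip per probe.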

wolfgangwalther · 2021-09-26T10:00:36Z

So we could have a management-port=3001 config that enables an internal management api. The health checks could be reachable at

How would that work with unix sockets?

Maybe we can just make it a management-endpoint='internal' or similar config to enable the /internal endpoint, which could then serve all kinds of stuff (/internal/healthy, /internal/metrics, ...)?

Disabled by default, thereby no breaking change and we don't hardcode any endpoint that will then be blocked for regular use.

steve-chavez · 2021-09-26T21:56:01Z

How would that work with unix sockets?

Hm, I guess the same - one extra unix socket for the management api, i.e. a config like management-unix-socket.

Maybe we can just make it a management-endpoint='internal

This seems simpler though.

which could then serve all kinds of stuff (/internal/healthy, /internal/metrics, ...)?

Instead of nested routes, maybe they could use query strings, like /internal?select=healthy or /internal?select=metrics. This would be somewhat consistent to our design and perhaps we can reuse some functions.
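
As a rough illustration of the query-string variant, the routing decision could look like the sketch below. The function name `select_management_view` and the set of known selections are hypothetical. Only the /internal path and the healthy and metrics values come from the suggestion above.

```python
from urllib.parse import parse_qs, urlsplit

# Assumption: only the two views mentioned in the discussion are recognized.
KNOWN_SELECTIONS = {"healthy", "metrics"}


def select_management_view(target, base_path="/internal"):
    """Return the requested view for e.g. '/internal?select=healthy', else None."""
    parts = urlsplit(target)
    if parts.path != base_path:
        return None
    values = parse_qs(parts.query).get("select", [])
    # Require exactly one recognized value; anything else is rejected.
    if len(values) != 1 or values[0] not in KNOWN_SELECTIONS:
        return None
    return values[0]
```

Rejecting unknown or repeated `select` values keeps the endpoint from silently answering requests it does not understand.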

One downside of the management-endpoint in comparison to the second port/socket is that it's exposed to the public by default. Perhaps we should require a JWT to consume this endpoint?

steve-chavez · 2021-09-26T22:22:25Z

How would that work with unix sockets?

Maybe this doesn't matter in this case though, if you can hit the admin unix socket you might as well do a systemctl status postgrest.service. The use case is having an admin interface for other http services like kubernetes/prometheus.

(Down the road, we might get requests to implement these sorts of endpoints - reload, quit, ...)

colearendt · 2021-09-27T02:24:15Z

I do like the second port approach, and find that to be consistent with what other services offer, especially with prometheus metrics, etc. Most health-check libraries (Kubernetes, Prometheus, etc.) do not have great support for authentication (or run in a non-interactive context where authentication is challenging), so having the endpoint be unauthenticated is ideal.

Ensuring that the health check is actually tied to the health of the other service / database connection is key too. I have seen applications where the "main" service goes down, but the health check stays up / happy 🙈 😢

steve-chavez · 2022-01-08T18:59:47Z

There are now proper health check endpoints on the latest pre-release: https://github.com/PostgREST/postgrest/releases/tag/v9.0.0.20220107

steve-chavez mentioned this issue Sep 3, 2021

Feature Request: metrics or instrumentation #1526

Closed

steve-chavez added enhancement a feature, ready for implementation difficulty: beginner Pure Haskell task labels Sep 6, 2021

steve-chavez removed the difficulty: beginner Pure Haskell task label Dec 15, 2021

steve-chavez mentioned this issue Dec 17, 2021

feat: minimal health check #2092

Merged

4 tasks

steve-chavez closed this as completed in #2092 Dec 23, 2021
