Based on watsonx requirements, we should make available at least these metrics:

- Number of inference requests over a defined time period
- Average response time over a defined time period
- Number of successful / failed inference requests over a defined time period
- Compute utilization (CPU, GPU, memory)
However, users won't find metrics with these exact names, and some of them need to be computed by combining others. Examples:

- Failed inference requests over a defined time period: you must do something like `tgi_batch_inference_count - tgi_batch_inference_success`, plus adding the time-period syntax.
- Memory consumption: there is no specific Istio/TGI/Caikit metric for it (at least, I didn't find one). Users could compute it with something similar to: `sum(container_memory_working_set_bytes{pod='<isvc_predictor_pod_name>', namespace='<isvc_namespace>', container=''}) by (pod, namespace)`
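To make the two computations above concrete, here are hedged PromQL sketches. The 1h window is an arbitrary placeholder for "defined time period", and they assume the TGI metrics are monotonic counters (suitable for `increase()`); the `<...>` label values are placeholders to be filled in by the user:

```promql
# Failed inference requests over the last hour (assumed window; adjust as needed):
# total batch inferences minus successful ones
sum(increase(tgi_batch_inference_count[1h]))
  - sum(increase(tgi_batch_inference_success[1h]))

# Pod-level working-set memory of the predictor pod
# (container="" selects the pod-level cgroup record in cAdvisor metrics)
sum(container_memory_working_set_bytes{pod="<isvc_predictor_pod_name>", namespace="<isvc_namespace>", container=""}) by (pod, namespace)
```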
Moreover, there are additional metrics that deserve to be documented, such as `tgi_request_generated_tokens_count`.
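For instance, assuming `tgi_request_generated_tokens_count` is a counter, a sketch of a token-throughput query (the 5m window is an arbitrary choice):

```promql
# Generated tokens per second, averaged over the last 5 minutes
sum(rate(tgi_request_generated_tokens_count[5m]))
```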