Health
The “server live” API indicates if the inference server is able to receive and respond to metadata and inference requests. The “server live” API can be used directly to implement the Kubernetes livenessProbe.
GET /v2/health/live HTTP/1.1
Host:
Accept: */*
OK
No content
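The liveness check above can be sketched as a small client. This is a minimal illustration using only the Python standard library, not a reference implementation: the names `live_url` and `is_server_live`, the `base_url` parameter, and the timeout value are hypothetical, and the sketch assumes that an HTTP 200 status means "live" while any error or other status means "not live".

```python
import http.client
import urllib.error
import urllib.request


def live_url(base_url: str) -> str:
    """Build the "server live" URL for a server rooted at base_url (hypothetical helper)."""
    return base_url.rstrip("/") + "/v2/health/live"


def is_server_live(base_url: str, timeout: float = 2.0) -> bool:
    """Return True if GET /v2/health/live answers with HTTP 200.

    Assumption: connection failures, timeouts, and non-200 statuses
    are all treated as "not live".
    """
    request = urllib.request.Request(live_url(base_url), headers={"Accept": "*/*"})
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status == 200
    except (urllib.error.URLError, http.client.HTTPException, OSError):
        # Refused connection, timeout, malformed reply, or HTTP error status.
        return False
```

For example, `is_server_live("http://localhost:8080")` would return `True` only if a server at that (hypothetical) address answers the request with status 200.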
The “model ready” health API indicates if a specific model is ready for inferencing. The model name and, optionally, the version must be given in the URL. If a version is not provided, the server may choose a version based on its own policies.
GET /v2/models/{model_name}/versions/{model_version}/ready HTTP/1.1
Host:
Accept: */*
OK
No content
The “model ready” health API can also be called without a version in the URL. In that case the server may choose a version based on its own policies and reports whether that model is ready for inferencing.
GET /v2/models/{model_name}/ready HTTP/1.1
Host:
Accept: */*
OK
No content
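The two “model ready” URL forms, with and without a version, can be covered by one helper. The following is a sketch under stated assumptions rather than an official client: `model_ready_path`, `is_model_ready`, `base_url`, and the timeout are hypothetical names and values, and HTTP 200 is assumed to mean "ready" while any error or other status means "not ready".

```python
import http.client
import urllib.error
import urllib.parse
import urllib.request
from typing import Optional


def model_ready_path(model_name: str, model_version: Optional[str] = None) -> str:
    """Build the "model ready" path, with or without a version (hypothetical helper)."""
    name = urllib.parse.quote(model_name, safe="")
    if model_version is None:
        # No version given: the server may pick one based on its own policies.
        return f"/v2/models/{name}/ready"
    version = urllib.parse.quote(model_version, safe="")
    return f"/v2/models/{name}/versions/{version}/ready"


def is_model_ready(
    base_url: str,
    model_name: str,
    model_version: Optional[str] = None,
    timeout: float = 2.0,
) -> bool:
    """Return True if the "model ready" endpoint answers with HTTP 200.

    Assumption: connection failures, timeouts, and non-200 statuses
    are all treated as "not ready".
    """
    url = base_url.rstrip("/") + model_ready_path(model_name, model_version)
    request = urllib.request.Request(url, headers={"Accept": "*/*"})
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status == 200
    except (urllib.error.URLError, http.client.HTTPException, OSError):
        return False
```

Percent-encoding the name and version keeps unusual characters from changing the path structure; whether a given server accepts such names is not something this sketch can confirm.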