Installation

This section provides instructions for installing the Model Performance Metrics module on a Kubernetes cluster. The module is deployed using a Helm chart stored in a private Google Artifact Registry managed by Seldon. The Helm chart references a Docker image stored in the same registry.

Prerequisites

Before installing the module, ensure you have the following:

  • Access to the private Google Artifact Registry where the Helm chart and the Docker image are stored.

  • Kubernetes cluster configured and running.

  • Seldon Core 2 installed and running on the Kubernetes cluster.

  • Kafka installed, configured with Seldon Core 2 and running on the Kubernetes cluster or reachable from it.

  • PostgreSQL installed and running on the Kubernetes cluster, or reachable from it.

  • Istio installed and running on the Kubernetes cluster (optional). Only needed if you wish to expose the module outside the cluster; an Istio Gateway must also be set up.

Components compatibility matrix

  • Kubernetes: minor versions 1.27 or later. Required for deploying the Helm chart.

  • Seldon Core 2: minor versions 2.8 or later. Required for producing inference response events to Kafka.

  • Kafka: versions 3.3.1 or later. Required by the Kafka consumer, which consumes inference responses produced by Seldon Core 2.

  • PostgreSQL: versions 10 to 15. Required for storing metrics data, inference responses, and more.

  • Istio: minor versions 1.17 or later. Optional; used to expose the Model Performance Metrics module outside the cluster.

Required CLIs

Ensure you have the following CLIs installed on your local machine:

  • kubectl

  • docker

  • Helm

  • gcloud

Accessing the artifacts

Note

In the following examples, we assume you have received the credentials in a JSON format and stored them in a file named credentials.json.

Authenticate to the Google Artifact Registry

To be able to pull the Helm chart and the Docker image from the Google Artifact Registry, you need to authenticate with the Docker CLI first.

REGISTRY=europe-west2-docker.pkg.dev
cat credentials.json | docker login -u _json_key --password-stdin ${REGISTRY}

The result of this command should be a successful login to the Google Artifact Registry:

Login Succeeded

To pull the Docker image, you can use the following command:

docker pull europe-west2-docker.pkg.dev/seldon-registry/metrics-server/metrics-server:0.1.0

The result of this command should be a successful pull of the Docker image.

To be able to pull the Helm chart from the Google Artifact Registry, you can use the following command:

helm pull oci://europe-west2-docker.pkg.dev/seldon-registry/charts/metrics-server --version 0.1.0 --untar

This command will pull the Helm chart from the Google Artifact Registry and extract it in the current directory. You can then navigate to the directory and inspect the Helm chart.

Note

If you prefer to pull the Helm chart in a tar format, you can remove the --untar flag from the command.

Helm Chart Components

The Helm chart consists of the following components:

  • Chart.yaml: Contains the metadata of the Helm chart.

  • templates/: Contains the Kubernetes resources to be deployed on the cluster.

    • deployment.yaml: Contains the deployment configuration for the service.

    • service.yaml: Contains the service configuration for the service.

    • virtualservice.yaml: Contains the VirtualService (Istio) configuration for the service (optional).

  • values.yaml: Contains the default values for the Helm chart.

Configuring the Helm Chart

Helm Chart Values

After downloading the Helm chart, you can inspect the values.yaml file to see the default values for the Helm chart. There are values that must be set to install the Chart, values that must be set for the module to initialise successfully and values that are optional. The following table lists all the possible values, their description and their importance.

Chart

  • appName: The name of the application, used to label the resources created by the Helm chart. (Required to install the chart)

  • namespace: The namespace where the Deployment, the Service and, optionally, the VirtualService will be deployed. (Required to install the chart)

  • image: The Docker image used by the service. This should be the full path to the image in the Google Artifact Registry, excluding the tag. (Required to install the chart)

  • imageTag: The Docker image version tag. Docker images are tagged using Semantic Versioning. (Required to install the chart)

  • imagePullSecretName: The name of the Kubernetes secret containing the credentials to pull the Docker image from the Google Artifact Registry. (Required to install the chart)

General

  • logLevel: The log level for the module. The value is case-insensitive and can be one of "disabled", "trace", "debug", "info", "warn", "error", "fatal" or "panic". Defaults to "info". (Optional for the module)

Virtual Service

  • istioVirtualService.create: Whether or not to create an Istio VirtualService. Defaults to false. (Optional to create the VirtualService)

  • istioVirtualService.gateway: The gateway name for the Istio VirtualService. Must be supplied if istioVirtualService.create is true. An example value is istio-system/seldon-gateway. (Optional to create the VirtualService)

  • istioVirtualService.path: The prefix path that follows the external load balancer IP and precedes the Model Performance Metrics module's endpoints. An example value is /metrics-server/. (Optional to create the VirtualService)

Kafka Metadata

  • kafka.metadata.retryMax: The total number of times to retry a metadata request while the cluster is in the middle of a leader election. Defaults to 3. (Optional for the module)

  • kafka.metadata.retryBackoff: How long to wait for leader election to occur before retrying. Defaults to 250ms. (Optional for the module)

  • kafka.metadata.refreshFrequency: How frequently to refresh the cluster metadata in the background. Defaults to 10s. (Optional for the module)

  • kafka.metadata.full: Whether to maintain a full set of metadata for all topics, or only the minimal set that has been necessary so far. Defaults to true. (Optional for the module)

  • kafka.metadata.allowAutoTopicCreation: Whether to allow auto-creation of topics on metadata refresh. For production environments, it is recommended to disable automatic topic creation. Defaults to true. (Optional for the module)
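As an illustration, disabling automatic topic creation for a production deployment only requires the corresponding override in values.yaml (a sketch; all other metadata settings keep their defaults):

```yaml
kafka:
  metadata:
    # Recommended for production: do not auto-create topics on metadata refresh.
    allowAutoTopicCreation: false
```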

Kafka Auth

  • kafka.auth.sasl.enabled: Whether or not to enable SASL_SSL (with SCRAM) authentication. Defaults to false. (Optional to authenticate securely)

  • kafka.auth.sasl.mechanism: The SASL mechanism to use. Accepted values are "SCRAM-SHA-512" and "SCRAM-SHA-256". Defaults to "SCRAM-SHA-512". (Optional to authenticate securely)

  • kafka.auth.sasl.username: The username to use for SASL authentication. Defaults to "seldon". (Optional to authenticate securely)

  • kafka.auth.sasl.passwordSecretName: The name of the secret containing the password for SASL authentication. Defaults to "kafka-client-auth". (Optional to authenticate securely)

  • kafka.auth.sasl.passwordSecretKey: The key in the secret containing the password for SASL authentication. Defaults to "password". (Optional to authenticate securely)

  • kafka.auth.sasl.caCertLocation: The location of the CA certificate file. Defaults to "/tmp/certs/kafka/broker/ca.crt". (Optional to authenticate securely)

  • kafka.auth.sasl.caCertSecretName: The name of the secret containing the CA certificate. Defaults to "kafka-broker-auth". (Optional to authenticate securely)

  • kafka.auth.sasl.caCertSecretKey: The key in the secret containing the CA certificate. Defaults to "ca.crt". (Optional to authenticate securely)
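Putting these values together, a values.yaml fragment enabling SASL_SSL authentication might look like the following sketch. The secret names shown are the documented defaults and are assumptions here; they must match secrets that already exist in the cluster:

```yaml
kafka:
  auth:
    sasl:
      enabled: true
      mechanism: "SCRAM-SHA-512"
      username: "seldon"
      passwordSecretName: "kafka-client-auth"  # secret holding the SCRAM password
      passwordSecretKey: "password"
      caCertSecretName: "kafka-broker-auth"    # secret holding the broker CA certificate
      caCertSecretKey: "ca.crt"
```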

Kafka Consumer

  • kafka.consumer.bootstrapServer: The Kafka bootstrap server address, including a port. Used by the Kafka consumer to connect to the Kafka broker. Defaults to "seldon-kafka-bootstrap.kafka.svc.cluster.local:9092". (Required to run the module)

  • kafka.consumer.topicPrefix: The prefix of the topics the Model Performance Metrics (MPM) module will attempt to consume messages from. Defaults to seldon. (Optional for the module)

  • kafka.consumer.pollInterval: How often the Kafka consumer within MPM checks for new topics matching a regex and starts consuming messages from them. Defaults to 10s. (Optional for the module)

  • kafka.consumer.offsets.autoCommitEnable: Whether or not to auto-commit updated offsets back to the broker. Defaults to true. (Optional for the module)

  • kafka.consumer.offsets.autoCommitInterval: How frequently to commit updated offsets. Has no effect unless auto-commit is enabled. Defaults to 1s. (Optional for the module)

  • kafka.consumer.offsets.initial: The initial offset to use if no offset was previously committed. Accepted values are "oldest" and "newest". Defaults to "newest". (Optional for the module)

PostgreSQL

  • postgres.host: The host of the PostgreSQL database, excluding the port number. Defaults to "localhost". (Required to run the module)

  • postgres.port: The port of the PostgreSQL database. Defaults to 5432. (Required to run the module)

  • postgres.databaseName: The name of the database to use. Defaults to "seldon_metrics". (Required to run the module)

  • postgres.user: The username to use for authentication. Defaults to "seldon". (Required to run the module)

  • postgres.secretName: The name of the secret containing the password. Defaults to "password". (Required to run the module)

  • postgres.secretKey: The key in the secret containing the password. Defaults to "password". (Required to run the module)

  • postgres.sslmode: Controls the encryption and verification of PostgreSQL connections. Accepted values are "disable", "allow", "prefer" and "require"; "verify-ca" and "verify-full" are not supported. Defaults to "prefer". (Optional for the module)

An example values.yaml file with the minimal values required to install the Helm chart is shown below:

appName: metrics-server
namespace: seldon-system
image: europe-west2-docker.pkg.dev/seldon-registry/metrics-server/metrics-server
imageTag: 0.1.0
imagePullSecretName: seldon-registry

kafka:
  consumer:
    bootstrapServer: "localhost:9092"

postgres:
  host: "localhost"
  port: 5432
  databaseName: "seldon_metrics"
  user: "seldon"
  secretName: "seldon-metrics-postgres"
  secretKey: "password"
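The example above references a secret named seldon-metrics-postgres, which must exist in the target namespace before the module starts. A minimal Secret manifest could look like this sketch; the password value is a placeholder, and the name, namespace, and key must match the corresponding Helm values:

```yaml
apiVersion: v1
kind: Secret
metadata:
  name: seldon-metrics-postgres  # must match postgres.secretName in values.yaml
  namespace: seldon-system       # must match the namespace value in values.yaml
type: Opaque
stringData:
  password: "<your-postgres-password>"  # key must match postgres.secretKey
```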

Notes

  • When kafka.consumer.bootstrapServer is provided and kafka.auth.sasl.enabled is set to false, the module will attempt to establish a connection to the Kafka broker using the PLAINTEXT protocol.

  • When kafka.auth.sasl.enabled is set to true, the module will attempt to establish a connection to the Kafka broker using the SASL_SSL protocol. In such cases, the port of the Kafka bootstrap server could be different. For example, 9093 instead of 9092.

  • The Kafka consumer only supports a single replica of the module's Deployment, which is reflected in its spec: replicas: 1.

Creating a Kubernetes Secret (Optional)

After you have successfully pulled the Docker image and the Helm chart and configured the Helm values for your environment, you may need to create a Kubernetes secret so that the Deployment can pull the Docker image from the Google Artifact Registry.

To create a Kubernetes secret with the credentials, you can use the following command:

NAMESPACE=seldon-system
REGISTRY=europe-west2-docker.pkg.dev
CREDENTIALS=$(cat credentials.json)
SECRET_NAME=seldon-registry

kubectl create secret docker-registry ${SECRET_NAME} \
	--docker-server="${REGISTRY}" \
	--docker-username="_json_key" \
	--docker-password="${CREDENTIALS}" \
	--dry-run=client -o yaml | kubectl apply -n ${NAMESPACE} -f -

You can verify the secret was created successfully by running:

kubectl get secret ${SECRET_NAME} -n ${NAMESPACE}

Installing the Helm Chart

After you have configured the Helm chart values and created the Kubernetes secret, you can install the Helm chart on the Kubernetes cluster using the modified values.yaml:

helm install metrics-server ./metrics-server

To uninstall the Helm chart, you can use the following command:

helm uninstall metrics-server

Ingress Controller (Optional)

An ingress controller functions as a reverse proxy and load balancer, implementing a Kubernetes Ingress. It adds an abstraction layer for traffic routing by receiving traffic from outside the Kubernetes platform and load balancing it to Pods running within the Kubernetes cluster.

The Model Performance Metrics will work with any service mesh or ingress controller, offering flexibility in your deployment setup. In this example, we are going to use Istio to expose the module and make it accessible from outside the Kubernetes cluster.

Install Istio Base Component

  1. Add the Istio Helm charts repository and update it:

helm repo add istio https://istio-release.storage.googleapis.com/charts
helm repo update

  2. Create the istio-system namespace where Istio components are installed:

kubectl create namespace istio-system

  3. Install the base component:

helm install istio-base istio/base -n istio-system

  4. Install Istiod, the Istio control plane:

helm install istiod istio/istiod -n istio-system --wait

Install Istio Ingress Gateway deployment

  1. Install the Istio Ingress Gateway deployment:

helm install istio-ingressgateway istio/gateway -n istio-system

  2. Verify it is installed:

kubectl get svc istio-ingressgateway -n istio-system

This should return details of the Istio Ingress Gateway service of type LoadBalancer, including the external IP address.

Notes

If your Istio Ingress Gateway service shows <pending> under EXTERNAL-IP, it means that Kubernetes is still waiting to assign an external IP address to the Istio Ingress Gateway LoadBalancer service.

When using a cloud provider:

  • If the cloud provider of choice supports automatic LoadBalancer provisioning, Kubernetes should create one and assign an external IP after a few minutes.

  • If the cloud provider of choice does not support automatic LoadBalancer provisioning, you have to manually reserve an external IP and assign it to the service. Consult the cloud provider's documentation for details.

When using a local Kubernetes cluster:

  • In a local Kubernetes cluster(e.g. Minikube, Kind), there is no external cloud provider to provision the IP automatically.

  • You can install MetalLB to provide a load balancer implementation for bare metal Kubernetes clusters.

Install MetalLB in a local Kubernetes cluster (Optional)

kubectl apply -f https://raw.githubusercontent.com/metallb/metallb/main/config/manifests/metallb-native.yaml

This will install MetalLB in the metallb-system namespace.

  • Determine the IPv4 subnet of the corresponding docker network (assuming the cluster was created with kind and is running in Docker):

docker network inspect kind | jq -r '.[0].IPAM.Config[0].Subnet'

An example output could be 172.18.0.0/16
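If you prefer to compute the address pool range rather than editing it by hand, the x.y prefix can be derived in shell. This is a sketch assuming the /16 subnet shown above:

```shell
# Derive the MetalLB address pool range from the kind docker network subnet.
SUBNET="172.18.0.0/16"                      # value printed by the previous command
PREFIX=$(echo "${SUBNET}" | cut -d. -f1-2)  # first two octets, e.g. 172.18
RANGE="${PREFIX}.255.1-${PREFIX}.255.255"   # address pool for metallb-config.yaml
echo "${RANGE}"
```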

  • Create a MetalLB configuration manifest named metallb-config.yaml, replacing x.y with the first two octets of the subnet, in this case 172.18:

---
apiVersion: metallb.io/v1beta1
kind: IPAddressPool
metadata:
  name: kind-pool
  namespace: metallb-system
spec:
  addresses:
  - x.y.255.1-x.y.255.255

---

apiVersion: metallb.io/v1beta1
kind: L2Advertisement
metadata:
  name: advertisement
  namespace: metallb-system

Apply the manifest with kubectl apply -f metallb-config.yaml. After both resources have been applied, MetalLB should be able to assign an external IP address to the Istio Ingress Gateway LoadBalancer service. To verify this, you can run:

kubectl get svc istio-ingressgateway -n istio-system

Install Istio Gateway

You will need to create a Kubernetes Gateway resource to expose the Model Performance Metrics module to the outside world. The Gateway resource will be used by the VirtualService resource to route traffic to the module.

apiVersion: networking.istio.io/v1beta1
kind: Gateway
metadata:
  name: seldon-gateway
  namespace: istio-system
spec:
  selector:
    istio: ingressgateway
  servers:
    - port:
        number: 80
        name: http
        protocol: HTTP
      hosts:
        - "*"

Install the Virtual Service for the Model Performance Metrics module

  • Either set the following values in the values.yaml file:

istioVirtualService:
  create: true
  gateway: istio-system/seldon-gateway # referencing the name of the Gateway resource
  path: /metrics-server/ # the prefix path to the module's endpoints

You can then install the Helm chart with the modified values.yaml file:

helm install metrics-server ./metrics-server
  • Or create the VirtualService resource manually in a file called virtual-service.yaml.

Update the gateway, path, and destination values to match your configuration, then apply the VirtualService resource to the Kubernetes cluster:
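As a sketch, a virtual-service.yaml for the example configuration used throughout this guide might look like the following. The service name and namespace are assumed to match the appName and namespace Helm values, the gateway references the Gateway resource created above, and the service port (80 here) is an assumption that must match the port defined in the chart's service.yaml:

```yaml
apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
  name: metrics-server
  namespace: seldon-system
spec:
  hosts:
    - "*"
  gateways:
    - istio-system/seldon-gateway  # the Gateway resource created earlier
  http:
    - match:
        - uri:
            prefix: /metrics-server/  # must match the external prefix path
      route:
        - destination:
            host: metrics-server.seldon-system.svc.cluster.local
            port:
              number: 80  # assumption: must match the chart's Service port
```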

kubectl apply -f virtual-service.yaml

Verify the installation

You can verify that the Model Performance Metrics module is accessible from outside the Kubernetes cluster.

Obtain the external IP address of the Istio Ingress Gateway service:

kubectl get svc istio-ingressgateway -n istio-system -o jsonpath='{.status.loadBalancer.ingress[0].ip}'

Then, you can access the Model Performance Metrics module at http://<EXTERNAL_IP>/metrics-server/api/v1/pipeline/subscriptions, which should return an empty JSON list with a 200 status code:

curl -v http://<EXTERNAL_IP>/metrics-server/api/v1/pipeline/subscriptions
