Max gRPC Message Size

Running this notebook

You will need to start Jupyter with settings to allow for large payloads, for example:

jupyter notebook --NotebookApp.iopub_data_rate_limit=1000000000
from IPython.core.magic import register_line_cell_magic


@register_line_cell_magic
def writetemplate(line, cell):
    with open(line, "w") as f:
        f.write(cell.format(**globals()))

Setup Seldon Core

Use the setup notebook to Setup Cluster with Ambassador Ingress and Install Seldon Core. Instructions also online.

!kubectl create namespace seldon
Error from server (AlreadyExists): namespaces "seldon" already exists
!kubectl config set-context $(kubectl config current-context) --namespace=seldon
Context "kind-kind" modified.
VERSION = !cat ../version.txt
VERSION = VERSION[0]
VERSION

We now add in our model config file the annotations "seldon.io/rest-timeout":"100000" and "seldon.io/grpc-timeout":"100000"

Create Seldon Deployment

Deploy the runtime graph to kubernetes.

Get predictions

Send a small request which should succeed.

Send a large request which will fail as the default for the model will be 4G.

Allowing larger gRPC messages

Now we change our SeldonDeployment to include a annotation for max grpx message size.

Send a request via ambassador. This should succeed.

Last updated

Was this helpful?