Max gRPC Message Size
Running this notebook
You will need to start Jupyter with settings to allow for large payloads, for example:
jupyter notebook --NotebookApp.iopub_data_rate_limit=1000000000from IPython.core.magic import register_line_cell_magic
@register_line_cell_magic
def writetemplate(line, cell):
with open(line, "w") as f:
f.write(cell.format(**globals()))Setup Seldon Core
Use the setup notebook to Setup Cluster with Ambassador Ingress and Install Seldon Core. Instructions also online.
!kubectl create namespace seldonError from server (AlreadyExists): namespaces "seldon" already exists!kubectl config set-context $(kubectl config current-context) --namespace=seldonContext "kind-kind" modified.VERSION = !cat ../version.txt
VERSION = VERSION[0]
VERSIONWe now add in our model config file the annotations "seldon.io/rest-timeout":"100000" and "seldon.io/grpc-timeout":"100000"
Create Seldon Deployment
Deploy the runtime graph to kubernetes.
Get predictions
Send a small request which should succeed.
Send a large request which will fail as the default for the model will be 4G.
Allowing larger gRPC messages
Now we change our SeldonDeployment to include a annotation for max grpx message size.
Send a request via ambassador. This should succeed.
Last updated
Was this helpful?