Tensorflow Benchmark

Using a pretrained model for Tensorflow flowers dataset

  • Load test the model at fixed rate

  • Benchmark the model to find maximum throughput and saturation handling

Setup

  • Create a 3 node GCP cluster with n1-standard-8 node

  • Install Seldon Core

TODO

  • gRPC

  • Run vegeta on separate node to model servers using affinity/taints

!kubectl create namespace seldon
Error from server (AlreadyExists): namespaces "seldon" already exists
!kubectl config set-context $(kubectl config current-context) --namespace=seldon
Context "do-lon1-k8s-1-16-10-do-0-lon1-1594477430912" modified.
import sys

sys.path.append("../")
from vegeta_utils import *

Put Taint on Nodes

Benchmark with Saturation Test

Run test to gather the max throughput of the model

Load Tests with HPA

Run with an HPA at saturation rate to check:

  • Latencies affected by scaling

Last updated

Was this helpful?