Tensorflow Benchmark
Using a pretrained model for Tensorflow flowers dataset
Load test the model at fixed rate
Benchmark the model to find maximum throughput and saturation handling
Setup
Create a 3 node GCP cluster with n1-standard-8 node
Install Seldon Core
TODO
gRPC
Run vegeta on separate node to model servers using affinity/taints
!kubectl create namespace seldonError from server (AlreadyExists): namespaces "seldon" already exists!kubectl config set-context $(kubectl config current-context) --namespace=seldonContext "do-lon1-k8s-1-16-10-do-0-lon1-1594477430912" modified.import sys
sys.path.append("../")
from vegeta_utils import *Put Taint on Nodes
Benchmark with Saturation Test
Run test to gather the max throughput of the model
Load Tests with HPA
Run with an HPA at saturation rate to check:
Latencies affected by scaling
Last updated
Was this helpful?