Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
manage configs
Manage and activate configuration files for the CLI
seldon -
seldon config activate - activate config
seldon config add - add config
seldon config deactivate - deactivate config
seldon config list - list configs
seldon config remove - remove config
get status for experiment
get status for experiment
seldon experiment - manage experiments
get list of experiments
get list of experiments and whether they are active
seldon experiment - manage experiments
manage models
load and unload and get status for models
seldon -
seldon model infer - run inference on a model
seldon model list - get list of models
seldon model load - load a model
seldon model metadata - get model metadata
seldon model status - get status for model
seldon model unload - unload a model
Learn more about using Seldon CLI commands
Seldon provides a CLI for easy management and testing of model, experiment, and pipeline resources. The Seldon CLI allows you to view information about underlying Seldon resources and make changes to them through the scheduler in non-Kubernetes environments. However, it cannot modify underlying manifests within a Kubernetes cluster. Therefore, using the Seldon CLI for control plane operations in a Kubernetes environment is not recommended.
The following table provides more information about when and where to use these command line tools.
Primary purpose
Simplifies management for non-Kubernetes users, abstracting control plane operations like load, unload, status.
Kubernetes-native, manages resources via Kubernetes Custom Resources (CRs) like Deployments, Pods, etc.
Control Plane Operations
Executes operations such as load
and unload
models through scheduler gRPC endpoints, without interaction with Kubernetes.
Interacts with Kubernetes, creating and managing CRs such as SeldonDeployments and other Kubernetes resources.
Data Plane Operations
Abstracts open inference protocol to issue infer
or inspect
requests for testing purposes.
Used indirectly for data plane operations by exposing Kubernetes services and interacting with them.
Visibility of Resources
Resources created using Seldon CLI are internal to the scheduler and not visible as Kubernetes resources.
All resources that are created using kubectl
are visible and manageable within the Kubernetes environment.
The CLI talks to 3 backend services on default endpoints:
The Seldon Core 2 Scheduler: default 0.0.0.0:9004
The Seldon Core inference endpoint: default 0.0.0.0:9000
The Seldon Kafka broker: default: 0.0.0.0:9092
These defaults will be correct when Seldon Core 2 is installed locally as per the docs. For Kubernetes, you will need to change these by defining environment variables.
For a default install into the seldon-mesh
namespace if you have exposed the inference svc
as a loadbalancer you will find it at:
Use above IP at port 80:
For a default install into the seldon-mesh
namespace if you have exposed the scheduler svc as a loadbalancer you will find it at:
Use above IP at port 9004:
The Kafka broker will depend on how you have installed Kafka into your Kubernetes cluster. Find the broker IP and use:
You can create a config file to manage connections to running Seldon Core 2 installs. The settings will override any environment variable settings.
The definition is shown below:
An example below shows an example where we connect via TLS to the Seldon scheduler using our scheduler client certificate:
To manage config files and activate them you can use the CLI command seldon config
which has subcommands to list, add, remove, activate and decative configs.
For example:
For running with Kubernetes TLS connections on the control and/or data plane, certificates will need to be downloaded locally. We provide an example script which will download certificates from a Kubernetes secret and store them in a folder. It can be found in hack/download-k8s-certs.sh
and takes 2 or 3 arguments:
e.g.:
get list of models
get the list of all models with their status
seldon model - manage models
- manage models
get status for server
get the status for a server
seldon server - manage servers
get list of servers
get the available servers, their replicas and loaded models
seldon server - manage servers
run inference on a pipeline
call a pipeline with a given input and get a prediction
seldon pipeline - manage pipelines
manage servers
get status for servers
seldon -
seldon server list - get list of servers
seldon server status - get status for server