Example
docker pull \
europe-west2-docker.pkg.dev/seldon-registry/llm/mlserver-local-embeddings:0.7.0!cat models/embed/model-settings.json{
"name": "embed",
"implementation": "mlserver_local_embeddings.LocalEmbeddingsRuntime",
"parameters": {
"extra": {
"backend": "sentence-transformers",
"config": {
"model_settings": {
"model_name_or_path": "all-MiniLM-L6-v2",
"device": "cpu"
}
}
}
}
}Starting the Runtime
Sending Requests
Deploying on Seldon Core 2
Last updated
Was this helpful?