Serving LightGBM models
Out of the box, `mlserver` supports the deployment and serving of `lightgbm` models. By default, it will assume that these models have been serialised using the `bst.save_model()` method.

In this example, we will cover how we can train and serialise a simple model, and then serve it using `mlserver`.
Training
To test the LightGBM Server, first we need to generate a simple LightGBM model using Python. Our model will be persisted as a file named `iris-lightgbm.bst`.
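A minimal sketch of such a training script, using scikit-learn's iris dataset (the split ratio and training parameters here are illustrative choices, not prescribed by `mlserver`):

```python
import lightgbm as lgb
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split

# Load the iris dataset and split it into train and test sets
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

# Train a simple multiclass LightGBM model
train_data = lgb.Dataset(X_train, label=y_train)
params = {
    "objective": "multiclass",
    "num_class": 3,
    "verbose": -1,
}
bst = lgb.train(params, train_data, num_boost_round=10)

# Persist the trained booster using the serialisation format
# that mlserver expects by default
bst.save_model("iris-lightgbm.bst")
```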
Serving
Now that we have trained and saved our model, the next step will be to serve it using `mlserver`. For that, we will need to create 2 configuration files:

- `settings.json`: holds the configuration of our server (e.g. ports, log level, etc.).
- `model-settings.json`: holds the configuration of our model (e.g. input type, runtime to use, etc.).
settings.json
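A minimal sketch of this file, enabling debug logging and leaving other fields (such as ports) at their defaults:

```json
{
    "debug": true
}
```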
model-settings.json
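A sketch of the model config, pointing the LightGBM runtime at the booster we saved above; the model name `iris-lightgbm` and the version string are illustrative choices:

```json
{
    "name": "iris-lightgbm",
    "implementation": "mlserver_lightgbm.LightGBMModel",
    "parameters": {
        "uri": "./iris-lightgbm.bst",
        "version": "v0.1.0"
    }
}
```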
Start serving our model
Now that we have our config in place, we can start the server by running `mlserver start .`. This needs to either be run from the same directory where our config files are, or point to the folder where they are.

Since this command will start the server and block the terminal, waiting for requests, it will need to be run in the background or on a separate terminal.
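For example, from the directory containing the two config files:

```bash
mlserver start .
```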
Send test inference request
We now have our model being served by `mlserver`. To make sure that everything is working as expected, let's send a request from our test set.

For that, we can use the Python types that `mlserver` provides out of the box, or we can build our request manually.
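As a sketch of the manual approach, assuming the server is listening on the default HTTP port `8080`, the model was registered as `iris-lightgbm` (matching the `name` in `model-settings.json` above), and the input name `predict` is an illustrative label:

```python
import requests
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split

# Recreate the same train/test split used during training
X, y = load_iris(return_X_y=True)
_, X_test, _, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Build a V2 inference request for the first test instance
x_0 = X_test[0:1]
inference_request = {
    "inputs": [
        {
            "name": "predict",
            "shape": list(x_0.shape),
            "datatype": "FP32",
            "data": x_0.flatten().tolist(),
        }
    ]
}

# POST against the model's V2 inference endpoint
endpoint = "http://localhost:8080/v2/models/iris-lightgbm/infer"
response = requests.post(endpoint, json=inference_request)

print(response.json())
print(f"Expected label: {y_test[0]}")
```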
As we can see above, the model predicted a probability for each class, and the probability of class `1` is the highest, close to `0.99`, which matches the expected label from the test set.