This package provides a MLServer runtime compatible with MLflow models.
You can install the runtime, alongside mlserver
, as:
The MLflow inference runtime introduces a new dict
content type, which decodes an incoming V2 request as a dictionary of tensors. This is useful for certain MLflow-serialised models, which will expect that the model inputs are serialised in this format.