Serving HuggingFace Transformer Models
# Import required dependencies
import requests

Serving
model-settings.json

%%writefile ./model-settings.json
{
    "name": "transformer",
    "implementation": "mlserver_huggingface.HuggingFaceRuntime",
    "parameters": {
        "extra": {
            "task": "text-generation",
            "pretrained_model": "distilgpt2"
        }
    }
}

Send test inference request
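A minimal sketch of a test request, assuming the model above is already being served locally by MLServer (for example via `mlserver start .`) on its default HTTP port 8080. The payload follows the V2 inference protocol. The input name `args`, the prompt text, and the helper names `build_request` and `send_request` are illustrative assumptions, not taken from this page.

```python
import json

# Assumed endpoint: MLServer's default HTTP port plus the model name from model-settings.json.
INFER_URL = "http://localhost:8080/v2/models/transformer/infer"


def build_request(prompt):
    """Build a V2 inference protocol payload holding a single text prompt."""
    return {
        "inputs": [
            {
                "name": "args",  # assumed input name for the pipeline's positional argument
                "shape": [1],
                "datatype": "BYTES",
                "data": [prompt],
            }
        ]
    }


def send_request(payload, url=INFER_URL):
    """POST the payload and return the decoded JSON response (needs a running server)."""
    import requests  # imported lazily so the payload can be built without the dependency

    response = requests.post(url, json=payload)
    response.raise_for_status()
    return response.json()


inference_request = build_request("this is a test")
print(json.dumps(inference_request, indent=2))
# With the server running: send_request(inference_request)
```

Keeping payload construction separate from the HTTP call makes it possible to inspect the request body before any server is up.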
Using Optimum Optimized Models
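As a rough sketch, switching to an Optimum optimized model should only require one extra setting. The flag name `optimum_model` is an assumption about the HuggingFace runtime's options and should be checked against the installed version. Everything else mirrors the settings shown earlier.

```python
import json

# Same settings as before, plus a flag that (by assumption) asks the runtime
# to load the model through Optimum rather than plain Transformers.
optimum_settings = {
    "name": "transformer",
    "implementation": "mlserver_huggingface.HuggingFaceRuntime",
    "parameters": {
        "extra": {
            "task": "text-generation",
            "pretrained_model": "distilgpt2",
            "optimum_model": True,  # assumed option name
        }
    },
}

settings_json = json.dumps(optimum_settings, indent=4)
print(settings_json)  # contents for ./model-settings.json
```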
Send Test Request to Optimum Optimized Model
Testing Supported Tasks
Question Answering
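For question answering, a sketch of the two pieces that change: the task name in the settings and a request with two named inputs. The names `question` and `context` match the keyword arguments of the Transformers question-answering pipeline. That the runtime maps V2 input names onto them is an assumption, and the sample strings are made up for illustration.

```python
import json

qa_settings = {
    "name": "transformer",
    "implementation": "mlserver_huggingface.HuggingFaceRuntime",
    "parameters": {"extra": {"task": "question-answering"}},
}


def text_input(name, value):
    """Wrap one string as a V2 inference protocol input tensor."""
    return {"name": name, "shape": [1], "datatype": "BYTES", "data": [value]}


# Hypothetical sample question and context.
qa_request = {
    "inputs": [
        text_input("question", "What is the name of the server?"),
        text_input("context", "The server used in this example is called MLServer."),
    ]
}

print(json.dumps(qa_settings, indent=4))
print(json.dumps(qa_request, indent=2))
```

The request would be posted to the same `/v2/models/transformer/infer` endpoint once the server has been restarted with the new settings.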
Sentiment Analysis
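Sentiment analysis can be sketched the same way, using the Transformers task name `text-classification`. With no `pretrained_model` given, the pipeline's default model for the task would be used. The input name `args` and the sample sentence are illustrative assumptions.

```python
import json

sentiment_settings = {
    "name": "transformer",
    "implementation": "mlserver_huggingface.HuggingFaceRuntime",
    "parameters": {"extra": {"task": "text-classification"}},
}

# One sentence to classify, sent as a single BYTES element.
sentiment_request = {
    "inputs": [
        {
            "name": "args",  # assumed input name
            "shape": [1],
            "datatype": "BYTES",
            "data": ["This is terrible!"],  # hypothetical sample text
        }
    ]
}

print(json.dumps(sentiment_settings, indent=4))
print(json.dumps(sentiment_request, indent=2))
```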
GPU Acceleration
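The CPU and GPU runs compared in this section presumably differ only in the device the pipeline is placed on. The sketch below assumes the runtime accepts a `device` option that follows the Transformers pipeline convention (`-1` for CPU, `0` for the first CUDA device). The helper `make_settings` is hypothetical.

```python
import json


def make_settings(device):
    """Return model settings pinned to a device (-1 = CPU, 0 = first CUDA GPU)."""
    return {
        "name": "transformer",
        "implementation": "mlserver_huggingface.HuggingFaceRuntime",
        "parameters": {
            "extra": {
                "task": "text-generation",
                "device": device,  # assumed option, mirroring the pipeline argument
            }
        },
    }


cpu_settings = make_settings(-1)
gpu_settings = make_settings(0)
print(json.dumps(gpu_settings, indent=4))
```

Writing either dictionary to `./model-settings.json` and restarting the server allows the same test request to be timed on each device.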
Testing with CPU
Testing with GPU
Adaptive Batching with GPU
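Adaptive batching groups requests that arrive close together so the GPU can process them as one batch. A minimal sketch, assuming MLServer's `max_batch_size` and `max_batch_time` model settings. The values 128 and 1, the `device` option, and the helper `fire_concurrently` are illustrative assumptions. Batching only has an effect when requests overlap, so the helper issues them from several threads.

```python
import json
from concurrent.futures import ThreadPoolExecutor

batched_settings = {
    "name": "transformer",
    "implementation": "mlserver_huggingface.HuggingFaceRuntime",
    "max_batch_size": 128,  # illustrative: upper bound on requests grouped into one batch
    "max_batch_time": 1,    # illustrative: seconds to wait for a batch to fill
    "parameters": {"extra": {"task": "text-generation", "device": 0}},
}


def fire_concurrently(send, payloads, workers=16):
    """Call send(payload) from several threads so requests overlap and can be batched."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map returns results in the same order as the payloads.
        return list(pool.map(send, payloads))


print(json.dumps(batched_settings, indent=4))
```

Here `send` stands for any function that posts one payload to the inference endpoint and returns the response.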
