# Models

This section covers how to optimize model performance in Seldon Core 2, from initial load testing to infrastructure setup and inference optimization. Each subsection gives detailed guidance on one area of model performance tuning:

## [Load Testing](/seldon-core-2/user-guide/performance-tuning/models/load-testing.md)

Learn how to conduct effective load testing to understand your model's performance characteristics:

* Determining load saturation points
* Understanding closed-loop vs. open-loop testing
* Determining the right number of replicas based on your configuration (model, infrastructure, etc.)
* Setting up reproducible test environments
* Interpreting test results for autoscaling configuration
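As a rough illustration of how a measured saturation point can feed into a replica count, the sketch below divides a target request rate by the usable throughput of a single replica. The function name `replicas_needed`, the `headroom` factor, and the example numbers are assumptions made for illustration; they are not part of Seldon Core 2, and real sizing should come from your own load tests.

```python
import math


def replicas_needed(target_rps: float,
                    saturation_rps_per_replica: float,
                    headroom: float = 0.7) -> int:
    """Estimate replicas for a target load (hypothetical helper).

    saturation_rps_per_replica: throughput at which one replica saturates,
        as measured in a load test.
    headroom: fraction of the saturation throughput we plan to use, so each
        replica runs below its saturation point (assumed value, tune it).
    """
    if target_rps < 0:
        raise ValueError("target_rps must be non-negative")
    if saturation_rps_per_replica <= 0:
        raise ValueError("saturation_rps_per_replica must be positive")
    if not 0 < headroom <= 1:
        raise ValueError("headroom must be in (0, 1]")

    usable_rps = saturation_rps_per_replica * headroom
    # Always keep at least one replica, even for zero expected load.
    return max(1, math.ceil(target_rps / usable_rps))


if __name__ == "__main__":
    # Illustrative numbers only: 500 req/s target, 120 req/s per replica.
    print(replicas_needed(500, 120))
```

This assumes throughput scales roughly linearly with replicas, which a load test against the multi-replica deployment should confirm.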

## [Inference](/seldon-core-2/user-guide/performance-tuning/models/inference.md)

Explore different approaches to optimize inference performance:

* Choosing between gRPC and REST protocols
* Implementing adaptive batching
* Optimizing input dimensions
* Configuring parallel processing with workers
* Understanding CPU vs. GPU utilization
* Optimizing your model artefact
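To make the idea of adaptive batching concrete, here is a simplified simulation of one common policy: a batch is closed when it reaches a maximum size or when a new request arrives after a maximum wait since the batch's first request. This is a sketch of the general technique under those assumptions, not the implementation used by Seldon Core 2 or its inference servers, and the names `adaptive_batches`, `max_batch_size`, and `max_batch_time` are illustrative here.

```python
from typing import List


def adaptive_batches(arrival_times: List[float],
                     max_batch_size: int,
                     max_batch_time: float) -> List[List[float]]:
    """Group sorted request arrival times (seconds) into batches.

    A batch is flushed when it holds max_batch_size requests, or when the
    next request arrives more than max_batch_time after the first request
    in the current batch.
    """
    if max_batch_size < 1:
        raise ValueError("max_batch_size must be at least 1")

    batches: List[List[float]] = []
    current: List[float] = []
    for t in arrival_times:
        # The time window of the open batch has expired: flush it first.
        if current and t - current[0] > max_batch_time:
            batches.append(current)
            current = []
        current.append(t)
        # The batch is full: flush it immediately.
        if len(current) == max_batch_size:
            batches.append(current)
            current = []
    if current:
        batches.append(current)
    return batches


if __name__ == "__main__":
    arrivals = [0.0, 0.01, 0.02, 0.5, 0.51]
    for batch in adaptive_batches(arrivals, max_batch_size=2, max_batch_time=0.1):
        print(batch)
```

Larger batches tend to improve throughput at the cost of added latency for the first request in each batch, so the size and time limits are worth tuning against load test results.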

## [Infrastructure Setup](/seldon-core-2/user-guide/performance-tuning/models/infrastructure-setup.md)

Understand how to configure the underlying infrastructure for optimal model performance:

* Choosing between CPU and GPU deployments
* Setting appropriate CPU specifications
* Configuring thread affinity
* Managing memory allocation
* Optimizing resource utilization

These three areas build on one another. We recommend starting with [load testing](/seldon-core-2/user-guide/performance-tuning/models/load-testing.md) to establish a baseline, then using those results to inform your [infrastructure setup](/seldon-core-2/user-guide/performance-tuning/models/infrastructure-setup.md) and [inference optimization](/seldon-core-2/user-guide/performance-tuning/models/inference.md) strategies.

