File vart.py#

class Runner

Abstract base class for executing model inference operations.

The Runner class defines a unified interface for running synchronous and asynchronous inference tasks on machine learning models. It provides methods for retrieving tensor metadata, executing computations, and managing asynchronous job execution.

Key Features:

  • Query input and output tensor information, including support for zero-copy operations.

  • Perform synchronous inference with input and output tensors.

  • Submit asynchronous inference jobs and manage their lifecycle via job handles or callbacks.

  • Support for both polling/waiting and callback-based asynchronous execution models.