File vart.py#
-
class Runner
Abstract base class for executing model inference operations.
The Runner class defines a unified interface for running synchronous and asynchronous inference tasks on machine learning models. It provides methods for retrieving tensor metadata, executing computations, and managing asynchronous job execution.
Key Features:
Query input and output tensor information, including support for zero-copy operations.
Perform synchronous inference with input and output tensors.
Submit asynchronous inference jobs and manage their lifecycle via job handles or callbacks.
Support for both polling/waiting and callback-based asynchronous execution models.