ONNX Runtime Python Inference#

Connect to the Board#

Refer to the Board Setup and OSPI and SD Card Boot Flow to setup and boot the VEK385 board.

Model Execution#

Using Directory Structure Format:

Locate your compiled model directory: cache_dir/cache_key

Copy the entire cache_key directory to your target board:

scp -r cache_dir/cache_key user@target-board:/path/to/models/

On the target board, point your inference application to the copied directory
Verify the vaiml_par_0 subdirectory and all files are present

Using Flat-Buffer Format (.rai file):

Locate your compiled model file: cache_dir/cache_key/cache_key.rai

Copy the .rai file to your target board:

scp cache_dir/cache_key/cache_key.rai user@target-board:/path/to/models/

On the target board, point your inference application to the cache_key directory that contains the .rai file
The model is loaded using memory-mapped access for efficient inference

Run the Application#

After copying the compiled model to the board, run the application:

python3 <running_script_name>.py

During the execution of the Python script, the ONNX session automatically detects the presence of a pre-compiled model within the current directory, thereby bypassing any model recompilation process.

ONNX Runtime Python Inference

Contents

ONNX Runtime Python Inference#

Connect to the Board#

Model Execution#

Run the Application#