QNN HTP Predict
Predict responses of QNN model or QNN context binary for HTP (NPU) backend
Since R2025b
Libraries:
Embedded Coder Support Package for Qualcomm Hexagon Processors /
QNN
Description
The QNN HTP Predict block predicts responses of a deep learning network represented as a QNN model or QNN context binary for the HTP (NPU) backend of Qualcomm® AI Direct Engine based on the given input data.
You use the QNN HTP Predict block to select a QNN model as a compiled
shared object (.so) for running on a x86-based host. For the target, you
can select either a compiled shared object (.so) or a QNN context binary
file (.bin) that are optimized to run on HTP (NPU) backend.
The code generated using this block can be deployed to one of these boards that are available under the Hardware board parameter in Configuration Parameters:
Qualcomm Android Board
Qualcomm Linux Board
Qualcomm Hexagon Android Board, with Processor Version
cDSPQualcomm Hexagon Linux Board, with Processor Version
cDSP
The block also provides the option to dequantize outputs to single-precision, if required.
Ports
Input
Output
Parameters
Extended Capabilities
Version History
Introduced in R2025b