You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
QNN Compilation path Support in QEFFBaseModel class.
Signed-off-by: Shubham Agrawal <[email protected]>
Added enable_qnn as arg in _compile
Signed-off-by: Shubham Agrawal <[email protected]>
Reduced number of models tested for QNN, and skipped Whisper model
Signed-off-by: Shubham Agrawal <[email protected]>
:num_cores (int): Number of cores to utilize in each device ``Defaults to 16``.
99
99
:mxfp6_matmul (bool): Use MXFP6 to compress weights for MatMul nodes to run faster on device. ``Defaults to False``.
100
100
:mxint8_kv_cache (bool): Use MXINT8 to compress KV-cache on device to access and update KV-cache faster. ``Defaults to False``.
101
-
:compiler_options: Pass any compiler option as input. Any flag that is supported by ``qaic-exec`` can be passed. Params are converted to flags as below:
101
+
:compiler_options: Pass any compiler option as input.
102
+
Following flag can be passed in compiler_options to enable QNN Compilation path.
103
+
:enable_qnn (bool): Enables QNN Compilation. ``Defaults to False. if not passed.``
104
+
:qnn_config (str): Path of QNN Config parameters file. ``Defaults to None. if not passed``
105
+
for QAIC compilation path, any flag that is supported by ``qaic-exec`` can be passed. Params are converted to flags as below:
f"Custom IO file {custom_io_file_name} is not present at the expected path {custom_io_file_path}. Please pass the correct file path or rerun infer/export API"
f"Custom IO file {custom_io_file_name} is not present at the expected path {custom_io_file_path}. Please pass the correct file path or rerun infer/export API"
0 commit comments