Reading mxfp6_matmul for QNN Compilation path from compile API arguments #499

shubhagr-qc · 2025-07-07T10:41:36Z

Added mxfp6_matmul as a recognized argument in QEFFBaseModel::_compile. This is required because, for VLM models, QEfficient overwrites the mxfp6 value to false for the vision component. To use that same value on the QNN compilation path, mxfp6_matmul is now read from the compile API arguments, and the parameter can no longer be overridden from qnn_config.json.

Signed-off-by: Shubham Agrawal <[email protected]>
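To illustrate the precedence described above, here is a minimal sketch, not the code from this PR. The names resolve_qnn_options, IMMUTABLE_KEYS, and num_cores are hypothetical and chosen only for illustration. The sketch assumes the immutable key is silently ignored when it appears in the JSON config; the actual change may handle that case differently (for example, by raising an error).

```python
import json
from typing import Any, Dict, Optional

# Hypothetical: keys that a user-supplied QNN JSON config may not override.
IMMUTABLE_KEYS = {"mxfp6_matmul"}


def resolve_qnn_options(
    compile_args: Dict[str, Any],
    qnn_config_json: Optional[str] = None,
) -> Dict[str, Any]:
    """Merge compile API arguments with an optional JSON config string.

    Values from the JSON config win, except for keys in IMMUTABLE_KEYS,
    which always keep the value passed through the compile arguments.
    """
    options = dict(compile_args)  # copy so the caller's dict is not mutated
    if qnn_config_json:
        user_cfg = json.loads(qnn_config_json)
        for key, value in user_cfg.items():
            if key in IMMUTABLE_KEYS:
                # Assumption: ignore the override rather than fail.
                continue
            options[key] = value
    return options


# Vision component of a VLM: the caller has already forced mxfp6_matmul to False.
vision_opts = resolve_qnn_options(
    {"mxfp6_matmul": False, "num_cores": 16},
    '{"mxfp6_matmul": true, "num_cores": 8}',
)
```

In this sketch the vision component keeps mxfp6_matmul as False even though the JSON config requests true, while an ordinary key such as num_cores is still taken from the config.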

Reading mxfp6_matmul for QNN Compilation path from compile API arguments

6c01a73

Signed-off-by: Shubham Agrawal <[email protected]>

shubhagr-qc requested review from quic-rishinr, ochougul, quic-hemagnih and quic-amitraj as code owners July 7, 2025 10:41

quic-rishinr added the 1.21.0 label Jul 10, 2025

quic-rishinr assigned shubhagr-qc Jul 10, 2025
