[Android/XNNPACK] SIGSEGV in XNNWeightsCache::look_up_or_insert during memcmp on MediaTek Dimensity 6100+ (Galaxy M15)

<html><head></head><body><h1>🐛 Describe the Bug</h1>
<h2>1. System Information</h2>

Field | Detail
-- | --
ExecuTorch Version | 1.1.0
OS Platform | Android 14 (One UI 6)
Target Architecture | arm64-v8a
Crashing Hardware | Samsung Galaxy M15 5G (MediaTek Dimensity 6100+ / Mali-G57 MC2)
Working Control Hardware | Samsung Galaxy A16 4G (MediaTek Helio G99 / Mali-G57 MC2)
RAM Variants Tested | Issue occurs on 4 GB / 6 GB / 8 GB variants of the M15


<hr>
<h2>2. Expected Behavior</h2>
<p>When calling <code>module.forward()</code> on a background <code>HandlerThread</code>, the XNNPACK delegate should successfully parse the memory-mapped <code>.pte</code> file, repack the convolution weights into the <code>XNNWeightsCache</code>, and execute the inference graph. This exact behavior succeeds flawlessly over 100 consecutive times on the control device (Galaxy A16).</p>
<hr>
<h2>3. Actual Behavior &amp; Crash Log</h2>
<p>On the Galaxy M15, the application instantly terminates with a native <strong>Segmentation Fault (SIGSEGV)</strong> during the XNNPACK backend initialization phase. The crash specifically occurs deep within the C standard library's memory comparison function (<code>__memcmp_aarch64</code>) while XNNPACK is attempting to index the weights cache.</p>
<h3>Backtrace</h3>
<pre><code>pid: 0, tid: 13533 &gt;&gt;&gt; com.borzai.vu &lt;&lt;&lt;

backtrace:
#00 pc 0x00000000000a398c  /apex/com.android.runtime/lib64/bionic/libc.so (__memcmp_aarch64+12)
#01 pc 0x0000000000362fa8  /data/app/.../split_config.arm64_v8a.apk!libexecutorch_jni.so
     (executorch::backends::xnnpack::delegate::XNNWeightsCache::look_up_or_insert(
       executorch::backends::xnnpack::delegate::XNNWeightsCache*,
       xnn_weights_cache_look_up_key const*, void*, unsigned long)+92)
#02 pc 0x0000000000403dc8  /data/app/.../split_config.arm64_v8a.apk!libexecutorch_jni.so ...
#04 pc 0x00000000004019cc  /data/app/.../split_config.arm64_v8a.apk!libexecutorch_jni.so
     (create_convolution2d_nhwc_f32+700)
#05 pc 0x0000000000401b20  /data/app/.../split_config.arm64_v8a.apk!libexecutorch_jni.so
     (xnn_create_convolution2d_nhwc_f32+264)
     ...
#08 pc 0x00000000003603b0  /data/app/.../split_config.arm64_v8a.apk!libexecutorch_jni.so
     (executorch::backends::xnnpack::delegate::XNNCompiler::compileModel(
       void const*, unsigned long,
       executorch::backends::xnnpack::delegate::XNNExecutor*,
       executorch::backends::xnnpack::delegate::XNNWeightsCache*,
       xnn_workspace*, executorch::runtime::NamedDataMap const*)+1340)
#09 pc 0x0000000000362274  /data/app/.../split_config.arm64_v8a.apk!libexecutorch_jni.so
     (executorch::backends::XnnpackBackend::init(
       executorch::runtime::BackendInitContext&amp;,
       executorch::runtime::FreeableBuffer*,
       executorch::runtime::ArrayRef&lt;executorch::runtime::CompileSpec&gt;) const+176)
     ...
#14 pc 0x0000000000393c50  /data/app/.../split_config.arm64_v8a.apk!libexecutorch_jni.so
     (executorch::extension::module::Module::execute(...))
#20 pc 0x000000000044a0fa  /data/app/.../base.apk
     (org.pytorch.executorch.Module.execute+58)
#24 pc 0x0000000000d404da  /data/app/.../base.apk
     (com.borzai.vu.MainActivity.runInferenceOnThread+90)
</code></pre>
<hr>
<h2>4. Context and Architectural Analysis</h2>
<p>The underlying cause might be <strong>hardware/kernel-specific</strong> rather than a universal memory leak, shown by the successful execution on the structurally similar Helio G99 (A16). Both processors utilize the identical ARM Cortex-A76/A55 Big.LITTLE architecture and share the 4 GB RAM baseline.</p>
<p>Given that <code>memcmp</code> triggers the fault, a <strong>null pointer or misaligned memory read</strong> is being passed into <code>look_up_or_insert</code>.</p></body></html>

cc @GregoryComer @digantdesai @cbilgin @kirklandsign

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Android/XNNPACK] SIGSEGV in XNNWeightsCache::look_up_or_insert during memcmp on MediaTek Dimensity 6100+ (Galaxy M15) #17669

🐛 Describe the Bug

1. System Information

2. Expected Behavior

3. Actual Behavior & Crash Log

Backtrace

4. Context and Architectural Analysis

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Field	Detail
ExecuTorch Version	1.1.0
OS Platform	Android 14 (One UI 6)
Target Architecture	arm64-v8a
Crashing Hardware	Samsung Galaxy M15 5G (MediaTek Dimensity 6100+ / Mali-G57 MC2)
Working Control Hardware	Samsung Galaxy A16 4G (MediaTek Helio G99 / Mali-G57 MC2)
RAM Variants Tested	Issue occurs on 4 GB / 6 GB / 8 GB variants of the M15

[Android/XNNPACK] SIGSEGV in XNNWeightsCache::look_up_or_insert during memcmp on MediaTek Dimensity 6100+ (Galaxy M15) #17669

Description

🐛 Describe the Bug

1. System Information

2. Expected Behavior

3. Actual Behavior & Crash Log

Backtrace

4. Context and Architectural Analysis

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions