RFC: PTE Size Inspector Design #7088

### 🚀 The feature, motivation and pitch

### Problem
Currently, users who export models to ExecuTorch have no tool to inspect what contributes to the size of the resulting .pte file. This is a concern because the file must fit within the available memory on target devices, which is often very limited.

### Goal
Users can understand what contributes to the overall size of a .pte file using a command-line tool or a Python script/notebook.

### RFC 

# Design


## Overview

There are three entry points for inspecting .pte file size:



* Before the .pte file exists:
    * **Entry Point 1:** Python util function <code>size_distribution(exec_prog)</code>. Use it at the end of the export script. 
* When the .pte file already exists:
    * <strong>Entry Point 2: </strong>Command-line tool, <code>pteinspect</code>, to inspect the .pte file. Use it in a terminal.
    * <strong>Entry Point 3: </strong>Python util function <code>size_distribution_from_pte(pte_file)</code>. Use it in a Python script or in a Python notebook. 

To get detailed sizing information inside the **delegate blobs**, delegates can implement hooks that decrypt the delegate blob. Implementing these hooks is optional for delegate authors. See the comments below for more discussion on this. 
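
One way to picture such a hook is sketched below. Everything here is hypothetical and for illustration only: the registry, the `register_blob_inspector` decorator, the `ToyBackend` name, and the toy blob layout are assumptions, not an existing ExecuTorch API. The idea is that a delegate author opts in by registering a function that maps a raw blob to named sub-component sizes, and the inspector falls back to reporting the blob as a single opaque unit when no hook exists.

```python
from typing import Callable, Dict

# Hypothetical registry: backend id -> function returning {component name: size in bytes}.
BlobInspector = Callable[[bytes], Dict[str, int]]
_BLOB_INSPECTORS: Dict[str, BlobInspector] = {}


def register_blob_inspector(backend_id: str):
    """Decorator a delegate author could use to opt in (hypothetical)."""
    def decorator(fn: BlobInspector) -> BlobInspector:
        _BLOB_INSPECTORS[backend_id] = fn
        return fn
    return decorator


def inspect_blob(backend_id: str, blob: bytes) -> Dict[str, int]:
    """Use the delegate's hook if present, else report the blob as one opaque unit."""
    hook = _BLOB_INSPECTORS.get(backend_id)
    if hook is None:
        return {"<opaque>": len(blob)}
    return hook(blob)


@register_blob_inspector("ToyBackend")
def _toy_inspector(blob: bytes) -> Dict[str, int]:
    # Toy layout assumed for this example: a 4-byte header followed by weights.
    return {"header": 4, "weights": len(blob) - 4}


print(inspect_blob("ToyBackend", bytes(20)))
print(inspect_blob("UnknownBackend", bytes(20)))
```

Keeping the hook optional means backends without one still show up in the report with their total blob size.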


## Details


### Class <code>SizeDistribution</code> and <code>size_distribution, size_distribution_from_pte</code> util functions


<code>SizeDistribution</code> is a recursive data class designed to hold size distribution information. It also comes with <code>SizeDistribution.to_dataframe()</code> to get size distribution details in the format of a pandas dataframe.

<code>size_distribution</code> and <code>size_distribution_from_pte</code> are convenience util functions that return a <code>SizeDistribution</code> instance from an <code>ExecutorchProgramManager</code> instance or from a .pte file, respectively. 


#### User Interface

```python
# User code: in export.py or in a notebook, call size_distribution() after to_executorch() is called in the export process
from executorch.devtools.ptetools import size_distribution

...
exec_prog = edge_program.to_executorch()
size_dists = size_distribution(exec_prog)
print(size_dists)
# To get the result as a pandas dataframe instead
df = size_dists.to_dataframe()
```

```python
# User code: in a notebook or a python script file, call size_distribution_from_pte() when the user already has a .pte file
from executorch.devtools.ptetools import size_distribution_from_pte

file_path = "/path/to/your/.pte"
size_dists = size_distribution_from_pte(file_path)
# Rest is the same as the example usages above
```

Example output of <code>print(size_dists)</code>

```
Program Flatbuffer: 51.23 KB
Constant Tensors: 112 B
	conv_0_weight: 64 B
	conv_1_weight: 48 B
Delegate Blobs: 13.90 MB
	XnnpackBackend_0: 8.85 MB
	XnnpackBackend_1: 5.05 MB
```

Sizes are printed in human-readable units:



* If size >= 1 GB, use GB
* If size >= 1 MB, use MB
* If size >= 1 KB, use KB
* If size &lt; 1 KB, use B
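
The unit rules above can be sketched as a small helper. This is an illustrative sketch, not the proposed implementation: `format_size` is a hypothetical name, and decimal units (1 KB = 1000 B) are an assumption chosen because they match the example output, where 51232 bytes prints as 51.23 KB.

```python
def format_size(num_bytes: int) -> str:
    """Render a byte count using the largest unit whose threshold it meets."""
    # Assumption: decimal units, checked from largest to smallest.
    units = [("GB", 1000**3), ("MB", 1000**2), ("KB", 1000)]
    for unit, factor in units:
        if num_bytes >= factor:
            return f"{num_bytes / factor:.2f} {unit}"
    return f"{num_bytes} B"


print(format_size(112))
print(format_size(51232))
print(format_size(13900016))
```

If binary units (1 KB = 1024 B) are preferred, only the `units` table would need to change.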

Example <code>df</code> when printed out


<table>
  <tr>
   <td><strong>Name</strong>
   </td>
   <td><strong>Size (bytes)</strong>
   </td>
   <td><strong>Level</strong>
   </td>
  </tr>
  <tr>
   <td>Total Size
   </td>
   <td>13951360
   </td>
   <td>0
   </td>
  </tr>
  <tr>
   <td>Program Flatbuffer
   </td>
   <td>51232
   </td>
   <td>1
   </td>
  </tr>
  <tr>
   <td>Constant Tensors
   </td>
   <td>112
   </td>
   <td>1
   </td>
  </tr>
  <tr>
   <td>conv_0_weight
   </td>
   <td>64
   </td>
   <td>2
   </td>
  </tr>
  <tr>
   <td>conv_1_weight
   </td>
   <td>48
   </td>
   <td>2
   </td>
  </tr>
  <tr>
   <td>Delegate Blobs
   </td>
   <td>13900016
   </td>
   <td>1
   </td>
  </tr>
  <tr>
   <td>XnnpackBackend_0
   </td>
   <td>8845856
   </td>
   <td>2
   </td>
  </tr>
  <tr>
   <td>XnnpackBackend_1
   </td>
   <td>5054160
   </td>
   <td>2
   </td>
  </tr>
</table>



#### Implementation 
```python
# New file: executorch/devtools/ptetools.py

from typing import List, Optional

from executorch.exir import ExecutorchProgramManager


class SizeDistribution:
    """Sizes represented in a recursive structure"""

    def __init__(
        self,
        name: str,
        size: int,
        components: Optional[List["SizeDistribution"]] = None,
    ):
        self.name = name
        self.size = size  # size in bytes
        # Default to an empty list so that leaf nodes can be iterated safely.
        self.components = components if components is not None else []

    def __str__(self, level=0):
        """String representation for displaying the hierarchy with indentation."""
        indent = "  " * level
        result = f"{indent}{self.name}: {self.size} bytes\n"
        for component in self.components:
            result += component.__str__(level + 1)
        return result

    def to_dataframe(self):
        """Format the class into a pandas dataframe"""


def size_distribution(exec_prog: ExecutorchProgramManager) -> SizeDistribution:
    """
    Args:
        exec_prog: ExecuTorch program
    Returns:
        Hierarchical size distribution of different components of exec_prog
    """


def size_distribution_from_pte(file_path: str) -> SizeDistribution:
    """
    Args:
        file_path: File path of a .pte file
    Returns:
        Hierarchical size distribution of different components of the .pte
    """
```
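
As a rough illustration of what `to_dataframe()` could produce, the sketch below flattens a size tree into rows with the Name, Size (bytes), and Level columns used in the example table. `SizeNode` and `flatten` are hypothetical stand-ins introduced only for this example, not part of the proposed API; the real method would presumably pass the rows to `pandas.DataFrame`.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Union


@dataclass
class SizeNode:
    """Hypothetical stand-in for SizeDistribution, for illustration only."""
    name: str
    size: int  # bytes
    components: List["SizeNode"] = field(default_factory=list)


def flatten(node: SizeNode, level: int = 0) -> List[Dict[str, Union[str, int]]]:
    """Pre-order walk that emits one row per node, parents before children."""
    rows = [{"Name": node.name, "Size (bytes)": node.size, "Level": level}]
    for child in node.components:
        rows.extend(flatten(child, level + 1))
    return rows


tree = SizeNode("Constant Tensors", 112, [
    SizeNode("conv_0_weight", 64),
    SizeNode("conv_1_weight", 48),
])
rows = flatten(tree)
for row in rows:
    print(row)
# With pandas installed, pandas.DataFrame(rows) would give the tabular view.
```

A pre-order walk keeps each parent directly above its children, which is the ordering the example table uses.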

### Command Line tool, <code>pteinspect</code>

This is useful for users who did not export the model themselves but have a .pte file and want to understand its size. Users can also call it from a bash script to analyze .pte files. 

Example user flow:
```
$ pteinspect [options] pte_file
```

where,
* [options]: Various command-line options that determine the output and level of detail displayed by pteinspect.
* pte_file: The PTE file to be inspected.


<table>
  <tr>
   <td><strong>Option</strong>
   </td>
   <td><strong>Description</strong>
   </td>
  </tr>
  <tr>
   <td>-l
   </td>
   <td>List the top level components of the .pte file and the size of each of them 
   </td>
  </tr>
  <tr>
   <td>-e <em>component_name</em>
   </td>
   <td>List all the components inside <em>component_name </em>and the size of each of them
   </td>
  </tr>
  <tr>
   <td>-h
   </td>
   <td>Display information in the headers
   </td>
  </tr>
  <tr>
   <td>--help
   </td>
   <td>Provides a help message listing all available options 
   </td>
  </tr>
</table>
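
The option table above could be wired up with `argparse` roughly as follows. This is a sketch under assumptions, not the planned implementation: the `dest` names and the `model.pte` file name are made up for the example. The main point it shows is that `-h` can be reserved for headers by disabling argparse's default help flag and re-adding `--help` explicitly.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    # add_help=False frees "-h" for the headers option; "--help" is re-added below.
    parser = argparse.ArgumentParser(prog="pteinspect", add_help=False)
    parser.add_argument("-l", dest="list_top_level", action="store_true",
                        help="list the top level components and their sizes")
    parser.add_argument("-e", dest="component_name", metavar="component_name",
                        help="list the components inside component_name and their sizes")
    parser.add_argument("-h", dest="headers", action="store_true",
                        help="display information in the headers")
    parser.add_argument("--help", action="help",
                        help="show this help message and exit")
    parser.add_argument("pte_file", help="the PTE file to be inspected")
    return parser


# Parse an example command line instead of sys.argv so the sketch runs standalone.
args = build_parser().parse_args(["-l", "-e", "Delegate Blobs", "model.pte"])
print(args.list_top_level, args.component_name, args.headers, args.pte_file)
```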

## Alternatives Considered 

We considered an *interactive command-line tool*, but moved away from it because *a command-line tool with arguments* is more scriptable, and its style more closely matches ELF tools, which are widely used in the industry. 

We also considered combining different pte tools (file inspection, file modification, etc.) into one tool. We decided to keep separate tools for different features, both to match the style of ELF tools and to give users confidence that they won't accidentally modify a file when they only want to inspect it. 

## Release Plan

**Milestone 1 (1 week)**: Define Python class and write Python APIs

**Milestone 2 (1 week)**: Write the commandline tool

