`hesse` 🐍

Introduction

The goal of this package is to simplify the computation of certain Hessian matrices.

In particular, suppose that we are interested in computing the Hessian matrix of model with respect to its parameters. The existing paradigm is to make a "functional version" of model's forward pass

def functional_forward(params):
    return torch.func.functional_call(model, params, inputs)

and then compute its Hessian

params = dict(model.named_parameters())
hessian = torch.func.hessian(functional_forward)(params)

The output hessian is a dictionary of dictionaries, such that hessian["P"]["Q"] is the Hessian matrix block correponding to named parameters P and Q. Extra work is required if we want to assemble the full Hessian matrix, if we want to modify this process to obtain a diagonal approximation of the full Hessian matrix, and so on.

This package aims to remove the extra work, by providing user-friendly wrappers for torch.func transforms and matrix assembly.

Installation

In an existing Python 3.9+ environment:

git clone https://github.com/dscamiss/hesse/
pip install ./hesse

Examples

Setup

Import packages.

import hesse
import torch

Create a toy multi-input, multi-output model.

class MimoModel(torch.nn.Module):
    """Multi-input, multi-output demo model."""

    def __init__(self, m: int) -> None:
        super().__init__()
        self.A = torch.nn.Parameter(torch.randn(m, m))
        self.B = torch.nn.Parameter(torch.randn(m, m))

    def forward(self, x, y):
        """
        Run forward pass.

        Args:
            x: First input tensor of shape (b, n).
            y: Second input tensor of shape (b, n).

        Returns:
            The matrix

            [ tr(A^t A) x_{    0, :}   tr(B^t B) y_{    0, :} ]
            [ tr(A^t A) x_{    1, :}   tr(B^t B) y_{    1, :} ]
            [           :                        :            ]
            [ tr(A^t A) x_{m - 1, :}   tr(B^t B) y_{m - 1, :} ].
        """
        row_1 = torch.trace(self.A.T @ self.A) * x
        row_2 = torch.trace(self.B.T @ self.B) * y
        return torch.hstack((row_1, row_2))

Make an instance of MimoModel and batch inputs.

model = MimoModel(2)

x = torch.Tensor(
    [
        [1.0, 2.0],
        [3.0, 4.0],
    ]
)
y = -1.0 * x

Model Hessians

Computing the Hessian matrix of model is easy:

hessian = hesse.model_hessian_matrix(model=model, inputs=(x, y))

Generally speaking, the shape of the Hessian matrix will be (batch_size, output_size, ...). In this instance, batch_size = 2 and output_size = 4, so that hessian has shape (2, 4, 8, 8).

To compute the Hessian matrix with respect to a subset of the model parameters, just provide a list of parameter names:

hessian = hesse.model_hessian_matrix(model=model, inputs=(x, y), params=["A"])

Loss function Hessians

Create a loss criterion and batch target output:

criterion = torch.nn.MSELoss()
target = torch.randn(2, 4)

Computing the Hessian matrix of the loss function criterion(model(inputs), target) is easy:

loss_hessian = hesse.loss_hessian_matrix(
    model=model,
    criterion=criterion,
    inputs=(x, y),
    target=target,
)

As above, we can compute the Hessian matrix with respect to a subset of the model parameters:

loss_hessian = hesse.loss_hessian_matrix(
    model=model,
    criterion=criterion,
    inputs=(x, y),
    target=target,
    params=["A"],
)

Name		Name	Last commit message	Last commit date
Latest commit History 126 Commits
.github/workflows		.github/workflows
examples		examples
src		src
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
TODO.md		TODO.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

`hesse` 🐍

Introduction

Installation

Examples

Setup

Model Hessians

Loss function Hessians

About

Uh oh!

Releases

Packages

Languages

License

dscamiss/hesse

Folders and files

Latest commit

History

Repository files navigation

hesse 🐍

Introduction

Installation

Examples

Setup

Model Hessians

Loss function Hessians

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

`hesse` 🐍

Packages