
[MRG] Bures-Wasserstein Gradient Descent for Bures-Wasserstein Barycenters #680

Merged: 37 commits, Mar 12, 2025

Conversation

@clbonet (Contributor) commented Oct 19, 2024:

Types of changes

This PR aims to add the Bures-Wasserstein gradient descent solver to compute Bures-Wasserstein barycenters (see e.g. Gradient descent algorithms for Bures-Wasserstein barycenters or Averaging on the Bures-Wasserstein manifold: dimension-free convergence of gradient descent).

  • Restructured ot.gaussian.bures_wasserstein_barycenter to allow the use of different methods
  • Added the previous fixed-point algorithm in ot.gaussian.bures_barycenter_fixpoint
  • Added the Bures-Wasserstein gradient descent in ot.gaussian.bures_barycenter_gradient_descent
  • Added an iteration over the methods in the test test_bures_wasserstein_barycenter
  • Added a test test_fixedpoint_vs_gradientdescent_bures_wasserstein_barycenter
  • Added a batched version of ot.gaussian.bures_wasserstein_distance
  • The trace can now be computed for batches of matrices. The backend implementations of the trace were chosen based on the runtimes reported by the following benchmarks (on CPU):
import torch
import jax
import numpy as np
import tensorflow as tf
import jax.numpy as jnp

A = np.random.rand(1000, 100, 100)

%timeit np.einsum("...ii", A) # 109 μs ± 1.71 μs per loop
%timeit np.trace(A, axis1=-2, axis2=-1) # 116 μs ± 1.79 μs
%timeit A.diagonal(axis1=-2, axis2=-1).sum(-1) # 114 μs ± 2.87 μs per loop

A = torch.rand(1000, 100, 100)

%timeit torch.einsum("...ii", A)  # 3.17 ms ± 1.1 ms per loop 
%timeit A.diagonal(dim1=-2, dim2=-1).sum(-1) # 3.1 ms ± 879 μs per loop

A = tf.random.uniform((1000, 100, 100))

@tf.function
def trace_sum(A):
    return tf.einsum("...ii", A)

@tf.function
def trace_sum_v2(A):
    return tf.reduce_sum(tf.linalg.diag_part(A), axis=-1)

# Warm-up execution
trace_sum(A)  
trace_sum_v2(A)

# Benchmarking
%timeit trace_sum(A) # 486 μs ± 21.1 μs per loop
%timeit trace_sum_v2(A) # 430 μs ± 36.1 μs per loop
%timeit tf.linalg.trace(A) # 404 μs ± 18.2 μs per loop

# For jax, the results might look different using jit
A = jnp.ones((1000, 100, 100)) 

%timeit jnp.einsum("...ii", A) # 13.6 ms ± 324 μs per loop
%timeit jax.vmap(jnp.trace)(A) # 12.1 ms ± 457 μs per loop
%timeit A.diagonal(axis1=-2, axis2=-1).sum(-1) # 1.64 ms ± 3.62 ms per loop
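
For context, the batched trace feeds directly into the closed-form squared Bures-Wasserstein distance, W2²(N(ms, Σs), N(mt, Σt)) = ||ms − mt||² + Tr(Σs + Σt − 2(Σs^{1/2} Σt Σs^{1/2})^{1/2}). A minimal numpy sketch of a batched version (helper names are illustrative, not the PR's actual API):

```python
import numpy as np

def sqrtm_batch(C):
    # Batched symmetric square root of SPD matrices via eigendecomposition:
    # C = V diag(w) V^T  =>  C^{1/2} = V diag(sqrt(w)) V^T
    w, V = np.linalg.eigh(C)
    return (V * np.sqrt(np.clip(w, 0.0, None))[..., None, :]) @ np.swapaxes(V, -2, -1)

def bures_wasserstein_dist2(ms, Cs, mt, Ct):
    # Squared Bures-Wasserstein distance, broadcasting over leading batch dims
    S = sqrtm_batch(Cs)
    cross = sqrtm_batch(S @ Ct @ S)
    # batched trace with einsum, the fastest numpy option benchmarked above
    bures = np.einsum("...ii", Cs + Ct - 2.0 * cross)
    return np.sum((ms - mt) ** 2, axis=-1) + bures

# W2^2(N(0, I_2), N(0, 4 I_2)) = tr(I + 4I - 2*2I) = 2, batched over 3 pairs
d2 = bures_wasserstein_dist2(np.zeros((3, 2)), np.tile(np.eye(2), (3, 1, 1)),
                             np.zeros((3, 2)), np.tile(4.0 * np.eye(2), (3, 1, 1)))
```
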

Motivation and context / Related issue

The Bures-Wasserstein gradient descent comes with convergence guarantees for computing Bures-Wasserstein barycenters. Moreover, it can also be used stochastically when there are too many Gaussians. Thus, it is a good alternative to the currently implemented fixed-point algorithm.
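
A minimal numpy sketch of the gradient descent on covariances (means average linearly, m* = Σ_k w_k m_k, so only covariances need iteration). The names and the initialization are illustrative, not the PR's implementation; the update Σ ← A Σ A with A = I + γ(Σ_k w_k T_k − I) follows the cited papers, where T_k is the OT map from N(0, Σ) to N(0, Σ_k):

```python
import numpy as np

def sqrtm_spd(C):
    # Symmetric square root of an SPD matrix via eigendecomposition
    w, V = np.linalg.eigh(C)
    return (V * np.sqrt(np.clip(w, 0.0, None))) @ V.T

def bw_barycenter_gd(covs, weights, step=1.0, n_iter=100):
    # Bures-Wasserstein gradient descent for the barycenter of centered Gaussians
    d = covs[0].shape[0]
    Sigma = np.eye(d)  # illustrative initialization
    for _ in range(n_iter):
        S = sqrtm_spd(Sigma)
        S_inv = np.linalg.inv(S)
        # weighted average of the OT maps T_k sending N(0, Sigma) to N(0, covs[k]):
        # T_k = S^{-1} (S covs[k] S)^{1/2} S^{-1}
        T = sum(w * S_inv @ sqrtm_spd(S @ C @ S) @ S_inv
                for w, C in zip(weights, covs))
        A = np.eye(d) + step * (T - np.eye(d))  # step along minus the BW gradient
        Sigma = A @ Sigma @ A
    return Sigma

# Barycenter of N(0, I) and N(0, 4I): sqrt(Sigma*) = (1 + 2)/2, so Sigma* = 2.25 I
Sigma = bw_barycenter_gd([np.eye(2), 4.0 * np.eye(2)], [0.5, 0.5])
```

With step=1, the update coincides with the classical fixed-point map, which is why the two methods are expected to agree at convergence.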

How has this been tested (if it applies)

I added a test test_fixedpoint_vs_gradientdescent_bures_wasserstein_barycenter to assert that both methods return the same barycenter. I also added an iteration over the methods (via itertools) in test_bures_wasserstein_barycenter.
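
As a numpy-only sketch of the idea behind that test (the PR's actual test uses `ot.gaussian`; the helpers below are illustrative), one can check numerically that the fixed-point map and the gradient-descent update converge to the same covariance:

```python
import numpy as np

def sqrtm_spd(C):
    # Symmetric square root of an SPD matrix via eigendecomposition
    w, V = np.linalg.eigh(C)
    return (V * np.sqrt(np.clip(w, 0.0, None))) @ V.T

def fixed_point_step(Sigma, covs, weights):
    # Classical fixed-point map: Sigma <- S^{-1} (sum_k w_k (S C_k S)^{1/2})^2 S^{-1}
    S = sqrtm_spd(Sigma)
    S_inv = np.linalg.inv(S)
    M = sum(w * sqrtm_spd(S @ C @ S) for w, C in zip(weights, covs))
    return S_inv @ M @ M @ S_inv

def gd_step(Sigma, covs, weights, step=0.5):
    # One Bures-Wasserstein gradient step with a damped step size
    S = sqrtm_spd(Sigma)
    S_inv = np.linalg.inv(S)
    T = sum(w * S_inv @ sqrtm_spd(S @ C @ S) @ S_inv for w, C in zip(weights, covs))
    A = np.eye(len(Sigma)) + step * (T - np.eye(len(Sigma)))
    return A @ Sigma @ A

rng = np.random.default_rng(0)
covs = [B @ B.T + np.eye(2) for B in rng.standard_normal((3, 2, 2))]
weights = np.full(3, 1.0 / 3.0)

Sig_fp = Sig_gd = np.eye(2)
for _ in range(2000):
    Sig_fp = fixed_point_step(Sig_fp, covs, weights)
    Sig_gd = gd_step(Sig_gd, covs, weights)
```
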

PR checklist

  • I have read the CONTRIBUTING document.
  • The documentation is up-to-date with the changes I made (check build artifacts).
  • All tests passed, and additional code has been covered with new tests.
  • I have added the PR and Issue fix to the RELEASES.md file.

@rflamary (Collaborator) left a comment:

Small comments. I will let @antoinecollas do a proper review; he is the expert in Riemannian optimization.


codecov bot commented Oct 31, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 97.13%. Comparing base (79eb337) to head (1444648).
Report is 1 commit behind head on master.

Additional details and impacted files
@@            Coverage Diff             @@
##           master     #680      +/-   ##
==========================================
+ Coverage   97.10%   97.13%   +0.03%     
==========================================
  Files         100      100              
  Lines       20115    20369     +254     
==========================================
+ Hits        19532    19786     +254     
  Misses        583      583              

@rflamary (Collaborator) left a comment:

This is great. A few tests, especially about errors, are missing.

@clbonet clbonet changed the title [WIP] Bures-Wasserstein Gradient Descent for Bures-Wasserstein Barycenters [MRG] Bures-Wasserstein Gradient Descent for Bures-Wasserstein Barycenters Mar 4, 2025
@rflamary (Collaborator) left a comment:

Just a few questions and then we can merge.

@@ -8,6 +8,10 @@
- Automatic PR labeling and release file update check (PR #704)
- Reorganize sub-module `ot/lp/__init__.py` into separate files (PR #714)
- Fix documentation in the module `ot.gaussian` (PR #718)
- Refactored `ot.bregman._convolutional` to improve readability (PR #709)
Collaborator:

I don't see that in the PR.

@clbonet (Contributor, Author) replied:

Mmh, I think I made a mistake when merging with master at some point. (It was deleted from line 46 of RELEASES.md, and it seemed to be in the wrong release of POT.)

@@ -1363,7 +1363,8 @@ def solve(self, a, b):
         return np.linalg.solve(a, b)

     def trace(self, a):
-        return np.trace(a)
+        return np.einsum("...ii", a)
Collaborator:

Is that faster or slower? We need an idea.

ot/gaussian.py (outdated):

 Returns
 -------
-W : float
+W : float if ms and mt of shape (d,); array-like (n,) if ms of shape (n,d) and mt of shape (d,); array-like (m,) if ms of shape (d,) and mt of shape (m,d); array-like (n,m) if ms of shape (n,d) and mt of shape (m,d)
Collaborator:

Too complicated an API: return a float in the (d,) case, and for the rest use a parameter that selects paired or cross distances.

@rflamary rflamary merged commit d25770c into PythonOT:master Mar 12, 2025
18 checks passed