
Conversation

@FranciscoThiesen

No description provided.

@meta-cla

meta-cla bot commented Oct 25, 2025

Hi @FranciscoThiesen!

Thank you for your pull request and welcome to our community.

Action Required

In order to merge any pull request (code, docs, etc.), we require contributors to sign our Contributor License Agreement, and we don't seem to have one on file for you.

Process

In order for us to review and merge your suggested changes, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (e.g., your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA.

Once the CLA is signed, our tooling will perform checks and validations. Afterwards, the pull request will be tagged with CLA signed. The tagging process may take up to 1 hour after signing. Please give it that time before contacting us about it.

If you have received this in error or have any questions, please contact us at [email protected]. Thanks!

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Oct 25, 2025
@oulgen
Contributor

oulgen commented Oct 25, 2025

kicked off the CI, but you'll probably need to update the requirements file

@FranciscoThiesen
Author

@oulgen do you mind taking a look whenever time permits?

@oulgen
Contributor

oulgen commented Oct 31, 2025

Hey @FranciscoThiesen thank you for implementing a new autotuning algorithm. Could you share some results? Perhaps you can compare to PatternSearch in terms of

  • Convergence time
  • Best perf found

also please make sure the tests and the lint are passing
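A minimal harness along these lines could produce those two numbers per algorithm (sketch only; `toy_search` is a hypothetical stand-in for wrapped MFBO and PatternSearch runs, not the repo's actual API):

```python
import random
import time

def compare_search(algorithms, budget=50):
    """Run each autotuner with the same evaluation budget and record
    wall-clock convergence time and the best performance found."""
    results = {}
    for name, search in algorithms.items():
        start = time.perf_counter()
        best = search(budget)  # assumed to return the best latency found
        results[name] = {"time_s": time.perf_counter() - start, "best": best}
    return results

def toy_search(budget):
    # Stand-in objective; a real run would benchmark compiled configs.
    rng = random.Random(0)
    return min(rng.random() for _ in range(budget))

results = compare_search({"toy": toy_search})
```

In a real comparison the same kernel and config space would be passed to both searches, with the budget fixed so the "best perf found" column is apples-to-apples.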

@oulgen
Contributor

oulgen commented Oct 31, 2025

also please update your new unit test file to be the same style as rest of the test using a class
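The class-based style being asked for looks roughly like this (sketch; the class name, test name, and assertion are placeholders, not the suite's actual contents):

```python
import unittest

class TestMultiFidelityBO(unittest.TestCase):
    """Class-based test matching the style of the rest of the suite."""

    def test_encoder_roundtrip(self):
        # A real test would encode a config and assert it decodes back;
        # this assertion is a placeholder.
        self.assertEqual([(0.0, 1.0)] * 2, [(0.0, 1.0), (0.0, 1.0)])
```

Tests in this style are picked up by `python -m unittest` and by the repo's existing test runner without special-casing.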

@FranciscoThiesen
Author

@oulgen do you have any available GPUs that could be used for convergence analysis + best-perf comparison? I definitely agree that having this is a must to assess how good MFBO is versus the current hill-climbing approach.

I think this will really shine in terms of reducing the total time/resources that the auto-tuner takes, while still finding good solutions.

I see that the CI runs a few of the tests on GPUs and unfortunately I don't have a personal one that I can use for my OS contributions.

@oulgen
Contributor

oulgen commented Oct 31, 2025

> @oulgen do you have any available GPUs that could be used for convergence analysis + best-perf comparison? I definitely agree that having this is a must to assess how good MFBO is versus the current hill-climbing approach.
>
> I think this will really shine in terms of reducing the total time/resources that the auto-tuner takes, while still finding good solutions.
>
> I see that the CI runs a few of the tests on GPUs and unfortunately I don't have a personal one that I can use for my OS contributions.

I can give it a try, but no promises on when.

@FranciscoThiesen FranciscoThiesen changed the title [Not ready for reviews yet, just want to run CI] - Implementing multi-fidelity bayesian search for the auto-tuner Implemented multi-fidelity bayesian search for the auto-tuner Nov 3, 2025
@FranciscoThiesen
Author

@oulgen got GPUs here for running the convergence analysis. Will run it whenever time permits and then share the results.

Contributor

@jansel left a comment:

How well does this work? Can you share some data comparing this to some other search methods in terms of best perf over time tuning?

Contributor

This looks algorithm-specific; let's move all the related files to a subfolder.


from typing import TYPE_CHECKING

import numpy as np
Contributor

Let's use torch to avoid adding a numpy dependency.
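For reference, the numpy calls a GP-based search typically needs have direct torch equivalents, so the dependency can be dropped without restructuring (illustrative mapping only, not the PR's actual code):

```python
import torch

x = torch.tensor([1.0, 2.0, 4.0])

mean = x.mean().item()         # np.mean(x)
best = int(x.argmin())         # np.argmin(x)
logs = torch.log2(x)           # np.log2(x)
stacked = torch.stack([x, x])  # np.stack([x, x])
```

Since Helion already depends on torch, this keeps the encoder free of any extra runtime requirement.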

Args:
    config: The configuration to benchmark.
    fn: A precompiled version of config.
    fidelity: Number of repetitions for benchmarking (default: 50).
Contributor

Let's rename this to `repeat` or `samples`.

)
elif enc_type == "enum":
    # One-hot encoding
    if hasattr(spec, "choices"):
Contributor

When is this false?

from .config_generation import FlatConfig


class ConfigEncoder:
Contributor

Could this share more code with ConfigGeneration?

Comment on lines +47 to +50
if category in {
    Category.BLOCK_SIZE,
    Category.NUM_WARPS,
}:
Contributor

I think it would be better if you used the ConfigFragment directly (add a method for this type of encoding) rather than switching based on category.
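The suggestion above amounts to letting each fragment type own its encoding instead of branching on `Category`. A rough sketch (class and method names here are hypothetical, not Helion's actual API):

```python
import math

class ConfigSpecFragment:
    def encode(self, value):
        raise NotImplementedError

class PowerOfTwoFragment(ConfigSpecFragment):
    def encode(self, value):
        # Power-of-two values (block sizes, num_warps) map to log2 space.
        return [math.log2(float(value))]

class EnumFragment(ConfigSpecFragment):
    def __init__(self, choices):
        self.choices = choices

    def encode(self, value):
        # One-hot encoding over the declared choices.
        return [1.0 if c == value else 0.0 for c in self.choices]

print(PowerOfTwoFragment().encode(64))       # [6.0]
print(EnumFragment(["a", "b"]).encode("b"))  # [0.0, 1.0]
```

With this shape, `ConfigEncoder` just concatenates `spec.encode(value)` for each fragment and never inspects categories itself.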

except (ValueError, IndexError):
    # Default to first choice if value not found
    encoded[enc_start] = 1.0

Contributor

Suggested change:

else:
    raise an error

Comment on lines +106 to +133

def get_bounds(self) -> list[tuple[float, float]]:
    """
    Get bounds for each encoded dimension.

    Returns:
        List of (min, max) tuples for each dimension.
    """
    bounds: list[tuple[float, float]] = []

    for flat_idx, spec in enumerate(self.flat_spec):
        category = spec.category()
        enc_start, enc_end, enc_type = self.encoding_map[flat_idx]

        if enc_type == "numerical":
            if category in {Category.BLOCK_SIZE, Category.NUM_WARPS}:
                # Power-of-2: log2 bounds
                min_val = math.log2(float(spec.low))  # type: ignore[attr-defined]
                max_val = math.log2(float(spec.high))  # type: ignore[attr-defined]
                bounds.append((min_val, max_val))
            else:
                # Other numerical bounds
                bounds.append(
                    (float(spec.low), float(spec.high))  # type: ignore[attr-defined]
                )
        elif enc_type == "enum":
            # One-hot: each dimension is 0 or 1
            num_choices = enc_end - enc_start
            bounds.extend([(0.0, 1.0)] * num_choices)
Contributor

The ConfigSpecFragment already has methods for this.

Comment on lines +5 to +8
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import ConstantKernel
from sklearn.gaussian_process.kernels import Matern
Contributor

These should be optional deps.


Labels

CLA Signed This label is managed by the Meta Open Source bot.


3 participants