
Conversation

@allglc (Collaborator) commented Nov 4, 2025

Description

Add the ability to handle multi-dimensional thresholds (lambdas)

  • `predict_function` can now also be a general function (X, *params) -> 0/1
  • predict_params is now an explicit argument (even for a one-dimensional lambda), and its docstring should be clearer
  • best_predict_param is a tuple for multi-dimensional parameters
  • added an automatic flag is_multi_dimensional_param (inferred from the dimension of predict_params)
  • _get_predictions_per_param handles general predict functions, but processes all parameter values sequentially (I don't know of an easy way to parallelize this)
  • get_predictions_per_param checks the prediction values during the calibration step only, via a new argument is_calibration_step: the check is unnecessary at test time, where predicting a single probability could legitimately yield a value of 0 or 1 and raise a spurious warning
  • predictions are validated: for one-dimensional parameters they must not all be 0 or 1 (they should be probabilities), while for multi-dimensional parameters they must all be 0 or 1
  • I don't think a custom error message is needed when the dimension of the parameters does not match the inputs of the general predict function, as the default message seems explicit enough (cf. test_error_multi_dim_params_dim_mismatch)
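
As a rough usage sketch of what this enables (illustrative only: the model, the grid values, and the exact calling convention are assumptions, not this PR's actual code):

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=200, random_state=0)
model = LogisticRegression(max_iter=1000).fit(X, y)

# A general predict function (X, *params) -> 0/1, here with two parameters
# acting as a lower and upper threshold on the positive-class probability.
def predict_function(X, low, high):
    proba = model.predict_proba(X)[:, 1]
    return ((proba >= low) & (proba <= high)).astype(int)

# A custom (n_params, 2) array of parameter tuples; providing a 2-D array
# like this is what triggers the multi-dimensional path (see option 4 below).
predict_params = np.array(
    [[low, high]
     for low in np.linspace(0.0, 0.5, 6)
     for high in np.linspace(0.5, 1.0, 6)]
)
```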

To manage the two types of predict functions in `__init__`, there are a few options:

  1. A manual flag in the arguments of BinaryClassificationController
  2. Auto-detection based on the function signature (does it take one or more arguments?), but this can cause issues when predict_proba itself takes several arguments (e.g. XGBoost)
  3. Adding another argument, e.g. predict_function_general, with the user required to provide at least this or the original predict_function
  4. An automatic check of predict_params.shape[1] > 1: the user has to provide a custom array of predict_params to use multi-dimensional params

I will go with option 4.
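
A minimal sketch of what that shape-based detection could look like (the helper name is hypothetical and the actual implementation may differ):

```python
import numpy as np

def _infer_is_multi_dimensional_param(predict_params) -> bool:
    # Hypothetical helper: a 2-D array with more than one column is treated
    # as a grid of multi-dimensional parameter tuples; anything else keeps
    # the original one-dimensional threshold behaviour.
    predict_params = np.asarray(predict_params)
    return predict_params.ndim > 1 and predict_params.shape[1] > 1
```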

@codecov-commenter

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 100.00%. Comparing base (5ee9406) to head (72d7a5c).
⚠️ Report is 16 commits behind head on master.

Additional details and impacted files
@@            Coverage Diff             @@
##            master      #788    +/-   ##
==========================================
  Coverage   100.00%   100.00%            
==========================================
  Files           56        56            
  Lines         6325      6547   +222     
  Branches       360       378    +18     
==========================================
+ Hits          6325      6547   +222     


@allglc marked this pull request as ready for review November 5, 2025 16:04
"fpr" for false positive rate.
- A custom instance of BinaryClassificationRisk object
predict_params : NDArray, default=np.linspace(0, 0.99, 100)
Collaborator

Two questions:

  • do we want the argument to be called predict_params? Couldn't it be something like list_thresholds?
  • can we imagine a case where the argument is a function/generator for optimal exploration?

Collaborator Author

  • I agree that the name (following the existing _predict_params name) is confusing. I think params is the right term in the general setting (they are also called parameters in the LTT paper); it's only in the one-dimensional case that it amounts to a threshold. We have to decide before merging, as it's a user-facing argument that we cannot change later. Maybe externally we could define the argument list_multi_dimensional_parameters for the multi-dimensional case and list_thresholds for the one-dimensional case?
  • I think if the user has a function/generator, they can just format its output and pass it as predict_params, as sketched below
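
For instance (a hypothetical generator, just to illustrate formatting its output):

```python
import numpy as np

def my_param_generator():
    # Hypothetical exploration strategy yielding (low, high) candidates.
    for low in np.linspace(0.0, 0.5, 6):
        for high in np.linspace(0.5, 1.0, 6):
            yield (low, high)

# Materialize the generator's output into the expected predict_params array.
predict_params = np.array(list(my_param_generator()))
```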

Collaborator

I think we should go with list_params, specifying in the docstring that it is for multi-dimensional cases. We probably want to keep the name short for users.

    y_pred = y_pred.astype(int)
else:
    try:
        predictions_proba = self._predict_function(X)[:, 1]
Collaborator

Is there a reason not to put the multi-param prediction function inside the try as well?

Collaborator Author

Yes, because it's not the same function call and the second error isn't appropriate there (we could perhaps adapt it to still test the first error in the multi-dim case).
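
To make the branching concrete, here is an illustrative paraphrase of the dispatch (the helper name, the caught exception, and the error message are assumptions, not the actual source):

```python
def _predictions_for_param(predict_function, X, params, is_multi_dimensional_param):
    # Illustrative paraphrase of the dispatch discussed above, not MAPIE code.
    if is_multi_dimensional_param:
        # General (X, *params) -> 0/1 call: a parameter-count mismatch here
        # raises Python's default TypeError, whose message is explicit enough
        # (cf. test_error_multi_dim_params_dim_mismatch), so no try is needed.
        return predict_function(X, *params)
    try:
        # predict_proba-style call: the [:, 1] indexing fails in a known way
        # when the function does not return an (n_samples, 2) array, so the
        # tailored error only makes sense for this branch.
        return predict_function(X)[:, 1]
    except IndexError as exc:
        raise ValueError(
            "predict_function should return probabilities of shape "
            "(n_samples, 2)"
        ) from exc
```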
