Skip to content

Commit 04f3f4d

Browse files
feat(EstimatorReport): Display the feature permutation importance and mean decrease impurity (probabl-ai#1365)
Closes probabl-ai#1319 and probabl-ai#1323 Todo: - ~[ ] Add example~ deferred to probabl-ai#1390 - [x] Coverage at 100% - [x] Add API docs - [x] Check for `# TODO` in code - [x] Check what happens if `scoring` is a callable (is it cached?) - scoring was not included in the cache key at all! Fixed - [x] Check what happens if `random_state` is a `RandomState` instance (is it cached?) - If `random_state` is a RandomState then the call should not be cached, because reusing a `RandomState` gives a different result. --------- Co-authored-by: Sylvain Combettes <[email protected]>
1 parent bc1f56d commit 04f3f4d

16 files changed

+1515
-79
lines changed

README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -32,7 +32,7 @@ skore is a Python open-source library designed to help data scientists apply rec
3232
- **Diagnose**: catch methodological errors before they impact your models.
3333
- `train_test_split` supercharged with methodological guidance: the API is the same as scikit-learn's, but skore displays warnings when applicable. For example, it warns you against shuffling time series data or when you have class imbalance.
3434
- **Evaluate**: automated insightful reports.
35-
- `EstimatorReport`: feed your scikit-learn compatible estimator and dataset, and it generates recommended metrics and plots to help you analyze your estimator. All these are computed and generated for you in 1 line of code. Under the hood, we use efficient caching to make the computations blazing fast.
35+
- `EstimatorReport`: feed your scikit-learn compatible estimator and dataset, and it generates recommended metrics, feature importance, and plots to help you evaluate and inspect your estimator. All these are computed and generated for you in 1 line of code. Under the hood, we use efficient caching to make the computations blazing fast.
3636
- `CrossValidationReport`: get a skore estimator report for each fold of your cross-validation.
3737
- `ComparisonReport`: benchmark your skore estimator reports.
3838

examples/getting_started/plot_skore_getting_started.py

+11-2
Original file line numberDiff line numberDiff line change
@@ -12,7 +12,8 @@
1212
# #. Get assistance when developing your ML/DS projects to avoid common pitfalls
1313
# and follow recommended practices.
1414
#
15-
# * :class:`skore.EstimatorReport`: get an insightful report on your estimator
15+
# * :class:`skore.EstimatorReport`: get an insightful report on your estimator,
16+
# for evaluation and inspection
1617
#
1718
# * :class:`skore.CrossValidationReport`: get an insightful report on your
1819
# cross-validation results
@@ -82,11 +83,19 @@
8283
roc_plot.plot()
8384
plt.tight_layout()
8485

86+
# %%
87+
# Furthermore, we can inspect the model using the permutation feature importance:
88+
89+
# %%
90+
log_reg_report.feature_importance.feature_permutation().T.boxplot(vert=False)
91+
plt.tight_layout()
92+
8593
# %%
8694
# .. seealso::
8795
#
8896
# For more information about the motivation and usage of
89-
# :class:`skore.EstimatorReport`, see :ref:`example_estimator_report`.
97+
# :class:`skore.EstimatorReport`, see :ref:`example_estimator_report` for evaluation
98+
# and :ref:`example_feature_importance` for inspection.
9099

91100

92101
# %%

0 commit comments

Comments
 (0)