
Conversation

@ArturoAmorQ (Member) commented Aug 18, 2022

Reference Issues/PRs

Fixes #13546.
Fixes #24288.
Takes #24192 into account.

What does this implement/fix? Explain your changes.

As discussed in #13546, the current state of the roc_plot.py example gives a different macro-averaged AUC than roc_auc_score because they use different averaging strategies. This PR aims to clarify the different averaging strategies by turning the example into a tutorial-style walkthrough (a "tutorialization").
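To illustrate the two averaging strategies under discussion, here is a minimal sketch (the dataset, model, and grid size are my own choices for illustration, not taken from the PR): strategy 1 averages the per-class AUCs, as roc_auc_score does with average="macro"; strategy 2 interpolates the per-class ROC curves on a common FPR grid, averages the TPRs, and takes the AUC of the mean curve.

```python
import numpy as np
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import auc, roc_auc_score, roc_curve
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import label_binarize

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
y_score = LogisticRegression(max_iter=1000).fit(X_train, y_train).predict_proba(X_test)
y_onehot = label_binarize(y_test, classes=[0, 1, 2])

# Strategy 1: average the per-class one-vs-rest AUCs
# (this is what roc_auc_score computes with average="macro").
macro_auc = roc_auc_score(y_test, y_score, multi_class="ovr", average="macro")

# Strategy 2: interpolate each per-class ROC curve onto a common
# FPR grid, average the interpolated TPRs, then take the AUC of
# the resulting mean curve.
fpr_grid = np.linspace(0.0, 1.0, 1000)
mean_tpr = np.zeros_like(fpr_grid)
for i in range(3):
    fpr, tpr, _ = roc_curve(y_onehot[:, i], y_score[:, i])
    mean_tpr += np.interp(fpr_grid, fpr, tpr)
mean_tpr /= 3
interp_auc = auc(fpr_grid, mean_tpr)

print(macro_auc, interp_auc)
```

The two strategies generally give similar but not identical numbers, which is the discrepancy reported in #13546.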

Any other comments?

Side effect: Implements notebook style as intended in #22406.

@cmarmo (Contributor) left a comment

Hi @ArturoAmorQ, thanks for your work.
I've made some comments, mainly about the format.

@ogrisel (Member) left a comment

Here is a partial review. More to come in a few days hopefully.

@ArturoAmorQ ArturoAmorQ changed the title from "DOC Rework roc_plot.py example" to "DOC Rework plot_roc.py example" Aug 28, 2022
@glemaitre glemaitre self-requested a review September 13, 2022 08:51
@glemaitre (Member) left a comment

I think that we can state that the example is much better :)
I really like the narrative, I find it intuitive.

I left a few nitpicks that could clarify the code.

# Interpolate all ROC curves at these points
mean_tpr = np.zeros_like(fpr_grid)

for i in range(n_classes):
Member

If you don't want to use sp.interpolate.interp1d, then we should add a comment that np.interp does linear interpolation. Another downside of np.interp is that the x-coordinates need to be sorted in increasing order.

@ArturoAmorQ (Member Author)

I am hesitant about which is best, as we actually use np.interp to compute the tpr in the private function _binary_roc_auc_score. Also, switching to interp1d would add boilerplate code below. Maybe the easiest solution is indeed to add a comment on linear interpolation.
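The np.interp behavior discussed in this thread can be sketched with toy per-class ROC points (the FPR/TPR values below are invented for illustration): np.interp performs linear interpolation and requires its x-coordinates to be sorted in increasing order, which roc_curve already guarantees.

```python
import numpy as np

# Toy per-class ROC vertices (hypothetical values for illustration).
# np.interp requires the x-coordinates (FPR) to be sorted in
# increasing order; roc_curve returns them already sorted.
fpr_per_class = [
    np.array([0.0, 0.2, 1.0]),
    np.array([0.0, 0.5, 1.0]),
]
tpr_per_class = [
    np.array([0.0, 0.8, 1.0]),
    np.array([0.0, 0.6, 1.0]),
]

# Interpolate all ROC curves at common FPR points and average them.
fpr_grid = np.linspace(0.0, 1.0, 5)
mean_tpr = np.zeros_like(fpr_grid)
for fpr, tpr in zip(fpr_per_class, tpr_per_class):
    # np.interp draws straight lines between the given vertices,
    # i.e. it does *linear* interpolation.
    mean_tpr += np.interp(fpr_grid, fpr, tpr)
mean_tpr /= len(fpr_per_class)
```

Using interp1d instead would require constructing one interpolator object per class, which is the extra boilerplate mentioned above.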

ArturoAmorQ and others added 2 commits September 15, 2022 11:42
Co-authored-by: Guillaume Lemaitre <[email protected]>
Co-authored-by: Guillaume Lemaitre <[email protected]>
@ogrisel (Member) left a comment

Thanks @ArturoAmorQ for refactoring and polishing this example. It really looks good. Just a few more suggestions below:

@haiatn (Contributor) commented Oct 4, 2022

What is missing so this could be merged?

@ArturoAmorQ (Member Author)

> What is missing so this could be merged?

I still have to tweak the plot legends to reduce their overlap with the curves; I just haven't had the time. But (hopefully) this will be ready to merge soon ;) Thanks for your interest, @haiatn!

@ArturoAmorQ ArturoAmorQ requested a review from glemaitre October 6, 2022 14:36
@glemaitre glemaitre merged commit a8f0858 into scikit-learn:main Oct 13, 2022
@glemaitre (Member)
Thanks @ArturoAmorQ LGTM.

@ArturoAmorQ ArturoAmorQ deleted the plot_roc branch October 14, 2022 14:56
glemaitre added a commit to glemaitre/scikit-learn that referenced this pull request Oct 31, 2022
Co-authored-by: Olivier Grisel <[email protected]>
Co-authored-by: Chiara Marmo <[email protected]>
Co-authored-by: Guillaume Lemaitre <[email protected]>
Successfully merging this pull request may close these issues.

- Add AUC plotting tools to "plot_roc" example
- roc_plot.py example gives different macro-averaged AUC than roc_auc_score
5 participants