Support AUC on CV for classification problem #477

oooo26 · 2023-01-29T12:58:55Z

Use (negative) AUC as the CV score for LogisticRegression and MultinomialRegression.
Both the OvO and OvR algorithms are implemented for the multinomial case.
Speed and performance have been tested.
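
For readers unfamiliar with the idea, here is a minimal sketch of a negative-AUC score for the binary case, in plain Python. It is not the C++ code from this PR; the helper names `binary_auc` and `neg_auc_score` are hypothetical, and labels are assumed to be coded 0/1. The AUC is negated so that, like a loss, a smaller value means a better model.

```python
def binary_auc(y_true, y_score):
    """AUC from the rank-sum (Mann-Whitney U) statistic; ties get average ranks.

    Assumes y_true contains only the labels 0 and 1.
    """
    n = len(y_true)
    order = sorted(range(n), key=lambda i: y_score[i])
    ranks = [0.0] * n
    i = 0
    while i < n:
        # Find the end of the current group of tied scores.
        j = i
        while j + 1 < n and y_score[order[j + 1]] == y_score[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # 1-based average rank of the tie group
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1

    n_pos = sum(1 for y in y_true if y == 1)
    n_neg = n - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("AUC needs at least one positive and one negative sample")

    rank_sum_pos = sum(r for r, y in zip(ranks, y_true) if y == 1)
    u_stat = rank_sum_pos - n_pos * (n_pos + 1) / 2
    return u_stat / (n_pos * n_neg)


def neg_auc_score(y_true, y_score):
    """Negated AUC, so it can be minimised like a loss on a held-out fold."""
    return -binary_auc(y_true, y_score)


# Toy validation fold: labels and predicted probabilities of class 1.
y_val = [0, 0, 1, 1]
p_val = [0.1, 0.4, 0.35, 0.8]
print(neg_auc_score(y_val, p_val))
```

In a K-fold setting, one such value would be computed per held-out fold and averaged, and the candidate model with the smallest average would be kept.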

codecov · 2023-01-29T13:01:47Z

Codecov Report

Base: 96.27% // Head: 97.72% // Increases project coverage by +1.45% 🎉

Coverage data is based on head (c903187) compared to base (994fe73).
Patch coverage: 100.00% of modified lines in pull request are covered.

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #477      +/-   ##
==========================================
+ Coverage   96.27%   97.72%   +1.45%     
==========================================
  Files          27        7      -20     
  Lines        2977      968    -2009     
==========================================
- Hits         2866      946    -1920     
+ Misses        111       22      -89

Flag	Coverage Δ
Python	`97.72% <100.00%> (+0.14%)`	⬆️
rpackage	`?`

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
python/abess/linear.py	`99.11% <ø> (ø)`
python/abess/pca.py	`100.00% <ø> (ø)`
python/abess/utilities.py	`91.11% <ø> (ø)`
python/abess/bess_base.py	`98.87% <100.00%> (+0.05%)`	⬆️
python/abess/decomposition.py	`94.56% <100.00%> (+0.51%)`	⬆️
R-package/R/abesspca.R
R-package/R/utility.R
R-package/R/print.abessrpca.R
R-package/R/deviance.abess.R
R-package/R/initialization.R
... and 15 more


oooo26 · 2023-01-29T13:04:11Z

Also, I think ic_type could be renamed to eval_type, because this argument is no longer only for IC.
And shall we add an argument to specify OvO/OvR for MultinomialRegression?

Mamba413 · 2023-01-29T14:23:02Z

@oooo26, shall we change ic_type to eval_type?

Mamba413 · 2023-01-29T14:33:35Z

@oooo26, shall we change ic_type to eval_type?

Or maybe another name. You can search scikit-learn for reference.

oooo26 · 2023-01-29T15:03:10Z

> Or maybe another name. You can search scikit-learn for reference.

Oh yes. They have a scoring argument in LogisticRegressionCV to control it, but not in the non-CV one. They don't seem to offer different IC types for classification either, only for Lasso.

Shall we add a cv_score argument or something similar? (A bare scoring might be misunderstood, since our regressor combines the CV and non-CV cases.)

oooo26 · 2023-01-29T15:09:24Z

And I think ic_type and cv_score can share the same API in C++ (that is what I have done so far), since only one of the two criteria will be used at a time.

Mamba413 · 2023-01-29T15:31:56Z

Two excellent comments. Just do it.

Mamba413 · 2023-01-29T15:34:54Z

@EQUIWDH, you can inspect the C++ code and implement the c-index similarly.

oooo26 · 2023-01-29T15:55:35Z

> Two excellent comments. Just do it.

OK! I will update soon.

And @EQUIWDH, if you would like to add another loss such as the c-index for CV, please see the test_loss() function in Metric.h. (The other changed files are just trivial edits.)

oooo26 · 2023-01-30T06:12:30Z

Add cv_score:

cv_score='test_loss': default;
cv_score='roc_auc': additional for logistic;
cv_score='roc_auc_ovo' or 'roc_auc_ovr': additional for multinomial;

Since the AUC score seems to perform better for logistic regression (in both speed and accuracy), maybe we can make LogisticRegression(cv_score='roc_auc') the default?
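
To illustrate the difference between the 'roc_auc_ovr' and 'roc_auc_ovo' options, here is a minimal plain-Python sketch of the two usual macro-averaged multiclass AUC definitions. It is an illustration only, not the C++ implementation in this PR; the function names are hypothetical, and it assumes classes are coded 0..K-1 so that a class label indexes its probability column.

```python
from itertools import combinations


def pairwise_auc(labels, scores):
    """Fraction of (positive, negative) pairs ranked correctly; ties count 1/2."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    wins = sum(1.0 if p > q else 0.5 if p == q else 0.0 for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


def auc_ovr(y_true, proba):
    """One-vs-rest: each class against all others, then an unweighted mean."""
    classes = sorted(set(y_true))
    aucs = [
        pairwise_auc([int(y == k) for y in y_true], [row[k] for row in proba])
        for k in classes
    ]
    return sum(aucs) / len(aucs)


def auc_ovo(y_true, proba):
    """One-vs-one: every pair of classes, using only samples from that pair."""
    classes = sorted(set(y_true))
    pair_aucs = []
    for a, b in combinations(classes, 2):
        idx = [i for i, y in enumerate(y_true) if y in (a, b)]
        # Score the pair twice, once with each class as the positive one.
        auc_a = pairwise_auc([int(y_true[i] == a) for i in idx],
                             [proba[i][a] for i in idx])
        auc_b = pairwise_auc([int(y_true[i] == b) for i in idx],
                             [proba[i][b] for i in idx])
        pair_aucs.append((auc_a + auc_b) / 2)
    return sum(pair_aucs) / len(pair_aucs)


# Toy held-out fold with three classes; each row holds class probabilities.
y_val = [0, 0, 1, 1, 2, 2]
p_val = [
    [0.7, 0.2, 0.1],
    [0.3, 0.5, 0.2],
    [0.2, 0.6, 0.2],
    [0.4, 0.4, 0.2],
    [0.1, 0.2, 0.7],
    [0.2, 0.2, 0.6],
]
print(-auc_ovr(y_val, p_val), -auc_ovo(y_val, p_val))  # negated, as for a loss
```

OvR needs K binary problems while OvO needs K(K-1)/2, so OvO costs more as the number of classes grows but is less sensitive to class imbalance.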

oooo26 · 2023-01-30T07:06:55Z

Oh, there is a small conflict in R: when using CV, ic_type should be set to 0 so that the test loss is used as before.

But currently it is 1. We may need to change it in R-package/R/utility.R. (Otherwise, there will be some annoying warnings...)

Mamba413 · 2023-01-31T01:54:38Z

I have just skimmed the R code in R-package, and I think @bbayukari can quickly address the remaining warning in abesspca (see https://github.com/abess-team/abess/actions/runs/4044489654/jobs/6954695218).

oooo26 · 2023-01-31T02:31:11Z

> I have just skimmed the R code in R-package, and I think @bbayukari can quickly address the remaining warning in abesspca (see https://github.com/abess-team/abess/actions/runs/4044489654/jobs/6954695218).

Yep, just setting ic_type=0 under CV will fix it, because ic_type=1 now represents the AUC score for logistic regression.
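
As a rough illustration of why the two arguments can share one integer slot, here is a small Python sketch. Only the two CV codes mentioned in this thread (0 for test loss, 1 for AUC in the logistic case) are taken from the discussion; the function name `resolve_eval_code`, its parameters, and the pass-through of the IC code are assumptions made for illustration, not the actual abess wrapper code.

```python
# Codes stated in this thread for the CV case: 0 = test loss, 1 = AUC (logistic).
# Codes for 'roc_auc_ovo' / 'roc_auc_ovr' are not stated here, so they are omitted.
CV_SCORE_CODES = {"test_loss": 0, "roc_auc": 1}


def resolve_eval_code(use_cv, ic_code, cv_score="test_loss"):
    """Return the single integer passed down as the evaluation criterion.

    Hypothetical helper: without CV the information-criterion code is used;
    with CV it is ignored and the cv_score code takes its place.
    """
    if not use_cv:
        return ic_code
    if cv_score not in CV_SCORE_CODES:
        raise ValueError(f"unsupported cv_score in this sketch: {cv_score!r}")
    return CV_SCORE_CODES[cv_score]


# Under CV the caller must end up with 0 to keep the old test-loss behaviour,
# whatever IC code happens to be configured.
print(resolve_eval_code(use_cv=True, ic_code=1, cv_score="test_loss"))
```

This is the situation behind the R warning: a leftover IC code of 1 reaching the CV path would be read as a request for AUC.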

Mamba413 · 2023-01-31T06:33:35Z

> > I have just skimmed the R code in R-package, and I think @bbayukari can quickly address the remaining warning in abesspca (see https://github.com/abess-team/abess/actions/runs/4044489654/jobs/6954695218).

> Yep, just setting ic_type=0 under CV will fix it, because ic_type=1 now represents the AUC score for logistic regression.

Seems everything is fine now! You can merge it if you also believe this is fine.

oooo26 and others added 3 commits January 28, 2023 21:23

Support AUC for LogisticRegression

2ec6699

Support AUC for Multinomial

0e8e7e9

Merge branch 'abess-team:master' into master

4d1a861

oooo26 and others added 2 commits January 29, 2023 21:08

Formatting

94a53ae

Revert API on api.cpp/.h

6e22ccb

Mamba413 linked an issue Jan 29, 2023 that may be closed by this pull request

Cox model: c-index #466

Open

Mamba413 added the enhancement New feature or request label Jan 29, 2023

oooo26 and others added 2 commits January 29, 2023 23:55

Merge branch 'abess-team:master' into master

06644f7

Add cv_score argument

2f9ac8a

oooo26 added 2 commits January 30, 2023 14:22

Bug fix

90c83be

Add warning msg

645d5bf

Set ic_type=0 while using CV in R

8663d35

Avoid duplicate warnings

b8fa2d0

oooo26 force-pushed the master branch from bc79125 to b8fa2d0 Compare January 31, 2023 02:57

Set ic_type=0 while using CV in R

c903187

oooo26 merged commit 7d2bd24 into abess-team:master Jan 31, 2023

Mamba413 mentioned this pull request Feb 10, 2023

All types of tune go wrong in PCA model #484

Closed
