make permutation importance test optional, functionality to use any outcome variables, optional hyperparameter input #11

zenalapp · 2019-11-11T18:37:56Z

Fixes #6
Fixes #7
Fixes #8
Fixes #10

BTopcuoglu · 2019-11-11T22:21:13Z

get_results(dataset, models, split_number, outcome="dx") errors out when the user defines outcome.
pipeline(dataset, models, split_number, outcome="dx") works, so the problem must be in how the argument is passed from get_results to the pipeline function.

BTopcuoglu · 2019-11-12T16:18:40Z

Defined the outcome variable and the permutation flag (a logical) as arguments that can be passed on the command line. We can now run from the command line with:

Rscript code/learning/main.R 1 "L2_Logistic_Regression" "dx" 0

address the last comment in SchlossLab#11
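For readers following along, here is a minimal sketch of how a script could read those four positional arguments. It is not the actual main.R: the helper name parse_args and the assumed argument order (split number, model, outcome, permutation flag) are inferred from the example command above.

```r
# Hypothetical helper (not from the repository): parse positional arguments
# of the assumed form  <split_number> <model> <outcome> <perm>
parse_args <- function(args) {
  if (length(args) != 4) {
    stop("Expected 4 arguments: split_number model outcome perm")
  }
  list(
    split_number = as.integer(args[1]),
    model        = args[2],
    outcome      = args[3],
    # "0" becomes FALSE and "1" becomes TRUE
    perm         = as.logical(as.integer(args[4]))
  )
}

# Inside a script run with Rscript, the input would come from
# commandArgs(trailingOnly = TRUE); a literal vector is used here instead.
parsed <- parse_args(c("1", "L2_Logistic_Regression", "dx", "0"))
print(parsed$perm)
```

Going through as.integer first matters because as.logical("0") returns NA in R, while as.logical(0L) returns FALSE.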

BTopcuoglu · 2019-11-12T16:39:16Z

The previous changes for setting outcome and perm on the command line work when the user passes the outcome (e.g. "dx") as an argument. However, if they leave that argument out, so that it is passed as NA (e.g. Rscript code/learning/main.R 1 "L2_Logistic_Regression" 0), the rest of the pipeline breaks because the first column does not get selected.

I changed the order of the arguments in get_aucs function and then, instead of NULL, I used NA:

get_results <- function(dataset, models, split_number, perm=T, outcome=NA, hyperparameters=NULL)

Changed pipeline function to have NA as well.

pipeline <- function(dataset, model, split_number, outcome=NA, hyperparameters=NULL, perm=T)

Removed the outcome=NULL default from the tuning_grid function because the outcome should already be decided in the previous functions:

tuning_grid <- function(train_data, model, outcome, hyperparameters=NULL)
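A small sketch of the NA default described above, assuming the intended fallback is the first column of the dataset. The helper name resolve_outcome and the toy data are made up for illustration and are not part of the pipeline.

```r
# Hypothetical helper: resolve the outcome column name.
# NA (the default) falls back to the first column of the dataset.
resolve_outcome <- function(dataset, outcome = NA) {
  if (is.na(outcome)) {
    outcome <- colnames(dataset)[1]
  }
  if (!(outcome %in% colnames(dataset))) {
    stop("Outcome column not found: ", outcome)
  }
  outcome
}

# Toy data, assumed layout: outcome first, features after
toy <- data.frame(
  dx   = c("cancer", "normal"),
  Otu1 = c(0.1, 0.3),
  Otu2 = c(5, 2)
)

default_outcome <- resolve_outcome(toy)          # falls back to the first column
chosen_outcome  <- resolve_outcome(toy, "Otu1")  # user-defined outcome
```

One plausible reason NA is easier to handle than NULL here: indexing past the end of a character vector (for example a short argument vector) yields NA, and is.na(NULL) returns a zero-length logical, which makes a plain if () test fail.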

BTopcuoglu · 2019-11-12T17:00:15Z

With these changes, it looks like we fixed #6 and #7. The next step is checking whether permutation works as we want in #8.

BTopcuoglu · 2019-11-12T17:26:41Z

permutation_importance function doesn't work.
Error:

Error in -sym(first_outcome) : invalid argument to unary operator
Calls: get_results ... <Anonymous> -> vars_select_eval -> map_if -> map -> .f

Caught the bug in line 87 and changed from first_outcome to outcome:

  non_correlated_otus <- full %>%
    select(-correlated_otus) %>%
    select(-sym(outcome)) %>%
    colnames()

zenalapp · 2019-11-12T17:38:13Z

Nooo I was worried this wouldn't work but I wasn't doing permutation importance so I didn't catch it. I can look into another option if you don't know of one

BTopcuoglu · 2019-11-12T17:44:17Z

> Nooo I was worried this wouldn't work but I wasn't doing permutation importance so I didn't catch it. I can look into another option if you don't know of one

I'm running it now with the change I've made. It was passing first_outcome, which is not a column that can be selected, but it now uses sym("dx"), which should work theoretically. I'll keep you posted.

zenalapp · 2019-11-12T17:51:36Z

But don't we want to have dx not hard-coded?

BTopcuoglu · 2019-11-12T17:52:57Z

> But don't we want to have dx not hard-coded?

No I know, I have it as:

  non_correlated_otus <- full %>%
    select(-correlated_otus) %>%
    select(-sym(outcome)) %>%
    colnames()

Instead of what it was before:

  non_correlated_otus <- full %>%
    select(-correlated_otus) %>%
    select(-sym(first_outcome)) %>%
    colnames()

BTopcuoglu · 2019-11-12T20:33:56Z

Our previous attempt was unsuccessful - must be a bug with tidyverse. I made a new change:

  non_correlated_otus <- full %>%
    select(-correlated_otus)
  
  non_correlated_otus[,outcome] <- NULL
  
  non_correlated_otus <- non_correlated_otus %>%
    colnames()

Not the most beautiful code snippet I've written but it'll do:)

Fixes SchlossLab#8
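As a possible alternative to the workaround above, the same column names can be computed in base R with setdiff(), which takes the outcome as a plain string and avoids tidy evaluation entirely. This is a sketch on toy data; it assumes correlated_otus is a character vector of column names, which the thread does not state explicitly.

```r
# Toy stand-in for the `full` data frame (assumed layout)
full <- data.frame(
  dx   = c("cancer", "normal", "cancer"),
  Otu1 = c(0.10, 0.30, 0.20),
  Otu2 = c(0.11, 0.29, 0.21),
  Otu3 = c(5, 2, 7)
)
correlated_otus <- c("Otu1", "Otu2")  # assumption: names of correlated features
outcome <- "dx"                       # user-defined outcome column

# Keep every column name that is neither a correlated OTU nor the outcome
non_correlated_otus <- setdiff(colnames(full), c(correlated_otus, outcome))
print(non_correlated_otus)
```

The original error is probably not a tidyverse bug as such: sym(outcome) returns a symbol object, and unary minus cannot be applied to a symbol unless it is unquoted first (for example with !!). Newer tidyselect releases also provide all_of() for selecting columns from a character vector.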

BTopcuoglu · 2019-11-13T18:20:02Z

Regarding #8: permutation importance is now optional, but the data structure used to run the permutation is still hard-coded. We need to come back to that and fix it.

I'll now check whether user-defined hyperparameters work (#10).

BTopcuoglu · 2019-11-13T19:04:44Z

The default (NULL) options for hyperparameters are currently specific to my CRC classification problem (except random forest, where we implement Pat's code: mtry <- floor(seq(1, n_features, length=6)) ). Those need to be adjusted and expanded in the future, but overall, I'm able to set up user-defined hyperparameters as a list and it works. Fixed #10.
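To illustrate the idea, here is a rough sketch of falling back to defaults only when no hyperparameter list is supplied. The helper name get_hyperparameters, the model label "Random_Forest", and the cost values are hypothetical; only the mtry formula comes from the comment above.

```r
# Hypothetical helper: use the user's list if given, otherwise a default
get_hyperparameters <- function(model, n_features, hyperparameters = NULL) {
  if (!is.null(hyperparameters)) {
    return(hyperparameters)
  }
  if (model == "Random_Forest") {  # assumed model label
    # Six mtry values spread between 1 and the number of features
    return(list(mtry = floor(seq(1, n_features, length.out = 6))))
  }
  stop("No default hyperparameters for model: ", model)
}

# Default grid for random forest with 20 features
defaults <- get_hyperparameters("Random_Forest", n_features = 20)

# User-defined hyperparameters passed as a list (values are made up)
user_defined <- get_hyperparameters(
  "L2_Logistic_Regression",
  n_features = 20,
  hyperparameters = list(cost = c(0.01, 0.1, 1))
)

# A list of vectors can be expanded into a tuning grid, one row per combination
grid <- expand.grid(user_defined)
```

With 20 features the default mtry values are 1, 4, 8, 12, 16 and 20.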

zenalapp added 3 commits November 11, 2019 13:13

make permutation test optional and fix hard-coding of outcome variable

3012f09

remove perm=T from function because not needed

f84804f

add option to specify hyperparameters to test for cross-validation

958d642

zenalapp changed the title ~~make permutation importance test optional & functionality to use any outcome variables~~ make permutation importance test optional, functionality to use any outcome variables, optional hyperparameter input Nov 11, 2019

include info about specifying hyperparameters

c9e4dba

fix the issue of passing outcome argument

b188ed2

add arguments to pass to outcome and permutation

2ffd593

address the last comment in SchlossLab#11

BTopcuoglu added 4 commits November 12, 2019 11:39

change from NULL to NA

8cc92b2

outcome changed from NULL to NA

076fdcb

change the position of perm and outcome

185e210

remove outcome=NULL, it is already set before

1e6862e

fix the tidyverse bug (select out outcome column)

ac0b248

Fixes SchlossLab#8

BTopcuoglu added enhancement New feature or request good first issue Good for newcomers labels Dec 4, 2019
