feat: validation task #983

sebffischer · 2023-12-14T17:48:01Z

TODOs:

maybe we should actually rename the test task to validation (?) But the naming is still cofusing as the resampling's test set then becomes the validation set ...
some more checks that verify that the holdout and validation task are compatible with the primary task. Pay attention to the different task types (e.g. don't check for target in clustering task).

This PR enables to solve the problem that the test rows, that can e.g. used for early stopping by xgboost, can be preprocessed in a graph learner and that early stopping xgboost in a graph learner now works.

Some explanations for the changes:

The relevant lines of code, that restricted how we can implement the preprocessing of test rows can be found here: https://github.com/mlr-org/mlr3pipelines/blob/044762e64e68c4aec39cd2e6b6e1f8ef45f135ca/R/PipeOpTaskPreproc.R#L211-L218. First, the private $.train_task(task) method modifies the 'use' rows of task in-place (usually by cbinding, but in principle, anything can happen here, and users have possibly overwritten this method when inheriting from PipeOpTaskPreproc.
After setting the state of the PipeOp, somehow the predictions must be made on the test rows, and added to the task. We previously explored row-binding them to the task, but this was inefficient, as row-binding requires to row-bind all columns, even if they were not altered by the pipeop. In a graph, this would introdcues a rbind-cbind-rbind-cbind, ..., rbind-cbind backend structure, which is a) hard to flatten and b) memory inefficient and can get possibly slow. The solution implemented in this Pull Request sidesteps this problem by simply adding the test task to the task itself, using the newly introduced AB $test_task. The test task can be conveniently created by the user, using the newly introduced $partition() method.
In practice, this now looks as follows:

library(mlr3)
library(mlr3pipelines)

task = tsk("iris")
task
#> <TaskClassif:iris> (150 x 5): Iris Flowers
#> * Target: Species
#> * Properties: multiclass
#> * Features (4):
#>   - dbl (4): Petal.Length, Petal.Width, Sepal.Length, Sepal.Width
task$divide(1:10, "test")
task
#> <TaskClassif:iris> (140 x 5): Iris Flowers
#> * Target: Species
#> * Properties: multiclass
#> * Features (4):
#>   - dbl (4): Petal.Length, Petal.Width, Sepal.Length, Sepal.Width
#> * Test Task: (10x5)

task$test_task
#> <TaskClassif:iris> (10 x 5): Iris Flowers
#> * Target: Species
#> * Properties: multiclass
#> * Features (4):
#>   - dbl (4): Petal.Length, Petal.Width, Sepal.Length, Sepal.Width

po_pca = po("pca")

taskout = po_pca$train(list(task))[[1L]]
taskout$test_task
#> <TaskClassif:iris> (10 x 5): Iris Flowers
#> * Target: Species
#> * Properties: multiclass
#> * Features (4):
#>   - dbl (4): PC1, PC2, PC3, PC4

^{Created on 2024-02-16 with reprex v2.0.2}

PipeOps always preprocess the test_task when it is provided. However, a GraphLearner only wants to do the preprocessing on the test rows, when they are needed otherwise this is unnecessary computation (as they are currently not used for the learner's $predict() step. To communicate this, the 'uses_test_task' property was introduced.
Because the 'uses_test_task' property is not fixed (its presence depends e.g. on whether he early_stopping_set parameter from XGBoost is set to "test" or "none"), it was necessary to add the ability to dynamically generate a learner's properties. This was done using the private method .contingent_properties() that can be overwritten by learners. It is necessary to set this method in the Learner base class to a function returning character(0) (and not NULL), because of a bug in R6.
Retired interface: We previously had the API task$set_row_roles(1, "test") or task$set_row_roles(1, "holdout").
Because we now introduced the $test_task field, there would have been two ways to achieve something similar. This made code messy and the interface confusing. For this reason, both the holdout and test row-roles were removed.

Because this PR breaks some existing packages (because of the removal of the 'holdout' and 'test' row roles), I have already created Pull Requests in some packages:

TODO: check whether I really got all packages (only checked those that I have locally available)

The general plan to merge this feature is to:

Make releases for these PRs:
- mlr3learners: Feat/train predict mlr3learners#288 (Xgboost, only dev and paramtest are failing)
- mlr3tuning: Fix/train predict mlr3tuning#413 (holdout set is used)
- mcboost: update vignette to not use holdout role mcboost#44 (vignette uses holdout set)
- mlr3fairness fix incorrect predict set mlr3fairness#74 (there is a bug that I did not cause)
- mlr3pipelineshttps://github.com/mlr-org/mlr3pipelines/pull/761/files (this is needed, because of the way the graphlearner sets its properties)
Merge this branch and make a release on CRAN
Implement the feature in pipelines and make a release from this branch:

feat: preprocess test_task in graph mlr3pipelines#760

Make changes in mlr3extralearners and bump mlr3 dependency
Make a gallery post about this

R/Learner.R

NEWS.md

be-marc · 2024-05-17T10:50:38Z

R/Task.R

+    #'   If `TRUE` (default), the `row_ids` are removed from the primary task's active `"use"` rows.
+    #'
+    #' @return Modified `self`.
+    divide = function(x, remove = TRUE) {


Why not two parameters?

be-marc and others added 9 commits December 7, 2023 15:23

refactor: remove task prototype when resample

2c77828

refactor: add option to store prototype

2164d4d

fix: braket

21ed459

refactor: null

cef8117

fix: browser

48f40fa

keep prototypes in state when store_models is TRUE

1c027cd

feat(Learner): uses_test_set active binding

8788355

...

b29d558

...

c2726c9

sebffischer changed the title ~~feat: uses_test_set field for learner~~ feat: contingent properties and validation support Jan 23, 2024

sebffischer added 2 commits January 23, 2024 18:27

...

73247c4

...

ddfa67d

sebffischer commented Jan 25, 2024

View reviewed changes

R/Learner.R Outdated Show resolved Hide resolved

Update R/Learner.R

dd65441

sebffischer commented Jan 25, 2024

View reviewed changes

R/Learner.R Outdated Show resolved Hide resolved

sebffischer added 2 commits January 25, 2024 13:14

Update R/Learner.R

1dd12cc

...

3d06488

sebffischer changed the title ~~feat: contingent properties and validation support~~ feat: contingent properties and test_test_rows support Jan 27, 2024

sebffischer added 2 commits January 29, 2024 08:43

allow cbinding test rows to task

c58da30

add test

02fa3dd

sebffischer changed the title ~~feat: contingent properties and test_test_rows support~~ feat: contingent properties and use_test_rows support Feb 7, 2024

allow to cbind test rows

3aabaef

sebffischer commented Feb 12, 2024

View reviewed changes

NEWS.md Outdated Show resolved Hide resolved

sebffischer added 7 commits February 12, 2024 16:13

Update NEWS.md

5b983ee

avoid unnecessary sort

9e8ae85

work on test and holdout tas

9a0e954

Merge branch 'main' into feat/train-predict

379e2f2

BREAKING_CHANGE: test/holdout task replace test/holdout roles

4cb8b8e

fix some issues regarding test task

448296d

better news

343f475

sebffischer changed the title ~~feat: contingent properties and use_test_rows support~~ feat: test and holdout task Feb 20, 2024

sebffischer added 7 commits February 20, 2024 14:56

pipelines dependency

944adfc

uber hack for revdepcheck

bdd54fb

remove remotes

c465b7f

optimization

29e21bd

refactor: partition method is now called divide

1c06fb7

comment hack

fe54dac

rename test -> validation

3bd7284

sebffischer changed the title ~~feat: test and holdout task~~ feat: validation and holdout task Mar 18, 2024

sebffischer changed the title ~~feat: validation and holdout task~~ feat: validation task Mar 19, 2024

sebffischer added 2 commits March 19, 2024 19:02

some progress

4c0f4c7

...

d47a4ba

be-marc reviewed May 17, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: validation task #983

feat: validation task #983

sebffischer commented Dec 14, 2023 •

edited

be-marc May 17, 2024

feat: validation task #983

Are you sure you want to change the base?

feat: validation task #983

Conversation

sebffischer commented Dec 14, 2023 • edited

be-marc May 17, 2024

Choose a reason for hiding this comment

sebffischer commented Dec 14, 2023 •

edited