Extend PipeOpLearnerCV for other resamplings #500

pfistfl · 2020-09-09T10:32:37Z

levels = c("cv", "insample") is currently very limited and not flexibly extensible i.e. for mlr3spatiotmpcv resamplings.
Might need some mechanism to allow a subset of mlr_resamplings

The text was updated successfully, but these errors were encountered:

mb706 · 2020-09-11T18:56:38Z

problem to anticipate here is that not all resamplings create a prediction for every input row (#216 will get relevant here), and some may create multiple predictios (does mean averaging usually make sense?)

pfistfl · 2020-09-11T20:47:55Z

This is rather urgent though, as the current version blocks e.g. mlr3spatiotempcv resampling and makes the PO non-extensible which popped up in at least one project.
We could perhaps allow all mlr_resamplings and just document that this might break?

In general, another possibility would be to

in case of e.g. holdout, just fill up the rest of the predictions with ǸA`
in case multiple predictions exist just create additional columns as needed.
the current return format is a data.table anyway.

mb706 · 2020-09-11T20:55:32Z

filling with NA is probably a good idea. I don't like multiple columns because the number of output columns must be the same in train() and predict. I guess if it's supposed to be quick we can just mean(). It would be nice though to do something sensible with se and prob. (But I guess we haven't solved that problem for PipeOpRegrAvg / PipeOpClassifAvg either.)

pfistfl · 2020-09-14T14:19:32Z

The number of output cols: We can just store the number of outputs in train and enforce the same length in test.
Aggregating using mean or m̀ode` might also work (perhaps this could be an option),

sumny self-assigned this Sep 30, 2020

sumny linked a pull request Oct 1, 2020 that will close this issue

open up PipeOpLearnerCV to all resampling methods #513

Open

mb706 added the Type: Enhancement label Sep 29, 2021

mb706 added the Tag: POFU label Aug 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extend PipeOpLearnerCV for other resamplings #500

Extend PipeOpLearnerCV for other resamplings #500

pfistfl commented Sep 9, 2020

mb706 commented Sep 11, 2020

pfistfl commented Sep 11, 2020

mb706 commented Sep 11, 2020

pfistfl commented Sep 14, 2020

Extend PipeOpLearnerCV for other resamplings #500

Extend PipeOpLearnerCV for other resamplings #500

Comments

pfistfl commented Sep 9, 2020

mb706 commented Sep 11, 2020

pfistfl commented Sep 11, 2020

mb706 commented Sep 11, 2020

pfistfl commented Sep 14, 2020