Bug: small batch size with categorical variables #454

rajeeja · 2019-03-19T16:47:36Z

The link below is a standalone script for replicating the error to file the bug fix with mlrMBO

https://github.com/rajeeja/mlrmbo-bug/blob/master/mlrMBOMixedIntegerTest11a.R

Please let me know if you need more details.

jakob-r · 2019-03-25T11:15:57Z

Hi,
you are using the initial design in a weird way. It is simply too small for your big search space.

Why do you generate the design with max.budget points to then only take the first 5 (propose.points).

Your initial design has to contain each discrete value at least once so that the surrogate can make predictions.

For me it works with design = generateDesign(n = 30, par.set = getParamSet(obj.fun))

rajeeja · 2019-04-12T19:55:02Z

@jakob-r Thanks!
But "Your initial design has to contain each discrete value at least once so that the surrogate can make predictions." is not sufficient if I use the learner below:

surr.rf = makeLearner("regr.randomForest",
predict.type = "se",
fix.factors.prediction = TRUE,
se.method = "bootstrap",
se.boot = 2)

res = mbo(obj.fun, design = design, learner = surr.rf, control = ctrl, show.info = TRUE)

Complete isolated example is here
https://github.com/rajeeja/mlrmbo-bug/blob/master/learner-discrete-param-bug.R

jakob-r · 2019-04-15T14:30:03Z

True, my answer is kind of restricted to the surrogate. However, I have doubts that the surrogate will work so well, especially the uncertainty estimation for unknown factors. I am curious to see results of any optimization benchmark using this approach 🙂

rajeeja · 2019-04-15T17:59:47Z

Even if I increase the propose.points to 1000, I get the error:
Error in predict.randomForest(getLearnerModel(x), newdata = .newdata, :
New factor levels not present in the training data

for this example: https://github.com/rajeeja/mlrmbo-bug/blob/master/learner-discrete-param-bug.R

What should be a fix for getting something like this to work?

rajeeja · 2019-04-15T20:00:33Z

changing surr.rf = makeLearner("regr.randomForest", 
                      predict.type = "se", 
                      fix.factors.prediction = TRUE,
                      se.method = "bootstrap", 
                      se.boot = 8)

to

surr.rf = makeLearner("regr.randomForest", 
                      predict.type = "se", 
                      fix.factors.prediction = TRUE,
)

it works. I'll update you about results from this approach. Also older version works even with se->

rajeeja · 2019-04-16T00:37:33Z

just found that changing the se.method = "bootstrap", to

se.method = "jackknife",

works.

jakob-r closed this as completed Mar 25, 2019

rajeeja mentioned this issue Apr 4, 2019

Replace categorical variables in mlrMBO with e.g., integer params with mapping. ECP-CANDLE/Supervisor#54

Open

jakob-r reopened this Apr 16, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug: small batch size with categorical variables #454

Bug: small batch size with categorical variables #454

rajeeja commented Mar 19, 2019

jakob-r commented Mar 25, 2019

rajeeja commented Apr 12, 2019

jakob-r commented Apr 15, 2019

rajeeja commented Apr 15, 2019

rajeeja commented Apr 15, 2019 •

edited by jakob-r

rajeeja commented Apr 16, 2019

Bug: small batch size with categorical variables #454

Bug: small batch size with categorical variables #454

Comments

rajeeja commented Mar 19, 2019

jakob-r commented Mar 25, 2019

rajeeja commented Apr 12, 2019

jakob-r commented Apr 15, 2019

rajeeja commented Apr 15, 2019

rajeeja commented Apr 15, 2019 • edited by jakob-r

rajeeja commented Apr 16, 2019

rajeeja commented Apr 15, 2019 •

edited by jakob-r