chill out, Background #415

goldingn · 2018-03-06T04:14:17Z

Background has an na.omit() step before returning the dataframe, which removes rows if any column has NA values.

Ideally, this should only remove rows with NAs in the covariate columns (those columns named in the covCols attribute), since otherwise it causes problems if there are user-defined columns (#414).
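
A rough sketch of what covariate-only removal could look like. The helper name `dropNACovariates` and the example data frame are made up for illustration; the only thing taken from the module is the assumption that the data frame carries a `covCols` attribute naming the covariate columns.

```r
# Hypothetical helper: drop rows that have NA in a covariate column only.
# Assumes `df` has a "covCols" attribute naming the covariate columns.
dropNACovariates <- function(df) {
  covCols <- attr(df, "covCols")
  # complete.cases() is evaluated on the covariate columns alone
  keep <- stats::complete.cases(df[, covCols, drop = FALSE])
  out <- df[keep, , drop = FALSE]
  # Re-attach covCols in case subsetting did not carry it over
  attr(out, "covCols") <- covCols
  out
}

# Made-up example: `site` stands in for a user-defined column
df <- data.frame(value = c(1, 0, 1),
                 temp  = c(10.2, NA, 8.1),
                 site  = c("a", "b", NA),
                 stringsAsFactors = FALSE)
attr(df, "covCols") <- "temp"
cleaned <- dropNACovariates(df)
```

Here the row with a missing `temp` is dropped, while the row whose only NA is in the user-defined `site` column is kept.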


timcdlucas · 2018-03-06T09:40:18Z

I'm not convinced we want to automatically na.omit() based on covariates either.

Some models can handle NAs fine.
There might be cases where NA is the most appropriate value (some grouping variables or something perhaps?)
Process modules that impute missing data may be written later and auto removing all NAs will make that difficult.

The two options that seem reasonable to me are:

1. Leave everything by default and have a RemoveNAs process module (which I've written and am just waiting to find time to upload).
2. Have a na.rm argument to workflow.

Personally I like the first.
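
For illustration only, an opt-in NA-removal step might look something like the sketch below. This is not the RemoveNAs module mentioned above; the name `RemoveNAsSketch` is hypothetical, and it assumes a process step receives and returns a list whose `df` element is the data frame.

```r
# Hypothetical sketch of an opt-in NA-removal step (not the real RemoveNAs module).
# Assumes `.data` is a list with a data frame in its `df` element.
RemoveNAsSketch <- function(.data) {
  df <- .data$df
  covCols <- attr(df, "covCols")
  # Keep only rows with no NA in any column
  keep <- stats::complete.cases(df)
  .data$df <- df[keep, , drop = FALSE]
  # Re-attach covCols (if present) in case subsetting did not carry it over
  attr(.data$df, "covCols") <- covCols
  .data
}

# Made-up input for the sketch
input <- list(df = data.frame(value = c(1, 0, 1), temp = c(10.2, NA, 8.1)))
attr(input$df, "covCols") <- "temp"
output <- RemoveNAsSketch(input)
```

Because the step is only run when the user asks for it, the default behaviour leaves NAs in place for models or imputation steps that want them.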

AugustT · 2018-03-06T10:08:15Z

I agree 1 is the ideal solution. This would also require a review of existing models to ensure they work with NA values, and the addition of NA values to the module checking routines.

timcdlucas · 2018-03-06T10:14:31Z

But the current models don't need to work with NA values. I guess it would be good if they could error in a useful way if they can't handle NAs. Maybe that's what you mean by "work with NA values". But the workflow I imagine is:

`workflow(..., process = NoProcess, ...)`
Oh, this model can't handle NAs and I have NAs.
`workflow(..., process = RemoveNAs, ...)`
Ah good, it works now. But I'll have a think about whether I should impute or use different covariates or whatever.

AugustT · 2018-03-06T10:24:02Z

Yes, I mean throw a sensible error or remove NAs with a warning.

timcdlucas · 2018-03-06T10:30:08Z

OK yes then I totally agree. Should throw a sensible error.

goldingn · 2018-03-06T11:05:20Z

Yeah, I agree. Best thing is to make the existing models remove NAs (with a warning) so we don't break any existing workflows, and to remove na.omit here.
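
As a sketch of the two behaviours discussed (a sensible error, or removal with a warning), a model could call something like the helper below before fitting. The name `handleNAs` and its `onNA` argument are hypothetical, not part of any existing module.

```r
# Hypothetical helper a model could call before fitting.
# onNA = "error" stops with an informative message;
# onNA = "warn" drops incomplete rows and warns.
handleNAs <- function(df, onNA = c("error", "warn")) {
  onNA <- match.arg(onNA)
  incomplete <- !stats::complete.cases(df)
  if (!any(incomplete)) {
    return(df)
  }
  if (onNA == "error") {
    stop(sum(incomplete), " row(s) contain NA values, which this model ",
         "cannot handle; remove or impute them in a process module first.",
         call. = FALSE)
  }
  warning("Removing ", sum(incomplete), " row(s) containing NA values.",
          call. = FALSE)
  df[!incomplete, , drop = FALSE]
}
```

Using the "warn" behaviour in existing models would keep current workflows running once na.omit is removed from Background, while still telling the user that rows were dropped.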

timcdlucas · 2018-05-23T09:13:34Z

Just to go back to this: the RemoveNAs module is now in the repo. I don't think Background has had its na.omit line removed.

goldingn added bug module labels Mar 6, 2018
