Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Option for NAs #1

Open
ddalevi opened this issue Aug 8, 2019 · 2 comments
Open

Option for NAs #1

ddalevi opened this issue Aug 8, 2019 · 2 comments

Comments

@ddalevi
Copy link

ddalevi commented Aug 8, 2019

Would be nice with something similar as you have for the standard cor-function where users can specify how they would like to deal with NA’s. I do agree with the comment "Missing values and cleaning data are critical to getting great correlations" but a function like this is very convinient when having a few NAs in some columns.

@mdancho84
Copy link
Collaborator

Let me look into this. I have had several students contact me about this, and I will consider.

@GitHunter0
Copy link

GitHunter0 commented Feb 12, 2021

For sure, in binarize(), just having an option to convert NAs into a separate bin would minimize this issue and is very easy to implement.

Otherwise, in the case of a numeric variable for example, you would have to drop the missing observations or convert numeric NAs into an arbitrary value (e.g. zero), which would both distort (statistical bias) the analysis in unpredictable ways.

Having the option to break numeric variables in bins by a specified criterion (frequency or interval length for e.g.) instead of number of bins would be useful too.

Despite that, overall another great package, so thanks @mdancho84

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants