-
Notifications
You must be signed in to change notification settings - Fork 185
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Imputation of missing values using ML models. #477
Comments
@levithatcher @taylorlarsen and @mmastand do you have any input on this? Otherwise we will submit a PR for its fix soon. |
We have had good luck using the following in our other work:
What do you think about these methods? We'd be grateful if you wanted to do a PR! |
I want to do PR on this finding and currently I am using below techniques:
Both of these methods work good on linear as well as non-linear type of data. |
Current package impute missing values using mean and median.
Now I have identified an approach to apply Machine Learning models for imputing the missing values:
Existing approach:
Impute missing values using Mean/Median
Drawback:
New Approach:
Impute missing values as per below:
Advantages:
So for the datasets having large number of missing values, this approach can improve the overall quality of data to be feeded to ML algorithms.
Thus perfomance of existing model can be improved using this imputation stratgey.
The text was updated successfully, but these errors were encountered: