information value calculation in case of zero number of y class #42

Open
giorgos-kyriakopoulos opened this issue Jan 7, 2020 · 2 comments
Labels
question Further information is requested

Comments

@giorgos-kyriakopoulos

Hi, the `iv_xy` function replaces 0 with 0.9 in order to calculate the information value. Would it be more appropriate to replace it with a value closer to 0, let's say 0.01?
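
To make the question concrete, the replacement could be treated as a parameter. The following is a minimal sketch, not the library's `iv_xy` code: the function name `information_value`, the `zero_replacement` parameter, and the choice to compute the class totals after the replacement are all assumptions made for illustration.

```python
import math


def information_value(good, bad, zero_replacement=0.9):
    """Information value over bins, given per-bin good and bad counts.

    Zero counts are replaced with `zero_replacement` (a hypothetical
    parameter) so that the logarithm below is always defined.
    """
    # Replace zero counts; assumption: totals are taken after replacement.
    good = [g if g != 0 else zero_replacement for g in good]
    bad = [b if b != 0 else zero_replacement for b in bad]
    total_good = sum(good)
    total_bad = sum(bad)

    iv = 0.0
    for g, b in zip(good, bad):
        dist_good = g / total_good  # share of all goods falling in this bin
        dist_bad = b / total_bad    # share of all bads falling in this bin
        iv += (dist_bad - dist_good) * math.log(dist_bad / dist_good)
    return iv


# The first bin has zero bad observations.
iv_default = information_value([60, 420, 500], [0, 8, 12])
iv_small = information_value([60, 420, 500], [0, 8, 12], zero_replacement=0.01)
print(iv_default, iv_small)
```

With a parameter like this, the effect of 0.9 versus 0.01 can be checked directly on any binning result.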

@ShichenXie
Owner

ShichenXie commented Jan 9, 2020

This replacement makes sure the IV can be calculated when a bin has zero y labels. Replacing with 0.1 or 0.9 would not affect the IV, since a zero count might not occur in a real data sample after binning. I didn't find any reference on this issue; if you have one, please post it here.
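
For context, the usual definition of information value over bins $i$ can be written as below. The symbols are notation chosen here, not taken from the library: $g_i$ and $b_i$ are the good and bad counts in bin $i$, and $G$ and $B$ are their totals.

$$
\mathrm{IV} = \sum_i \left( \frac{b_i}{B} - \frac{g_i}{G} \right) \ln\!\left( \frac{b_i / B}{g_i / G} \right)
$$

If $b_i = 0$ the logarithm is undefined, and if $g_i = 0$ the ratio inside it divides by zero, which is why a zero count has to be replaced with some positive value before the sum can be evaluated.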

@giorgos-kyriakopoulos
Author

If a binning process results in a bin with a small number of good-class observations and zero bad-class observations, then I suppose that replacing 0 with almost 1 (0.9) is different from replacing 0 with 0.01. What I mean is that in real cases where the default rate is very low (2%), making defaults appear (0.9) in bins where they do not exist (by converting 0 to 0.9 instead of 0.01) might lead to misconceptions.
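
The scenario described above can be tried out numerically. This is a hedged sketch on made-up counts (1,000 observations, 20 defaults, so a 2% default rate), using a hypothetical helper `iv_with_replacement` that is not part of the library and that assumes class totals are computed after the replacement.

```python
import math


def iv_with_replacement(good, bad, replacement):
    """Information value with zero counts swapped for `replacement`.

    Hypothetical helper for illustration; totals are taken after replacement.
    """
    good = [g if g != 0 else replacement for g in good]
    bad = [b if b != 0 else replacement for b in bad]
    total_good, total_bad = sum(good), sum(bad)
    return sum(
        (b / total_bad - g / total_good)
        * math.log((b / total_bad) / (g / total_good))
        for g, b in zip(good, bad)
    )


# Invented example data: three bins, the first one has no defaults.
good_counts = [60, 420, 500]
bad_counts = [0, 8, 12]

iv_09 = iv_with_replacement(good_counts, bad_counts, 0.9)
iv_001 = iv_with_replacement(good_counts, bad_counts, 0.01)
print(f"replacement 0.9:  IV = {iv_09:.4f}")
print(f"replacement 0.01: IV = {iv_001:.4f}")
```

On this invented data the IV with 0.01 comes out substantially larger than with 0.9, because almost all of it comes from the bin with no defaults. Whether that matters in practice depends on how often such bins survive the binning step.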

ShichenXie added the "question" (Further information is requested) label on May 22, 2023

2 participants