Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CFs with onehot encoded categorical variables #346

Open
nagainfosolutions opened this issue Nov 26, 2022 · 1 comment
Open

CFs with onehot encoded categorical variables #346

nagainfosolutions opened this issue Nov 26, 2022 · 1 comment

Comments

@nagainfosolutions
Copy link

Hello,

This isn't a bug, but more of a query on how Dice can work with data which is onehot encoded by our program.

There are several fields in our dataset that contain categorical data (eg: Gender, Country etc). Such fields are onehot encoded resulting in a change of number of columns and their names. Eg: field 'Gender' changes to => 'Gender_Male' and 'Gender_Female'

How can 'permitted ranges' and 'features_to_vary' attributes work with such data.

For instance permitted range for 'Country' field would be ['USA', 'France', 'Germany']. But after one-hot encoding the field 'Country' is removed. Instead we only have 'Country_USA' (with values 0.0 or 1.0), 'Country_France' and 'Country_Germany'.

Is there a workaround for this problem?

Thanks in advance.

@leoncena
Copy link

Hi,

Just out of curiosity (have the same problem):

Have you managed to find a solution?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

No branches or pull requests

2 participants